Media and community

Icon

Communication experiments in a global laboratory

Investigative reporting 3.0–or, Web stalking

The following is based on an Investigative Reporters and Editors seminar this weekend in Birmingham, Alabama.

IRE talked about how the Web--both the Surface ("visible") and Deep ("invisible") Webs--can help reporters address the occupational hazard of having to know everything about anything at any given moment.

The hour-long presentation, Effective use of the Internet, was fittingly framed by the first word in the title. Mark Horvit, IRE’s executive director, began by emphasizing that reporters should approach online research armed with a strategy (i.e., key words to search and a general idea of what’s available and desirable) to avoid getting distracted by the Web’s potentially cavernous detours. Step one, Horvit said, is not to log on, but to sketch out a plan.

Important for every investigative journalist to know about search engines is that a Google search, for example, does not look through the actual Internet, per se. It searches Google’s servers, which are stocked with information that the search engine company’s Web “crawlers” have found and stored.

What they’re missing – eye-opening stats:

  • Google searches far less than half of what’s out there
  • Total shared results of any two search engines: 8.9 percent
  • Any three search engines: 2.2 percent
  • Above figures from 2007 study by Dogpile, Penn State and Queensland University of Technology
  • Some estimate the “invisible” Web is 550 times bigger than the “visible” Web.
  • Google says more than 1,000 federal government sites can’t be crawled.

If (way) more than half the Web isn’t showing up in a search engine result, then it is important for investigative reporters to know where to go to find it. Here are some of the principles behind efficiently conducting those searches, with both superficial tools and subterraneous means.

Surface Web – Savvy searching tips:

  • Treat info online as one would any source (confirm)
  • Find out who owns the Web site
  • Know Google advanced search options (esp. domain and file type)
  • Archived Web: Gone doesn’t mean forever. (Google cache, Wayback Machine)
  • Consult at least two other search engines–each has its own strengths and weaknesses.
  • People finders (i.e., http://www.pipl.com, http://www.whitepages.com, etc.)
  • Social media searches (i.e., http://www.whostalkin.com…; Who’s Talkin’, not Who Stalkin’… or so they say)
  • Use Wikipedia for the footnotes only

The session then took Web searches to the next level… well, at least a step above what amateur voyeurs might use to get information.

Deep Web – Search like a pro:

  • Know what search engines typically miss (databases, content behind firewalls and registration screens, ASP/dynamically generated pages, Robo.txt excluded pages)
  • The information is out there, but the key is to find organizations that make is more easily accessible. Bookmark these!
  • Directories by and for journalists (‘Net Tour and Reporter’s Desktop)
  • Know the gateways to public records
  • Pipl actually claims to access the Deep Web. Try it. Pipl yourself. It’s scary how much information it digs up with just a name.
  • The census is your friend, especially in 2010
  • To get fully submerged… go to IRE’s Web site!

I’m not going to copy-paste in this post all of the useful links for discovering the “hidden Web” and the “dead Web,” which were hyper-linked in the PowerPoint presentation that Mark offered to send out to anybody at the day-long seminar who asked for it. All of this stuff is available at the organization’s site, and I can see what the nominal membership fees pay for, seriously.

Filed under: Investigative journalism, New media technology, , , , , , , , , , , , , , , , , ,

Twitter

Error: Twitter did not respond. Please wait a few minutes and refresh this page.

RSS NewsTurfs

  • Journalisted 10 May 2010
    Nieman lab discusses new site that gives readers info on journalist, so can assess cred, experience, etc. Possible end game – j builds following and revenue, hires staff, etc.?
    wilsonlowrey
  • Gatekeeping ecology 1 May 2010
    Further thoughts on the news ecology model — I just finished reading “Gatekeeping Theory” by Pam Shoemaker and Tim Vos, and they make a plea for models that push the five hierarchical levels of influence on media messages to include impact of history, or time. That’s just what I’m working on, so I was gratified [...]
    wilsonlowrey
  • Get your news in the novel 1 May 2010
    Listened to an insightful interview on Bob Edward’s Sunday show on PRI. Richard Nash, founder of “Cursor,” talked about the future of independent book publishing in a digital age. Many memorable comments, but the one that stuck with me concerned thinking of books as interactive communities, with a lead author and a host of contributors [...]
    wilsonlowrey

RSS Brett Bralley

  • instagram update 27 February 2013
    Brett Bralley Jaillet
  • a Saturday afternoon | Avondale Brewing Co. & Saw’s Soul Kitchen 25 February 2013
    There are a great many things Birmingham has to offer that let  you feel that you are enjoying life to the fullest. I’ve often found myself searching for just the right suggestions for out-of-towners searching for somewhere to go, somewhere to truly experience Birmingham. The city has several attractions: the Civil Rights Institute, the Vulcan, […]
    Brett Bralley Jaillet
  • on newsstands now: Celebrate Weddings! 21 February 2013
    Whether or not you’re planning a big day, you definitely should take a look at Celebrate‘s first ever weddings special issue! Our whole team worked hard on this. We love a challenge, and I think this issue shows it. You can pick this issue up on newsstands, or you can order it here! (Hey, it’s […]
    Brett Bralley Jaillet

RSS Journalista

  • Gmail’s Best April Fools Joke Yet 1 April 2011
    You know I’ve reallllllly got to be busy to have forgotten that today is April Fools Day. It is my favorite holiday. I used to do all kinds of things to my friends and family back in the day. Now, I couldn’t even remember that the best holiday of all is today. Well, thanks Gmail [...]
    klw09
  • Broadcastr: Listen Closely, Every Place Has a Story 21 March 2011
    I know I said I would post when I have time. I do not have time right now, but I wanted to post this before I forgot. Check out this article from TechCrunch on Broadcastr, a very cool tool that I think could be useful to journalists in our storytelling.
    klw09
  • I know, I know…it’s been a while. 12 March 2011
    I decided I’m going to continue using this (when I have time) to post all the cool social media and hyperlocal tools and information that I find because I come across some pretty incredible stuff. Plus, I like knowing about the hottest, newest things and then sharing these things. So here’s one. It’s called NowMov. [...]
    klw09

RSS Caitlin’s blog

  • IRE Conference – Paper Trails and Databases 26 January 2010
    This past Saturday, the Community Journalism fellows traveled to UAB’s campus in downtown Birmingham for a Watchdog Journalism Conference, hosted by the IRE (Investigative Reporters and Editors). Each of us was asked to report on one of the sessions throughout the day. When writing a story or beat that involves something in local government or [...]
    bonnec04
  • Media tools video 10 December 2009
    View my video made for Media tools class on the Allen & Jemison building in downtown Tuscaloosa
    bonnec04
  • Check out my website! 1 December 2009
    Allen & Jemison building mock website
    bonnec04

RSS Crimsonjackson

  • Be a Better Watchdog: Watch Your Time! 25 January 2010
    At IRE’s workshop this weekend, USA TODAY’s Alison Young helped school all of us about managing and juggling our time in this circus known as journalism.  I couldn’t have had a better topic to blog about this weekend.  Story assignments, … Continue reading →
    crimsonjackson
  • UA ROTC Video Project 10 December 2009
    Well it has been a blast hanging out with the young men and women of UA’s ROTC program.  Check out the final project: my video. After many early mornings (and late nights of editing) I present to you my finished … Continue reading →
    crimsonjackson
  • This is it: Where Investigative Journalism and Digital Media Collide 6 December 2009
    It is intriguing; however not shocking that investigative journalism has included digital media in its communication sphere. When one thinks of investigative journalism, he or she might consider the awe-inspiring and legendary cross generational focal-point of what we now consider … Continue reading →
    crimsonjackson

RSS Gaddy News

  • IRE Blog 26 January 2010
    What’s up everybody?  My area to blog about was the open records segment with speakers James Pewitt and John Archibald.  Both of their speeches focused more on Alabama open records statutes than FOIA.  However, Pewitt did provide a link that gives users an automatic draft of a FOIA request.  And that link is: www.rcfp.org He [...]
    sobergonzo
  • Here’s My Video Story 10 December 2009
    The Undead Take UA
    sobergonzo
  • Here’s My Dreamweaver Project 2 December 2009
    The Webpage
    sobergonzo

RSS Rachel’s blog

  • IRE Conference, January 23rd. It was freezing! 26 January 2010
    The weathermen lied to us. That’s all I have to say. On to the review! The IRE conference at the University of Alabama at Birmingham this past weekend was certainly eye opening, if nothing else. I made sure to take notes during the presentations to keep for future reference. Some of the stuff discussed, like [...]
    jnrbennett
  • Tuscaloosa Housing Market and Economy – Video 10 December 2009
    Hey all! Here’s my video for media production tools. http://www.youtube.com/watch?v=0Bp5b5C_SUE
    jnrbennett
  • Best front page news video ever?! 13 November 2009
    … Well, it’s up there, at least. Al.Com Features the Zelda Overworld Theme. Hah, I’m such a nerd. Anyway, since I’m here I might as well review a somewhat local news website, Al.com. This site hosts The Birmingham News, The Huntsville Times and the Mobile Press-Register. Combined, these three papers are the largest in the [...]
    jnrbennett

RSS Shea’s blog

  • IRE in Birmingham 27 January 2010
    The IRE workshop in Birmingham this past weekend was an extremely valuable assortment of useful information, tools to use and experiences shared from some of the best in the business. Overall, the conference was an amazing experience. The conference concluded with a wrap-up session given by the moderator of the conference from IRE, Mark Horvit, [...]
    sjzirlott
  • Youtube Link 10 December 2009
    sjzirlott
  • The Voice of America 21 November 2009
    VOAnews.com- The Voice of America This news source started out in the broadcast news format in 1942 and is funded by the United States government though the Broadcasting Board of Governors. According to their about us they broadcast “approximately 1500 hours of news, information, educational and cultural programming every week to an estimated worldwide audie […]
    sjzirlott
Follow

Get every new post delivered to your Inbox.