The focus of this section is advanced search techniques:
- What sort of information you want.
- Where information might be.
Improve your Google searches
Imagine what the page you are searching for, might look like.
1) Search operators
- Search with an exact phrase
- Quotation marks – crime statistics versus “crime statistics”
- Narrow your search
- A minus sign to exclude – “crime statistics” versus “crime statistics”-ONS
- Minus signs can be repeated to exclude more
- Broadening your search with a wild card
- An asterisk within an exact phrase – “crime statistics * 2013”
- What language
- Find what the jargon is in your field
- Pick up the telephone and ask
2) Site operator
- Searching within a site:
- site:gov.uk statistics will give UK government websites.
- site:police.uk will give police websites.
- Search within country extension
- site:nl will only give back results of sites in the Netherlands.
- site:nl will not give Dutch sites with a different domain such as .com or .org.
3) Filetype operator
- filetype:xls will give you results where the address ends in .xls – in other words, a Microsoft Excel spreadsheet.
- filetype:doc – Microsoft Word documents
- filetype:pdf – PDF's
4) Databases
- The contents of a database might not be visible to the search engine.
- Use the word database "search by" instead of filetype.
5) Advanced search operators
- Google’s advanced search
- Google Guide advanced search operaters page
- Combine different search operators
Manage your data
1) Social Bookmarking services
Proper tagging allows you to find your data fast.
2) Spreadsheets (14:17)
- Keep record of your sources in a master document (Google Docs, Microsoft Excel).
Key points of this section
How to search by filetype, site, and search operators, including:
- quotation marks “ “
- wild card *
- minus sign –
- site:
- filetype:
==
In the next section, we will look at scraping, and how to get difficult to reach data.
But first, I'd like to you to carry out some exercise questions. Please continue by clicking the 'Next' button below.
==
Reading list
- Bazzell, Michael (2013) Open Source Intelligence Techniques, CreateSpace
- Dick, Murray (2013) Search: Theory and Practice in Journalism Online
- National Security Agency: Untangling the Web: A Guide to Internet Research