canberrabirds

Canberra Bird Notes - new search facility

To: 'Julian Robinson' <>
Subject: Canberra Bird Notes - new search facility
From: Philip Veerman <>
Date: Mon, 5 Mar 2018 06:46:11 +0000

This is a very nice development and whilst I have no idea how much work was involved, thanks for doing it. I tried it and it looks good. I will point out also, although Julian’s note sort of indicates as much, it sources only individual issues in which a word was included regardless of articles or context. For example I searched on “Debus” and it finds only the mention of the article about the Little Eagle mentioning him (maybe because that is the first mention of the name, in that issue). However it does not mention him again for the several mentions of the name in my review of his book. Thus if you have in mind the book review, this search does not help a lot to locate that, unless you think that it was already listed for other reasons.  

 

I also noticed in a flip through, that there are blank pages included where I doubt the printed original was blank, although trying again I did not find these. Sorry that does not help a lot.

 

It is also worth pointing out that CBN did issue quite good printed indexes (indices) in blocks of 4 volumes, being V1 to 4, V 5 to 8, V 9 to 12, V 13 to 16 and V 17 to 20. (I did one of them.) These do give page references for species (in taxonomic sequence), to authors and to some general topics. These cover the years 1968 to 1995 but appear to not have been done since then. I don’t know whether these indices are also included in this electronic archive. Would be useful to do so.

 

Julian also wrote: The older editions of CBN were scanned from paper copies and used OCR (optical character recognition) to read the text.  I am curious as to what is the dividing line of “older”.

 

Philip

 

 

 

On 4 March 2018 at 22:14, Julian Robinson <> wrote:

It is now possible to search the entire CBN archive via a search box found at the top of the CBN website page – go to  http://canberrabirds.org.au/ > Publications > Canberra Bird Notes.

 

This is a beta version, meaning it is still being perfected.  To use it, enter your search term in the box and click the magnifying glass.  You will get a page of results, sometimes many results.  They don’t appear in any order so be guided by the date and/or the issue number (at the end of the green link). 

 

When you click one of the results you’ll download that whole issue of CBN.  Your search term is NOT highlighted in the resulting document so you need to search again within the document  – Control-f in windows to ‘find’ and insert the same search term.

 

It seems to work well but is completely reliant on the cleverness of your search words to avoid getting thousands of results or none at all.  Sometimes using apostrophes “like this” around your words helps by restricting to that exact phrase, but be aware that then it will not find plurals or other parts of speech.  You’d need to search for “Fuscous honeyeater” and again for “fuscous honeyeaters” to get them all.  On the other hand if you search for Fuscous honeyeater without quotes, it will find honeyeater or honeyeaters but may return items that just contain one of the words.  A technique to find ‘serious’ mentions of a particular bird is to use the scientific name – this generally works well without false alarms.

 

The older editions of CBN were scanned from paper copies and used OCR (optical character recognition) to read the text.  Errors in the OCR process (commonly where words are concatenated or misspelled) mean that sometimes you may miss an occurrence of something you are looking for.  Unfortunately this will be permanent limitation, though it seems to be quite rare.

 

You may be asked to prove you’re not a robot at various times, be prepared to do what it asks and then “submit”.

 

As mentioned this is a work in progress and not all issues of CBN have yet been indexed by google – at the time of writing 108 of 188 have been indexed. 

 

I would appreciate comments on any problems encountered, or any suggestions.

 

This has been a long time coming – hopefully it will prove to be useful.

 

Julian

Cog website

 

 

 

<Prev in Thread] Current Thread [Next in Thread>
Admin

The University of NSW School of Computer and Engineering takes no responsibility for the contents of this archive. It is purely a compilation of material sent by many people to the Canberra Ornithologists Group mailing list. It has not been checked for accuracy nor its content verified in any way. If you wish to get material removed from the archive or have other queries about the list contact David McDonald, list manager, phone (02) 6231 8904 or email . If you can not contact David McDonald e-mail Andrew Taylor at this address: andrewt@cse.unsw.EDU.AU