Archive for the ‘Searching’ Category

10
Apr

A note on automated transcription

   Posted by: Greg   in Searching

I was looking through my referrals this morning and came across a link from Podscope. The person had done a search for “technology and libraries” and Open Stacks #18 produced the only result. The apparent problem with this search is the presumption that this search tool operates in a Boolean manner. It does not.

Podscope is, at present, only capable of searching for words/phrases. It just so happens that I actually say “technology and libraries” in Open Stacks #18, so this searcher got a hit on their search. But I can’t imagine that they were actually looking for someone to explicitly say those words in that order. Maybe I’m wrong.

A quick look at the FAQ reveals the following:

My complex search isn’t returning any results. What’s going on?
Supporting complex boolean searches is tricky when dealing with audio files. We do plan on providing boolean search capabilities, but we want some community feedback on how it should behave.

I don’t see why Podscope would think Boolean is tricky. Once you’ve transcribed the audio, isn’t it like any other text? Podcast search cousin Podzinger seems to treat all multi-term searches with a Boolean AND, unless quotes are used, which is more in-line with searcher expectations.

But here’s something that’s genuinely tricky: getting accurate transcription results. Automated speech-to-text transcription technology is shockingly mediocre.

During my presentation at CiL, I played a sample Podscope result on the search term “libraries” and, wouldn’t you know it, no mention of “libraries” in the clip. Most folks in the audience were naturally expecting the word libraries and therefore totally missed the word “Blackberries” that was erroneously transcribed.

Podzinger shows keyword-in-context in their results, so you can get a feel for just how poor the technology is. A sample result for “libraries” reads as follows: “expensive business and that Contribution double life will be economy by libraries like good bet that there’s certainly”

All this to say that, while these automated speech-to-text technologies are interesting and fun, you have to understand their limitations before relying on them as a search tool. Added bonus tip: Until the technology improves, use fewer and shorter search terms/phrases to minimize the likelihood of transcription error masking the results you’re looking for.

7
Apr

PAC it in

   Posted by: Greg   in Searching

Meredith has two nifty screencasts that accompany her four seven-part series on “The Failure of Middleware” (link is to part 3): screencast 1 and screencast 2.

18
Feb

Yahoo Search

   Posted by: Greg   in Searching

Just did a quick search on the new Yahoo! search engine (thanks, MP) for “Open Stacks” and not only was I listed first, I was given the option to “View as XML” and “Add to My Yahoo!” Nice.

18
Feb

Copernic Meta

   Posted by: Greg   in Searching

Gary Price linked to the new Copernic Meta toolbar that brings the “add any search engine” functionality of Needlesearch (only for Mozilla - I mostly surf with Opera) to the IE environment. It took me about 30 seconds to add a “Search dmoz” function and a search for my library’s catalog, by simply typing “addengine” into the search box for the desired engine. So simple! Library Lookup never worked on my library’s catalog, but Copernic Meta made it seem so basic, I can hardly believe it took so long to bring it to the masses. Plus Copernic offers a nice starter list of searchable sites to broaden your search horizons. Despite my limited test time, this toolbar is very highly recommended for you to at least give a test drive.

24
Jun

The mother lode

   Posted by: Greg   in Searching

So on the suggestion of Search Engine Watch, I downloaded the HotBot Quick-Search Deskbar. Look out people, I think we have a winner.

First, this little workhorse opens through the right-click menu in your taskbar, so it doesn’t shrink your window size like plugin browsers. Pop a term into the search box and it searches one of the four main search engines (FAST, Google, Teoma, Inktomi) and makes it easy to switch between them. But that’s just the beginning. It has a (seriously) huge menu of reference applications that it can also drop your search term into.

Imagine that you’re new to XML (perish the thought). So you run a basic search with the engines, but you want a little more focus. With your term still in the search box, you peruse the menu hierarchy and see Webopedia. Selecting it drops your term right in to Webopedia and brings up the definition.

Plus there’s a handy shortcut syntax for each resource. Wanna see what Amazon has on XML? Type ‘amaz XML’ into the box. Bam! There are some issues with the shortcuts - if you wanted to search the term RSS, you will be taken to Feedster as rss is the shortcut. Nonetheless, this is astonishingly cool.

I can’t get over how much there is and how painless it is to use. Track stocks, track packages, hex conversions, currency conversions. It just goes on and on. Try the phone number speller. It takes your # and generates some amusing possibilities.

This is some serious good stuff and all from one application. Extremely impressive.

If you’ve ever wondered if dmoz and Google were somehow in cahoots, this article will provide some enlightenment: Google and Dmoz - Are They in Love?

As the article clarifies, getting listed in dmoz is a fairly reliable way to appear in a variety of search engine indexes. So bring it on people.