Transcribing Podcasts and Other Audio for Fun, Profit and SEO

October 31st, 2008, 15 Comments »

For one of our clients, we’re doing an informal series of audio interviews with tech gurus and web geeks about how they manage and back up their digital life. There are various marketing angles on this little project, but one is search engine optimization.

The first interview was with man-about-the-web Chris Pirillo. We talked for about 15 minutes over a dodgy Skype line. I had it transcribed by one of our contractors, and it turned into 2500 words of text.

That’s 2500 words of relevant text that’s reasonably rich with keywords. Assuming a going low-end rate of $25 per blog post, and 250 words a post, that’s $250 of text for the price of transcription and, all-in, about an hour of my time.

Exploring My Transcription Options

Yesterday I did the second interview in the series with the incomparable Vanessa Fox. I figured I could get the transcription done for less money, and I didn’t want to torture our contractor. I started shopping around for options. I also asked on Twitter, and these were the options that I came up with. The interview is 18 minutes, so I’ll include the pricing for that duration where available (all amounts, presumably, are in US dollars):

Then Avi reminded me of Andy Baio’s experiments in transcription with Amazon’s Mechanical Turk project. Andy has written up thorough instructions on how to outsource your transcription in chunks to workers in the system.

I decided to give Turk a try. It’s a little more work than just sending off the file to one of the aforementioned services, but I’ve always been charmed by Turk’s micro-task model. I divided the transcription into three six-minute chunks and posted the job. I offered $6 per chunk, or $1 per minute. I’m sure I could have gotten it done for less, but I’m looking for inexpensive, not cheap as humanly possible.

Mechanical Turk-Powered Transcription

It was mostly a success. The turn-around time was ridiculously quick. Despite my specifying a turnaround of a week being acceptable, I got all three transcriptions back in less than six hours. The quality seems excellent, though I don’t have a ton of experience with transcription to compare.

Despite my best efforts to ensure it didn’t happen, two workers transcribed the same section. Thus, I had to re-post a section to get complete results. And, of course, I have to paste the three chunks of text together and do some proofing. After Amazon’s fees, I paid $19.80 for 18 minutes of transcription.

Next time, I may just give CastingWords a try, as it would save me time and the difference in price is pretty marginal. There’s some interesting discussion on Andy’s blog about the ethics of pricing jobs on Turk. I was interested to learn that 75% of Turk workers are American, and that they have a variety of motivations for working on the service.

Posting Transcripts for Their SEO Benefits

I wondered on Twitter why more people don’t have their podcasts transcribed. The general response was that, for the hobbyist, it was too expensive. I guess it depends on if and how podcasters are monetizing their sites.

If you can get reasonably good transcription for $0.75 per minute, then you’ve got about 3700 words (my two transcriptions average out to 185 words a minute) to post on one or more pages for $15. Can you generate more than $15 from advertising on those pages over, say, two years? Maybe, maybe not, but it’s worth considering if you’re relying on advertising (and particular Google AdWords and the like) for revenue.

In any case, if they care one iota about SEO, a company has no excuses for not posting transcriptions of audio and video content. It’s not world-changing, but it’s another few steps in the marathon that is marketing.

UPDATE: Speaking of Mechanical Turk, Andy’s fruitful link blog points to the Turker’s Gospel, where Turk workers (’Turkers’) rewrite Matthew, Mark, Luke and John in their own words.

15 Comments »

Success for Boot and Blade

April 17th, 2008, 2 Comments »

Just a quick note to say thanks to everybody who linked to Julie’s figure skating blog. She’s now in the number one spot for the Google search ‘figure skating blog’. There’s obviously not a lot of stiff competition, because Brian, Richard and I all have results in the top ten (though that may just be Google favouring newer pages).

For all you aspiring adult figure skaters out there, you can learn how to centre a spin.

2 Comments »

Can SEO Get Julie to the Olympics?

April 6th, 2008, 12 Comments »

A couple of years ago my wife Julie started a figure skating blog. She’s got a particular purpose in mind for this project: she wants to get media accreditation for the skating events at the 2010 Olympics in Vancouver.

One way to do this is to write the most popular figure skating blog in the world, according to Google. She currently sits in the #2 spot.

The goal is to get her up to the top spot, and I need your help. If you’re so inclined, please link to Julie’s blog with the phrase ‘figure skating blog‘. Collectively, we ought to be able to boost her up to the top spot, eh?

To entice you, she’s collected eight of the worst falls in figure skating (Digg that mofo, if you’re thus inclined). It’s, uh, not for the squeamish. My favourite is the hip check that Laetitia Hubert lays on Midori Ito.

12 Comments »

Clean is Happy, and the Prospects for Video-Centric Websites

March 25th, 2008, 2 Comments »

Via Amber, I discovered the unusual design for Washlet.com, a site promoting a, uh, toilet. The site is more or less entirely comprised of little bits of video (note to designers: fix your title tags), discussing the pros and…well, just the pros, of their fancy new toilet.

The toilet-promoting monologues are hilarious because they’re so carefully worded. They never say the word ‘toilet’, and are constantly dancing around the actual nuts and bolts of our bodies and the device’s operations. I kind of feel for the poor actors–they seem so darned convinced of the Washlet’s awesomeness.

I’ve been watching the slow but steady rise of video-centric websites over the past few years. I actually don’t mind this approach at all. However, all of the information should be conveyed and easily available (and linkable) in text form. Why? Arguments for SEO aside, maybe I’m a verbal learner. Maybe I want to cut and paste a chunk to send to my bidet-obsessed friend. Maybe I want to blog about a particular product.

I actually prefer a hybrid model,with video naturally integrated into a text-based site. Coast Capital Savings and Steady Hand do this well.

I was reminded of an amusing Phillips campaign that I blogged about a while back, called Shave Everywhere. They’ve changed the content on that site, but the new stuff looks as good or better than the old.

2 Comments »

Riddle Me This SEO Mystery

January 24th, 2008, 3 Comments »

One of my posts on the popular (but crappy) email newsletter service Constant Contact used to be ranked #3 when you searched for “Constant Contact”.

Finding the site down today, I was reminded to do a search on the company again to see where my site now ranked. My post has been bumped down to ninth.

That’s fair enough, as timeliness should be an important part of search result ranking. However, in the top eight results, there are separate results for constantcontact.ca, constantcontact.org and addme.roving.com. All of these simply redirect to ConstantContact.com, and have few or no incoming links to them.

I don’t usually see that kind of behaviour in search results. I don’t think it’s particularly kosher that Google ranks redirecting alternative URLs so highly. What gives?

3 Comments »

Want an Invitation to Qassia?

January 22nd, 2008, 8 Comments »

I’ve been running this Knol blog (though, ironically, Google’s project may actually be called Unipedia) with an occasional post, and paying attention to the space. Today I encountered Qassia (link goes to my profile page, as the site is in barely-private beta), which seems to be a startup in the Squidoo, Wikia and Mahalo vein. From their FAQ:

Qassia is a site to which you can add your websites. You can also add your knowledge, in the form of tidbits of information called “intel”. The more intel you add, the better your sites will rank, the more backlinks you get, and the more money you make.

Qassia is 100 percent free, and does not require reciprocal links. You can get unlimited quality backlinks to your websites from Qassia.

Before you get too excited, it’s not real money. It’s Qassia dollars. Which, according to the FAQ again, you’ll be able to spend on “front-page advertising, site-wide links, and other novel ways for you to burn through your hard-earned Qassia dollars”. Er, wahoo.

I built a page, just to check out the editing interface. Like the rest of the site, it’s pretty unremarkable. Clearly it’s just another attempt at the user-generated content plus SEO equals profit equation. None of these sites, as far as I can figure, is a threat to Wikipedia. Google Knol (or whatever), however, may be.

In any case, if you want to check out Qassia, there’s a sign up link on my profile page. In the interests of full disclosure, I get some magical Qassia bucks if you sign up. Maybe I’ll spend them on a puppy. Oh, uh, never mind.

8 Comments »

The Hyper-Optimized Ebook Website

October 9th, 2007, 10 Comments »

I’ve been doing some reading about ebooks recently, and enjoyed a Copyblogger post about creating ebooks that sell. Brian references an ebook site featuring a book on, uh, writing ebooks.

When I visited the site, I was reminded of a phenomenon I’ve observed in recent years around ebooks and similar digital offerings. You can also see it on GoogleAdSecrets.com and the prolix URL SearchEngineOptimizationStrategies.com.

It’s a particular (and peculiar) kind of website. Really just one very long page, it features a single, centered column, few images, and many bold offers, claims and testimonials. It pretty much defies every major web design trend of this millennium. To the sophisticated web surfer, it looks profoundly tacky.

Clearly it must sell ebooks, though, or people wouldn’t use it. Did one person prove this was the optimal selling strategy, and everybody emulated them? I’d imagine so. The pages certainly don’t inspire confidence in me, but I guess they’re not selling to me.

10 Comments »

A Common Email Reply That I Keep On Hand

September 24th, 2007, 2 Comments »

Hi,
You may be unaware of this, but it’s inappropriate to post commercial messages as comments on a weblog without first contacting the weblog’s owner.

I am this weblog’s owner. With rare exceptions I don’t permit commercial messages in the comments associated with each article. So, I’ve deleted your comment. Next time, please contact me in the same way you would a journalist, to discuss the possibility of my writing about your project. Thanks. DB.

There’s a type of visitor–often but not always a small business owner–who randomly finds a page on my website and leaves a promotional message in the comments. The most recent example was a blurb to promote a new documentary on Tommy Douglas (posted to this page on my site).

By my definition (copped from Tim Bray)–unsolicited commercial email from people I don’t know–this qualifies as spam. However, it’s not like these people have a thousand zombified servers spitting out millions of blog comments. They’re not running spamming server farms from Abuja or Vladivostok. They’re just uninformed.

They probably read some SEO whitepaper that advised them to ‘leave messages in forums’ promoting their services, and that’s what they think they’re doing on my (and your) blog.

So, instead of identifying their comment as spam and ignoring it, I take the extra ten seconds to cut and paste this text into a reply and delete their comment. Most of them are highly apologetic when they get my message.

Do I ever accidentally reply to a real spammer? I don’t think so–it’s still pretty easy to tell the difference.

UPDATE: I tweaked the final sentence a bit to be a little more educational.

2 Comments »

Mahalo and How We Search

September 13th, 2007, 1 Comment »

Mahalo founder Jason Calacanis invited me to check out his new human-powered search engine. Specifically, I installed Mahalo Follow, a kind of ’search buddy’ Firefox plug-in that pops open a sidebar when it thinks Mahalo has some content relevant to the page I’m on (mostly search engine results, but I think it’s opened on other pages as well). That content tends to be a list of the ‘best’ links associated with the content. Here’s an example.

Obviously this service is straight-out-of-alpha, and needs to be populated with much more carefully-selected content. But I don’t think it’s for me.

Like you, my dear readers, I’m a pretty sophisticated searcher. I’ve been doing it a long time, I understand how the search engines work, and so I usually have good intuition about where (and more importantly whether) I’m going to find a particular piece of information.

As an exercise in thinking about how I search, I made a little list of search queries I ran the other day. This isn’t complete, but it’s a pretty representative sampling:

  1. How many units did the Sony Walkman sell?
  2. What is the URL for iLife on Apple’s site?
  3. What is the URL for Google AdWords?
  4. When did Malta achieve independence?
  5. Are there any Bill Callahan videos on YouTube?
  6. What nationality is KT Tunstall?
  7. Verify the correct spelling of ‘tchotchke’.
  8. Where’s the trailer for ‘Atonement’?
  9. What, if anything, do Dennis Leary and the BC Lions have in common?
  10. What century was the Great Siege of Malta in?
  11. What’s the URL for a Malta Times article I read in the paper?
  12. What does the BlackBerry Curve 8310 Smartphone look like?
  13. Where’s the Wikipedia entry for Geocities?
  14. How wide is Sicily?
  15. Who is playing Johanna in Tim Burton’s “Sweeney Todd”?

General Knowledge About Plasma TVs

My searches are really specific. Mahalo seems to want to help me out most with general information (by providing links) on a topic.

I can imagine that, if I was seeking some general knowledge about a suject, Mahalo might be a decent resource. If, say, I wanted to know more about plasma TVs. But it’s quite rare that I want that kind of generalized information. And when I do, Wikipedia rarely fails me.

Speaking of Wikipedia, Mahalo will live and die on user-generated content. It pays contributors US $10 to $15 per page of search results they create. I might give it a try, but that money isn’t worth my time. If Mahalo agreed to share the revenue generated from that page with me, then it might eventually become a better deal.

Besides, I’d much rather contribute to the emerging collective knowledge of humanity that is Wikipedia. Mahalo, after all, is just a company.

UPDATE: In writing this article, I’d meant to cite a recent post by Seth Godin, in which he touches on the problem I gather Mahalo is trying to solve:

The fact is that search engines are very good at fairly simple searches, and very good at finding information about single products, services, people and ideas.

But they’re terrible at connections, at rankings, at horizontal results. They can’t help me find the 25 most important up and coming artists in the United States. They can’t help me find six products that are viable alternatives to something that was just discontinued. They can’t help me rank the service of four accounting firms.

1 Comment »

How To Get Google to Show Your Site’s Navigation in Search Results

July 11th, 2007, 5 Comments »

A client recently asked me how they could get their site to show up like this in search results (that’s not my client, just an example I found):

SEO Shot

The answer is disappointingly vague. From Google’s help system:

The links shown below some sites in our search results, called Sitelinks, are meant to help users navigate your site. Our systems analyze the link structure of your site to find shortcuts that will save users time and allow them to quickly find the information they’re looking for.

We only show Sitelinks for results when we think they’ll be useful to the user. If the structure of your site doesn’t allow our algorithms to find good Sitelinks, or we don’t think that the Sitelinks for your site are relevant for the user’s query, we won’t show them.

At the moment, Sitelinks are completely automated. We’re always working to improve our Sitelinks algorithms, and we may incorporate webmaster input in the future.

I also found this discussion about Sitelinks. It indicates that they have nothing to do with your site’s popularity or traffic.

If it were more important, I’d go do an exhaustive analysis of sites with Sitelinks in the search results, and figure out what makes them tick. I believe the Sitelinks only show up when you search for the domain itself (here’s an alternative example for the screenshot above, without Sitelinks), so those searchers are very, very likely to click the first result, Sitelinks or not.

5 Comments »