<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Blogging and podcast start-up paradigms</title>
	<atom:link href="http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/feed/" rel="self" type="application/rss+xml" />
	<link>http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/</link>
	<description>The Virtual Investor</description>
	<lastBuildDate>Tue, 16 Mar 2010 23:25:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Arion</title>
		<link>http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/comment-page-1/#comment-267696</link>
		<dc:creator>Arion</dc:creator>
		<pubDate>Tue, 10 Jul 2007 06:56:38 +0000</pubDate>
		<guid isPermaLink="false">http://benmetcalfe.com/blog/index.php/2005/08/29/blogging-and-podcast-start-up-paradigms/#comment-267696</guid>
		<description>interesting</description>
		<content:encoded><![CDATA[<p>interesting</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frankie Roberto</title>
		<link>http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/comment-page-1/#comment-492</link>
		<dc:creator>Frankie Roberto</dc:creator>
		<pubDate>Wed, 31 Aug 2005 11:20:07 +0000</pubDate>
		<guid isPermaLink="false">http://benmetcalfe.com/blog/index.php/2005/08/29/blogging-and-podcast-start-up-paradigms/#comment-492</guid>
		<description>&lt;blockquote&gt;I have seen some very good demonstrations of speech-to-text technology within the BBC.&lt;/blockquote&gt;

Interesting, but I wonder what &#039;very good&#039; means. If it means &#039;we&#039;ve got this speech-to-text product and it&#039;s 90% accurate, so only 10% to go&#039;, then that&#039;s a bit of a fallacy, as getting the last 10% accurate is infinitely harder than the inital 90%.

As well as the practical problems I outlined above (in determining speech from non-speech content), there&#039;s also problems with homophones, where two words that are spelt differently but sound the same. You can try and use context to determine which word is more likely, but that&#039;s pretty tricky. I imagine also that many podcasters use fairly technical jargon which is difficult for a speech analyser to keep up with. Another problem is that in conversations/interviews, people often tend to speak over one another (&#039;overlap&#039;, in linguistic speak), which is difficult for a human to deciper, let alone a computer.

Clearly, these are all technical issues, but they&#039;re not &#039;just&#039; technical issues - they are deep problems which go to the heart of trying to create speech-to-text programs, and there are no easy solutions. If you&#039;ve ever used a voice recognition package, you&#039;ll know that it takes hours of training, requires you to speak unnaturally, and then still gets bits wrong.

&lt;blockquote&gt;For me that’s the same as saying “What’s the point of spidering websites when search engines could just rely on the description and keyword fields in a pages meta tags?”.&lt;/blockquote&gt;

I think that the crucial distinction here is that webpages are a text format, and so indexing them as text is a natural process, whereas podcasts originate as audio, and so to index them as text requires translating them into a different form, as well as doing the clever indexing stuff.

Yes, I agree that there is value in indexing speech-to-text conversions of podcasts, and perhaps if it&#039;s just so you can do keyword searches, you can live with a threshold of inaccuracies. But ultimately, I&#039;m guessing that some of more useful podcast-indexing work will come from looking at the show notes, folksonomies, link text to that show, and so on.

&lt;blockquote&gt;With it’s ubiquity and ease of search, text is generally a far better medium in general to communicate on the Internet than in audio (at least for now). Audio lends itself to entertainment, music and similar kinds of genres - along with roundtable discussions.&lt;/blockquote&gt;

I agree, and this is something content producers need to think carefully about. Audio-only is a great medium, with years of history (radio is fully-grown industry), and lots of potential, but it does only fit some kinds of content. We shouldn&#039;t be podcasting just because we can...</description>
		<content:encoded><![CDATA[<blockquote><p>I have seen some very good demonstrations of speech-to-text technology within the BBC.</p></blockquote>
<p>Interesting, but I wonder what &#8216;very good&#8217; means. If it means &#8216;we&#8217;ve got this speech-to-text product and it&#8217;s 90% accurate, so only 10% to go&#8217;, then that&#8217;s a bit of a fallacy, as getting the last 10% accurate is infinitely harder than the inital 90%.</p>
<p>As well as the practical problems I outlined above (in determining speech from non-speech content), there&#8217;s also problems with homophones, where two words that are spelt differently but sound the same. You can try and use context to determine which word is more likely, but that&#8217;s pretty tricky. I imagine also that many podcasters use fairly technical jargon which is difficult for a speech analyser to keep up with. Another problem is that in conversations/interviews, people often tend to speak over one another (&#8216;overlap&#8217;, in linguistic speak), which is difficult for a human to deciper, let alone a computer.</p>
<p>Clearly, these are all technical issues, but they&#8217;re not &#8216;just&#8217; technical issues &#8211; they are deep problems which go to the heart of trying to create speech-to-text programs, and there are no easy solutions. If you&#8217;ve ever used a voice recognition package, you&#8217;ll know that it takes hours of training, requires you to speak unnaturally, and then still gets bits wrong.</p>
<blockquote><p>For me that’s the same as saying “What’s the point of spidering websites when search engines could just rely on the description and keyword fields in a pages meta tags?”.</p></blockquote>
<p>I think that the crucial distinction here is that webpages are a text format, and so indexing them as text is a natural process, whereas podcasts originate as audio, and so to index them as text requires translating them into a different form, as well as doing the clever indexing stuff.</p>
<p>Yes, I agree that there is value in indexing speech-to-text conversions of podcasts, and perhaps if it&#8217;s just so you can do keyword searches, you can live with a threshold of inaccuracies. But ultimately, I&#8217;m guessing that some of more useful podcast-indexing work will come from looking at the show notes, folksonomies, link text to that show, and so on.</p>
<blockquote><p>With it’s ubiquity and ease of search, text is generally a far better medium in general to communicate on the Internet than in audio (at least for now). Audio lends itself to entertainment, music and similar kinds of genres &#8211; along with roundtable discussions.</p></blockquote>
<p>I agree, and this is something content producers need to think carefully about. Audio-only is a great medium, with years of history (radio is fully-grown industry), and lots of potential, but it does only fit some kinds of content. We shouldn&#8217;t be podcasting just because we can&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ben</title>
		<link>http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/comment-page-1/#comment-491</link>
		<dc:creator>Ben</dc:creator>
		<pubDate>Wed, 31 Aug 2005 10:06:51 +0000</pubDate>
		<guid isPermaLink="false">http://benmetcalfe.com/blog/index.php/2005/08/29/blogging-and-podcast-start-up-paradigms/#comment-491</guid>
		<description>&lt;em&gt;(in response to Frankie&#039;s comment)&lt;/em&gt;

&lt;blockquote&gt;You seem to assume that ’speech-to-text technology’ is readily available and good quality. Perhaps I’m not looking in the right places, but I’ve yet to see any good evidence of this.&lt;/blockquote&gt;

I have seen some very good demonstrations of speech-to-text technology within the BBC.  Unfortunately it&#039;s all under NDP so I wasn&#039;t able to elaborate any further in my post, nor can I here.  However do check the &lt;a href=&quot;http://audio.search.yahoo.com/&quot; rel=&quot;nofollow&quot;&gt;Yahoo! audio search beta&lt;/a&gt; - which I think is very good in terms of it&#039;s speech-to-text performance.

&lt;blockquote&gt;Is this problem not avoided by simply indexing the show notes, which should give a reasonable indication of contnet, or the full-transcripts provided by podcasters (or perhaps their readers, ala Livejournal)?&lt;/blockquote&gt;

For me that&#039;s the same as saying &quot;What&#039;s the point of spidering websites when search engines could just rely on the description and keyword fields in a pages meta tags?&quot;.  The successes of &quot;full spidering&quot; search engines like AltaVista and Excite, and obviously Google more recently, were down to their ability to pull keywords from &lt;strong&gt;any&lt;/strong&gt; part of the document.  For me, this user expectation continues into the &quot;full body&quot; of podcasts and other audio and video files.

&lt;blockquote&gt;There’s a school of thought which suggests that speech podcasts should be made available as full text anyway (accessibility, people without speakers, not to mention that it’s faster to read)…&lt;/blockquote&gt;

I agree with your accessibility concerns.  Rather than making transcripts of podcasts available, I do think there is validity in questioning why many podcasts are podcasts at all, and not just (written) blogs.

With it&#039;s ubiquity and ease of search, text is generally a far better medium in general to communicate on the Internet than in audio (at least for now).  Audio lends itself to entertainment, music and similar kinds of genres - along with roundtable discussions.  But sometimes I do wonder whether some podcasts which just consist of a monologue by the presenter would be more appropriate as a (typed) blog post instead.</description>
		<content:encoded><![CDATA[<p><em>(in response to Frankie&#8217;s comment)</em></p>
<blockquote><p>You seem to assume that ’speech-to-text technology’ is readily available and good quality. Perhaps I’m not looking in the right places, but I’ve yet to see any good evidence of this.</p></blockquote>
<p>I have seen some very good demonstrations of speech-to-text technology within the BBC.  Unfortunately it&#8217;s all under NDP so I wasn&#8217;t able to elaborate any further in my post, nor can I here.  However do check the <a href="http://audio.search.yahoo.com/" rel="nofollow">Yahoo! audio search beta</a> &#8211; which I think is very good in terms of it&#8217;s speech-to-text performance.</p>
<blockquote><p>Is this problem not avoided by simply indexing the show notes, which should give a reasonable indication of contnet, or the full-transcripts provided by podcasters (or perhaps their readers, ala Livejournal)?</p></blockquote>
<p>For me that&#8217;s the same as saying &#8220;What&#8217;s the point of spidering websites when search engines could just rely on the description and keyword fields in a pages meta tags?&#8221;.  The successes of &#8220;full spidering&#8221; search engines like AltaVista and Excite, and obviously Google more recently, were down to their ability to pull keywords from <strong>any</strong> part of the document.  For me, this user expectation continues into the &#8220;full body&#8221; of podcasts and other audio and video files.</p>
<blockquote><p>There’s a school of thought which suggests that speech podcasts should be made available as full text anyway (accessibility, people without speakers, not to mention that it’s faster to read)…</p></blockquote>
<p>I agree with your accessibility concerns.  Rather than making transcripts of podcasts available, I do think there is validity in questioning why many podcasts are podcasts at all, and not just (written) blogs.</p>
<p>With it&#8217;s ubiquity and ease of search, text is generally a far better medium in general to communicate on the Internet than in audio (at least for now).  Audio lends itself to entertainment, music and similar kinds of genres &#8211; along with roundtable discussions.  But sometimes I do wonder whether some podcasts which just consist of a monologue by the presenter would be more appropriate as a (typed) blog post instead.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frankie Roberto</title>
		<link>http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/comment-page-1/#comment-490</link>
		<dc:creator>Frankie Roberto</dc:creator>
		<pubDate>Wed, 31 Aug 2005 09:38:22 +0000</pubDate>
		<guid isPermaLink="false">http://benmetcalfe.com/blog/index.php/2005/08/29/blogging-and-podcast-start-up-paradigms/#comment-490</guid>
		<description>You seem to assume that &#039;speech-to-text technology&#039; is readily available and good quality. Perhaps I&#039;m not looking in the right places, but I&#039;ve yet to see any good evidence of this. Speech recognition is notoriously difficult, and when you throw in factors like sung lyrics, speech-over-music, poor bitrate quality, accents, regional spellings, multi-languages, and so on, it&#039;s impossible to get anything near 100% accuracy.

Is this problem not avoided by simply indexing the show notes, which should give a reasonable indication of contnet, or the full-transcripts provided by podcasters (or perhaps their readers, ala Livejournal)? 

There&#039;s a school of thought which suggests that speech podcasts should be made available as full text anyway (accessibility, people without speakers, not to mention that it&#039;s faster to read)...

Frankie</description>
		<content:encoded><![CDATA[<p>You seem to assume that &#8217;speech-to-text technology&#8217; is readily available and good quality. Perhaps I&#8217;m not looking in the right places, but I&#8217;ve yet to see any good evidence of this. Speech recognition is notoriously difficult, and when you throw in factors like sung lyrics, speech-over-music, poor bitrate quality, accents, regional spellings, multi-languages, and so on, it&#8217;s impossible to get anything near 100% accuracy.</p>
<p>Is this problem not avoided by simply indexing the show notes, which should give a reasonable indication of contnet, or the full-transcripts provided by podcasters (or perhaps their readers, ala Livejournal)? </p>
<p>There&#8217;s a school of thought which suggests that speech podcasts should be made available as full text anyway (accessibility, people without speakers, not to mention that it&#8217;s faster to read)&#8230;</p>
<p>Frankie</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sam Sethi</title>
		<link>http://benmetcalfe.com/blog/2005/08/blogging-and-podcast-start-up-paradigms/comment-page-1/#comment-485</link>
		<dc:creator>Sam Sethi</dc:creator>
		<pubDate>Tue, 30 Aug 2005 07:30:54 +0000</pubDate>
		<guid isPermaLink="false">http://benmetcalfe.com/blog/index.php/2005/08/29/blogging-and-podcast-start-up-paradigms/#comment-485</guid>
		<description>The podcasting first wave is waning.  Amatuer - one man and his cat - podcasters hoping to strike gold are seeing their listener numbers dwindle and most are giving up. Thank God! 

But sadly this means the iPod podcast chart is now looking more commerical - moyes, mayo, virgin radio etc. The better podcasts show like the Gillmore Gang, Pirillo Show etc which are not just repeats of mainstream broadcast radio programs are not appearing on the chart . So unless you know how to find them, people are not going to listen to them. Odeo is making a good go of it but very US-centric.   There is a good middle ground and I guess that is what you are hinting at?

Quality production is the key and I guess with your BBC connections your &quot;friends&quot; will have no problem there.  I personally use Audioblog.com to produce and serve my podcasts. 

As for the discovery Firefox with LiveBookmarks will probably add podcast support as they are looking to do so in the next version of ThunderBird and then there will be IE7.  

At the moment I use Attensa which is a good RSS/podcast client in Outlook but again still waiting on the ThunderBird update.  

But the key for me is the work going on around Attention.xml and how to discover pods/blogs that I&#039;m interested outside of the mainstream.</description>
		<content:encoded><![CDATA[<p>The podcasting first wave is waning.  Amatuer &#8211; one man and his cat &#8211; podcasters hoping to strike gold are seeing their listener numbers dwindle and most are giving up. Thank God! </p>
<p>But sadly this means the iPod podcast chart is now looking more commerical &#8211; moyes, mayo, virgin radio etc. The better podcasts show like the Gillmore Gang, Pirillo Show etc which are not just repeats of mainstream broadcast radio programs are not appearing on the chart . So unless you know how to find them, people are not going to listen to them. Odeo is making a good go of it but very US-centric.   There is a good middle ground and I guess that is what you are hinting at?</p>
<p>Quality production is the key and I guess with your BBC connections your &#8220;friends&#8221; will have no problem there.  I personally use Audioblog.com to produce and serve my podcasts. </p>
<p>As for the discovery Firefox with LiveBookmarks will probably add podcast support as they are looking to do so in the next version of ThunderBird and then there will be IE7.  </p>
<p>At the moment I use Attensa which is a good RSS/podcast client in Outlook but again still waiting on the ThunderBird update.  </p>
<p>But the key for me is the work going on around Attention.xml and how to discover pods/blogs that I&#8217;m interested outside of the mainstream.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
