<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>TextWise Blog &#187; semantic</title>
	<atom:link href="http://blog.textwise.com/tag/semantic/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.textwise.com</link>
	<description>A blog about the SemanticHacker API by TextWise</description>
	<lastBuildDate>Wed, 31 Aug 2011 18:50:52 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>TextWise and Innography® Form Partnership for Patent Semantic Search</title>
		<link>http://blog.textwise.com/2010/12/06/textwise-and-innography%c2%ae-form-partnership-for-patent-semantic-search/</link>
		<comments>http://blog.textwise.com/2010/12/06/textwise-and-innography%c2%ae-form-partnership-for-patent-semantic-search/#comments</comments>
		<pubDate>Mon, 06 Dec 2010 18:40:15 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[patent]]></category>
		<category><![CDATA[semantic]]></category>

		<guid isPermaLink="false">http://blog.textwise.com/?p=437</guid>
		<description><![CDATA[TextWise and Innography® have announced a strategic partnership to incorporate TextWise semantic search into Innography’s intellectual property business intelligence solution. The new functionality available with Innography® Fall ’10™ enables Innography customers to perform contextual semantic search using specific patent numbers or long blocks of text as the query. “TextWise facilitates near effortless querying for patent [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.textwise.com%2F2010%2F12%2F06%2Ftextwise-and-innography%25c2%25ae-form-partnership-for-patent-semantic-search%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.textwise.com%2F2010%2F12%2F06%2Ftextwise-and-innography%25c2%25ae-form-partnership-for-patent-semantic-search%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>TextWise and Innography® have announced a strategic partnership to incorporate TextWise semantic search into Innography’s intellectual property business intelligence solution. The new functionality available with Innography® Fall ’10™ enables  Innography customers to perform contextual  semantic search  using specific <a class="tw_contentlink" href="http://en.wikipedia.org/w/index.php?search=patent&#038;go=Go">patent</a> numbers or  long blocks of text as the query.</p>
<p>“TextWise facilitates near effortless querying for patent searchers and we’re pleased to be working with Innography as the leader in IP business intelligence,” says Connie Kenneally, President of TextWise. “The incorporation of our technology into Innography’s latest release mitigates the need to perform repetitive searches that rely solely on identifying keywords to generate relevant search results. In contrast, our semantic search performs well with very long queries and one does not have to find the perfect keyword combinations to get the most relevant results.”</p>
<p>A TextWise search is performed by incorporating the full context of either a paragraph, claim, abstract or any longer piece of text to generate more relevant matches to similar information contained in the US patent database. “TextWise takes an innovative approach to facilitating rapid identification of patents for monetizing IP assets when coupled with Innography’s intellectual property business intelligence solution,” said Doug Miller, Chief Marketing Officer at Innography. “Our customers have expressed great interest in patent semantic search, so we are very pleased to be offering this new functionality as an option with their Innography subscriptions.”</p>
<p>The target areas where this joint offering would be most advantageous are:<br />
•	Expediting products for clearance from infringing on other offerings<br />
•	Rapid idea screening for patenting<br />
•	Lead identification  for patent licensing, sales and  patent acquisitions<br />
•	NPE defense / invalidity protection</p>
<p>A demonstration of Innography Fall ’10™, including the TextWise patent semantic search feature, can be found at: <a href="http://innography.com/assets/files/fall10demo/fall10-demo.html">http://innography.com/assets/files/fall10demo/fall10-demo.html</a>.</p>
<p><strong>About Innography</strong><br />
Innography® delivers a comprehensive, online Intellectual Property Business Intelligence (IPBI) application that enables companies of all types and sizes to achieve the optimal return on their IP investments.  By correlating patent and trademark data with financial, litigation and other key business information, Innography instantly generates a variety of unique visualizations to help organizations reduce the time it takes to perform IP research and reduce associated legal expenditures.  This enables corporations to get products to market faster, uncover new and more lucrative revenue sources, keep better track of competitors, manage litigation claims, and stay on top of additional IP-associated functions.  Visit www.innography.com to view a brief online product demo or call 1.512.306.8688 for more information.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fblog.textwise.com%2F2010%2F12%2F06%2Ftextwise-and-innography%25c2%25ae-form-partnership-for-patent-semantic-search%2F';
  addthis_title  = 'TextWise+and+Innography%C2%AE+Form+Partnership+for+Patent+Semantic+Search';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://blog.textwise.com/2010/12/06/textwise-and-innography%c2%ae-form-partnership-for-patent-semantic-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning</title>
		<link>http://blog.textwise.com/2009/12/21/learning/</link>
		<comments>http://blog.textwise.com/2009/12/21/learning/#comments</comments>
		<pubDate>Mon, 21 Dec 2009 17:24:49 +0000</pubDate>
		<dc:creator>Clinton Mah</dc:creator>
				<category><![CDATA[semantics]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[learning]]></category>
		<category><![CDATA[meaning]]></category>
		<category><![CDATA[semantic]]></category>
		<category><![CDATA[vocabulary]]></category>

		<guid isPermaLink="false">http://blog.textwise.com/?p=147</guid>
		<description><![CDATA[Consider how we humans learn language. Even with formal education, it takes a child about 15 years starting from infancy to be able to read and understand general news articles in the New York Times. Over this period, one would probably hear or read at least on the order of 10 billion words. Even so, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.textwise.com%2F2009%2F12%2F21%2Flearning%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.textwise.com%2F2009%2F12%2F21%2Flearning%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>Consider how we humans learn language. Even with formal education, it takes a child about 15 years starting from infancy to be able to read and understand general news articles in the New York Times. Over this period, one would probably hear or read at least on the order of 10 billion words. Even so, most high schoolers will need many additional years of schooling to become able to comprehend technical material.</p>
<p>So, how can anyone expect a computer to understand something like medical text after training on only about 100 million words of data? A computer of course runs on nanosecond cycles while the human brain operates on millisecond cycles; but we have had about 50,000 generations to evolve our language software, while the electronic computer has had only about 10 generations.</p>
<p>The bottom line here is that language learning is difficult; and it requires sifting through immense amounts of data. There probably is no magic technological shortcut here, but we have reached now the stage where our systems can routinely handle the volumes of data that would support semantic capabilities equivalent to an 8th-grade education. Decent commercial language processing tools are also now available.</p>
<p>Consequently, we are making major progress on semantic dictionaries, but have to be realistic about the work still ahead of us. Expect no overnight miracles from us or anyone else, especially when these are based on measly samples of data. There is still no royal road to semantics.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fblog.textwise.com%2F2009%2F12%2F21%2Flearning%2F';
  addthis_title  = 'Learning';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://blog.textwise.com/2009/12/21/learning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Name of the Name</title>
		<link>http://blog.textwise.com/2009/08/05/the-name-of-the-name/</link>
		<comments>http://blog.textwise.com/2009/08/05/the-name-of-the-name/#comments</comments>
		<pubDate>Wed, 05 Aug 2009 14:04:03 +0000</pubDate>
		<dc:creator>Clinton Mah</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[dictionary]]></category>
		<category><![CDATA[lewis carroll]]></category>
		<category><![CDATA[meaning]]></category>
		<category><![CDATA[name]]></category>
		<category><![CDATA[semantic]]></category>

		<guid isPermaLink="false">http://blog.textwise.com/?p=60</guid>
		<description><![CDATA[In Chapter 8 of Lewis Carroll&#8217;s &#8220;Alice Through the Looking Glass,&#8221; our intrepid logical adventurer is talking to the White Knight, who wants to sing to her. He says, &#8220;The name of the song is called &#8216;HADDOCK&#8217;S EYES.&#8217;&#8221; It turns out of course that the name of the song is really &#8220;THE AGED AGED MAN,&#8221; [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.textwise.com%2F2009%2F08%2F05%2Fthe-name-of-the-name%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.textwise.com%2F2009%2F08%2F05%2Fthe-name-of-the-name%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>In Chapter 8 of Lewis Carroll&#8217;s &#8220;Alice Through the Looking Glass,&#8221; our intrepid logical adventurer is talking to the White Knight, who wants to sing to her. He says, &#8220;The name of the song is called &#8216;HADDOCK&#8217;S EYES.&#8217;&#8221;
<p>It turns out of course that the name of the song is really &#8220;THE AGED AGED MAN,&#8221; though the song is actually called &#8220;WAYS AND MEANS.&#8221; The confusion here about naming is quite understandable to anyone who has ever ordered TenderSweet™ clams at HoJo&#8217;s and discovered that they are neither tender nor sweet.
<p>All of this would be hilarious except that we have to build semantic dictionaries that must deal extensively with the meaning of names in text. This problem will take a while to talk about adequately; and so please tune in tomorrow.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fblog.textwise.com%2F2009%2F08%2F05%2Fthe-name-of-the-name%2F';
  addthis_title  = 'The+Name+of+the+Name';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://blog.textwise.com/2009/08/05/the-name-of-the-name/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ingredients</title>
		<link>http://blog.textwise.com/2009/07/27/ingredients/</link>
		<comments>http://blog.textwise.com/2009/07/27/ingredients/#comments</comments>
		<pubDate>Mon, 27 Jul 2009 12:12:26 +0000</pubDate>
		<dc:creator>Clinton Mah</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[association]]></category>
		<category><![CDATA[dictionary]]></category>
		<category><![CDATA[dimension]]></category>
		<category><![CDATA[sample]]></category>
		<category><![CDATA[semantic]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[term]]></category>
		<category><![CDATA[training]]></category>

		<guid isPermaLink="false">http://blog.textwise.com/?p=55</guid>
		<description><![CDATA[This posting will probably make the eyes of most people glaze over, but current and prospective users of our SemanticHacker API should really be informed consumers. So think of this as being like one of those federally mandated labels on your bottle of Red Bull. The ingredients of a semantic dictionary are a set of [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.textwise.com%2F2009%2F07%2F27%2Fingredients%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.textwise.com%2F2009%2F07%2F27%2Fingredients%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>This posting will probably make the eyes of most people glaze over, but current and prospective users of our SemanticHacker API should really be informed consumers. So think of this as being like one of those federally mandated labels on your bottle of Red Bull.
<p>The ingredients of a semantic dictionary are a set of hundreds of thousands of terms, a set of thousands of dimensions, and various numbers expressing the strength of association between a given term and a given dimension. Most of these associations will have zero strength, indicating that we have no information about them; but there will still be millions of non-zero numbers to provide a rigorous undergirding for statistical semantics.
<p>We build a semantic dictionary by obtaining large training samples of documents relevant to each of its dimensions. The strength of association is then estimated as being proportional to the relative frequency of occurrence in training documents for a term in a dimension versus in those for all other other dimensions. The process is actually more complicated than this, but the differences are just refinements of the overall scheme as described.
<p>Now we all understand what terms are (e.g. britney_spears, midfielder, rugelach, purple), but where do dimensions come from? The answer is that they are somewhat arbitrary. A dimension can be defined around any kind of category for which someone has provided requisite training documents. In many cases, we can find prior sets of categories to work from (ODP, USPTO), but we also can ourselves try to infer categories from some available pool of potential training data.
<p>However we proceed here, it is necessary that the resulting dimensions be pertinent to an application of interest, be independent of each other, be supported by adequate training data, and be associated with enough terms to support semantic analysis of target text. This all can be tricky to achieve, but if it were easy, everyone would be doing it.</p>
<script type="text/javascript">
  addthis_url    = 'http%3A%2F%2Fblog.textwise.com%2F2009%2F07%2F27%2Fingredients%2F';
  addthis_title  = 'Ingredients';
  addthis_pub    = '';
</script><script type="text/javascript" src="http://s7.addthis.com/js/addthis_widget.php?v=12" ></script>
]]></content:encoded>
			<wfw:commentRss>http://blog.textwise.com/2009/07/27/ingredients/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

