<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Gödel's Lost Letter and P=NP</title>
	<atom:link href="http://rjlipton.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://rjlipton.wordpress.com</link>
	<description>a personal view of the theory of computation</description>
	<lastBuildDate>Tue, 10 Nov 2009 12:49:49 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='rjlipton.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/378512e8408e8542c98a8704795f81ab?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Gödel's Lost Letter and P=NP</title>
		<link>http://rjlipton.wordpress.com</link>
	</image>
			<item>
		<title>Rumors and Playing Games</title>
		<link>http://rjlipton.wordpress.com/2009/11/08/rumors-and-playing-games/</link>
		<comments>http://rjlipton.wordpress.com/2009/11/08/rumors-and-playing-games/#comments</comments>
		<pubDate>Mon, 09 Nov 2009 04:27:12 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[approximation]]></category>
		<category><![CDATA[games]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Nash]]></category>
		<category><![CDATA[polynomial]]></category>
		<category><![CDATA[Problems]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3833</guid>
		<description><![CDATA[
 A rumor from FOCS on approximate Nash Equilibrium is partially true  



Paul Spirakis is a senior researcher who has made many important contributions to theory. He has hundreds of publications that cover many areas of theory. What is so impressive about Paul is that he has been able to blend theory and practice [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3833&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em> A rumor from FOCS on approximate Nash Equilibrium is partially true  </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/11/images1.jpeg?w=74&#038;h=100" alt="images" title="images" width="74" height="100" class="alignright size-full wp-image-3835" /></p>
<p>
Paul Spirakis is a senior researcher who has made many important contributions to theory. He has hundreds of publications that cover many areas of theory. What is so impressive about Paul is that he has been able to blend theory and practice in a very fruitful way. This is a pretty unique skill that few have.</p>
<p>
Today I want to talk about a new result of his on finding approximate Nash Equilibrium for non-zero sum games. </p>
<p>
The coolest rumor I heard at this year&#8217;s FOCS was that &#8220;someone&#8221; had proved a sub-exponential bound on the running time of an algorithm for approximate Nash Equilibrium. This was quite exciting and it is what I refer to in my discussion of <a href="http://rjlipton.wordpress.com/2009/10/27/highlights-of-focs-theory-day/">Theory Day</a>. I eventually tracked the rumor down and discovered the source was the pretty paper that I plan to discuss today. The paper does not quite have such a result, but definitely makes an important step in our understanding of games.</p>
<p>
The rumor that I heard at FOCS turned out to be wrong. Oh well. That&#8217;s the nature of rumors: by definition a rumor is not always true; otherwise, it would not be a rumor. At least this rumor was partially true. </p>
<p>
Years ago, in 1969, there was a great rumor that started right after Steve Cook proved his famous <a href="http://portal.acm.org/citation.cfm?id=805437">theorem</a> that characterizes polynomial time. He proved: </p>
<blockquote><p><b>Theorem: </b> <em> The following three conditions are equivalent for any language <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' />: </p>
<ol>
<li> <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> is accepted by some deterministic auxiliary pushdown machine with logspace storage.
<li> <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> is accepted by some <b>non</b>-deterministic auxiliary pushdown machine with logspace storage.
<li> <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> is in P.
</ol>
<p> </em></p></blockquote>
<p>This is a beautiful result, which in my opinion was not expected. An auxiliary pushdown automata with auxiliary logspace storage is a machine that can read the input two-ways, and store information on a regular Turing tape of length at most <img src='http://l.wordpress.com/latex.php?latex=%7BO%28%5Clog+n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(\log n)}' title='{O(\log n)}' class='latex' />. In addition, the machine has a pushdown, which it can use also to store information. </p>
<p>
Note, a pushdown only allows operations on the top: the top symbol can be read, a symbol can be pushed onto the pushdown, or a symbol can be removed from the top. That&#8217;s all. Symbols below the top symbol cannot be read without popping off the top symbol. </p>
<p>
Almost immediately someone well known, I will call them X, claimed that Steve&#8217;s result could be extended &#8220;slightly.&#8221; In particular, X claimed that the same theorem could be proved with pushdown replaced by stack. A stack is <i>not</i> the same as a pushdown. It has one important extra property: a stack allows the reading of the symbols in the pushdown without popping off any symbols. A stack only allows this in a read-only mode. That is when inside the pushdown, symbols can only be read. Symbols can be changed at the top as usual. </p>
<p>
We could not figure out how the new proof went, and worse X went on a camping vacation. So he was unreachable&#8212;this is well before cell phones. The theory community was all abuzz, since it was believed that an auxiliary stack machine could accept more than P. It would have been a great result. Clearly, X did not see this, or he would have been more circumspect about his claim. </p>
<p>
Finally, X returned and was asked how his proof went. He explained the proof, and it was immediately clear that the proof did not work. He had forgotten one small point: a stack machine could run for <em>more</em> than exponential time unlike a pushdown machine. [I fixed an error here.] He had to retract his claim. Sometimes rumors are false. </p>
<p>
Now let&#8217;s turn to non-zero sum games and a new approach to getting approximate Nash Equilibrium. </p>
<p>
<p><b> Win-Lose Games </b></p>
<p><p>
The <a href="http://adsabs.harvard.edu/abs/2009arXiv0909.4686T">paper</a> of Haralampos Tsaknakis, and Paul Spirakis is mostly a new approach to finding approximate Nash Equilibrium (NE) for general two player game through a reduction to a restricted class of games.</p>
<p>
This class is special, but still quite interesting. It is the class of <i>symmetric win-lose</i> games. Such a game has one <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />-<img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> matrix <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />: the matrix <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is the payoff for the row player and <img src='http://l.wordpress.com/latex.php?latex=%7BA%5E%7BT%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A^{T}}' title='{A^{T}}' class='latex' /> is the payoff for the column player. </p>
<p>
The restriction to <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />-<img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> values is why it is called a &#8220;win-lose&#8221; game, and the fact that one player uses the transpose of the other player&#8217;s matrix is why it is called symmetric. Finding a Nash Equilibrium in even win-lose games is as hard as the general case thanks to the beautiful <a href="http://web.mit.edu/tabbott/www/papers/FOCS05.pdf">result</a> of Tim Abbott, Daniel Kane, and Paul Valiant. </p>
<p>
Their proof is quite clever. As someone who has worked a bit on games, I was impressed that they could prove this result. In many complexity situations the <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />-<img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> case is universal; however, proving that for games is not so easy. My intuition was that this, while probably true, would be very hard to prove. </p>
<p>
<p><b> Approximate Solutions </b></p>
<p><p>
Since the problem of finding exact Nash Equilibrium for even win-lose games is as hard as solving the general case, it is natural to look for approximate NE&#8217;s. This is exactly what Haralampos and Paul do in their paper. They compare their new result with a result of Evangelos Markakis, Aranyak Mehta, and myself, which we proved in a earlier <a href="http://www.cs.toronto.edu/~vangelis/research/ec03-approx-nash.ps">paper</a>:</p>
<blockquote><p><b>Theorem: </b> <em> For any <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cepsilon%3E0%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\epsilon&gt;0}' title='{\epsilon&gt;0}' class='latex' /> there is an algorithm that finds an <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cepsilon%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\epsilon}' title='{\epsilon}' class='latex' /> approximate Nash equilibrium for a symmetric win-lose game of size <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />, in running time bounded by <img src='http://l.wordpress.com/latex.php?latex=%7B+n%5El+%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{ n^l }' title='{ n^l }' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%7Bl+%3D+O%28%5Clog+n+%2F+%5Cepsilon%5E%7B2%7D%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{l = O(\log n / \epsilon^{2})}' title='{l = O(\log n / \epsilon^{2})}' class='latex' />. </em></p></blockquote>
<p><p>
The main result of Paul&#8217;s paper is:</p>
<blockquote><p><b>Theorem: </b> <em> For any <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cepsilon%3E0%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\epsilon&gt;0}' title='{\epsilon&gt;0}' class='latex' /> there is an algorithm that finds an <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cepsilon%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\epsilon}' title='{\epsilon}' class='latex' /> approximate Nash equilibrium for a symmetric win-lose game of size <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />, in running time bounded by <img src='http://l.wordpress.com/latex.php?latex=%7B+n%5El+%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{ n^l }' title='{ n^l }' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%7Bl+%3D+O%28m+%2F+n%5Cepsilon%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{l = O(m / n\epsilon)}' title='{l = O(m / n\epsilon)}' class='latex' />. The value of <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> is equal to:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Csum_%7B%5Clambda_%7Bk%7D+%3E+0%7D+%5Clambda_%7Bk%7D+&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  \sum_{\lambda_{k} &gt; 0} \lambda_{k} ' title='\displaystyle  \sum_{\lambda_{k} &gt; 0} \lambda_{k} ' class='latex' /></p>
<p> where <img src='http://l.wordpress.com/latex.php?latex=%7B%5Clambda_%7B1%7D%2C+%5Cdots%2C+%5Clambda_%7Bn%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\lambda_{1}, \dots, \lambda_{n}}' title='{\lambda_{1}, \dots, \lambda_{n}}' class='latex' /> are the eigenvalues of the graph induced by the game matrix. </em></p></blockquote>
<p><p>
This result is neat, in my opinion, since it addresses the barrier discovered by Constantinos Daskalakis and Christos Papadimitriou in their <a href="http://portal.acm.org/citation.cfm?id=1536414.1536427">paper</a> titled &#8220;On oblivious PTAS&#8217;s for Nash Equilibrium.&#8221; Constantinos and Christos show essentially that to get good approximations for NE&#8217;s, one must look at the structure of the payoff matrices: it is not enough to just examine the strategies as we did in our result. </p>
<p>
<p><b> Sparse Games </b></p>
<p><p>
I am currently running an open problem seminar and we discussed Paul&#8217;s paper in a recent class. We noticed first that the value of <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> in his theorem could be changed from
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Csum_%7B%5Clambda_%7Bk%7D+%3E+0%7D+%5Clambda_%7Bk%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \sum_{\lambda_{k} &gt; 0} \lambda_{k} ' title='\displaystyle  \sum_{\lambda_{k} &gt; 0} \lambda_{k} ' class='latex' /></p>
<p> to
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Csum_%7Bk%3D1%7D%5E%7Bn%7D+%7C+%5Clambda_%7Bk%7D+%7C.+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \sum_{k=1}^{n} | \lambda_{k} |. ' title='\displaystyle  \sum_{k=1}^{n} | \lambda_{k} |. ' class='latex' /></p>
<p> This follows since the sum of eigenvalues is <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />, so the new measure is exactly twice the old, and this will have no effect on the results&#8212;all can be swept under the constant in the big-O notation. </p>
<p>
Then, we considered sparse games: games where the induced graph has <img src='http://l.wordpress.com/latex.php?latex=%7BO%28n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(n)}' title='{O(n)}' class='latex' /> edges. For such games the value of the critical parameter <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> is bounded by <img src='http://l.wordpress.com/latex.php?latex=%7BO%28n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(n)}' title='{O(n)}' class='latex' />. The proof of this is quite simple:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cbegin%7Barray%7D%7Brcl%7D++%09m+%26%3D%26+%5Csum_%7B%5Clambda+%5Cle+1%7D+%7C+%5Clambda_%7Bk%7D+%7C+%2B+%5Csum_%7B%5Clambda+%3E+1%7D+%7C+%5Clambda_%7Bk%7D+%7C+%5C%5C+%09%09%26%5Cle%26+n+%2B+%5Csum_%7B%5Clambda+%3E+1%7D+%5Clambda_%7Bk%7D%5E%7B2%7D+%5C%5C+%09%09%26%5Cle%26+n+%2B+%5Csum_%7Bk%3D1%7D%5E%7Bn%7D+d_%7Bk%7D+%5Cend%7Barray%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{array}{rcl}  	m &amp;=&amp; \sum_{\lambda \le 1} | \lambda_{k} | + \sum_{\lambda &gt; 1} | \lambda_{k} | \\ 		&amp;\le&amp; n + \sum_{\lambda &gt; 1} \lambda_{k}^{2} \\ 		&amp;\le&amp; n + \sum_{k=1}^{n} d_{k} \end{array} ' title='\displaystyle  \begin{array}{rcl}  	m &amp;=&amp; \sum_{\lambda \le 1} | \lambda_{k} | + \sum_{\lambda &gt; 1} | \lambda_{k} | \\ 		&amp;\le&amp; n + \sum_{\lambda &gt; 1} \lambda_{k}^{2} \\ 		&amp;\le&amp; n + \sum_{k=1}^{n} d_{k} \end{array} ' class='latex' /></p>
<p> where <img src='http://l.wordpress.com/latex.php?latex=%7Bd_%7Bk%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{d_{k}}' title='{d_{k}}' class='latex' /> is the degree of the <img src='http://l.wordpress.com/latex.php?latex=%7Bk%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k^{th}}' title='{k^{th}}' class='latex' /> vertex. This last quantity is easily seen to be at most <img src='http://l.wordpress.com/latex.php?latex=%7BO%28n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(n)}' title='{O(n)}' class='latex' /> if the graph is sparse. </p>
<p>
Based on these simple insights my students Atish Das Sarma, Subrahmanyam Kalyanasundaram, and Shiva Kintali believe that they can prove the following theorem: </p>
<blockquote><p><b>&#8220;Theorem&#8221;:</b> <em> There is a PTAS for finding approximate NE&#8217;s for the class of symmetric win-lose games whose induced graph has bounded degree. </em></p></blockquote>
<p><p>
Unfortunately, the proof of this, while it looks like it will work, does not seem to be able handle sparse graphs. The reason is that a sparse graph could fail to have some technical properties that Paul&#8217;s theorem requires. We are hopeful that this can be fixed.</p>
<p>
For random sparse symmetric win-lose games they do believe that it should be possible to prove the following theorem: </p>
<blockquote><p><b>&#8220;Theorem&#8221;: </b> <em> There is a PTAS for finding approximate NE&#8217;s for the class of random sparse win-lose games that works with high probability. </em></p></blockquote>
<p><p>
There has already been some nice <a href="http://www.math.mcgill.ca/~vetta/Research.dir/lvgames.ps">work</a> on random games by Imre Bárány, Santosh Vempala and Adrian Vetta. Their result works for dense games where the values are Gaussian, while we are looking at sparse games with <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />-<img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> values.</p>
<p>
<p><b> Open Problems </b></p>
<p><p>
The major open problem is: is there an algorithm that runs in time <img src='http://l.wordpress.com/latex.php?latex=%7Bn%5E%7BO%281%2F%5Cepsilon%29%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n^{O(1/\epsilon)}}' title='{n^{O(1/\epsilon)}}' class='latex' /> for any non-zero sum game and finds an <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\epsilon}' title='{\epsilon}' class='latex' /> approximate Nash equilibrium? Can one even prove this for a special class of games?</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3833/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3833/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3833/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3833/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3833/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3833/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3833/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3833/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3833/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3833/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3833&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/11/08/rumors-and-playing-games/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/11/images1.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>
	</item>
		<item>
		<title>On Mathematical Diseases</title>
		<link>http://rjlipton.wordpress.com/2009/11/04/on-mathematical-diseases/</link>
		<comments>http://rjlipton.wordpress.com/2009/11/04/on-mathematical-diseases/#comments</comments>
		<pubDate>Wed, 04 Nov 2009 23:29:49 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[P=NP]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[crypto]]></category>
		<category><![CDATA[diseases]]></category>
		<category><![CDATA[graph isomorphism]]></category>
		<category><![CDATA[group isomorphism]]></category>
		<category><![CDATA[Harary]]></category>
		<category><![CDATA[jacobian conjecture]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3824</guid>
		<description><![CDATA[
 Mathematical diseases: symptoms and examples 



Underwood Dudley is a number theorist, who is perhaps best known for his popular books on mathematics. The most famous one is A Budget of Trisections, which studies the many failed attempts at the ancient problem of trisecting an angle with only a ruler and a compass. This problem [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3824&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em> Mathematical diseases: symptoms and examples </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/11/images.jpeg?w=86&#038;h=98" alt="images" title="images" width="86" height="98" class="alignright size-full wp-image-3825" /></p>
<p>
Underwood Dudley is a number theorist, who is perhaps best known for his popular books on mathematics. The most famous one is <a href="http://www.abebooks.com/servlet/SearchResults?isbn=0387965688">A Budget of Trisections</a>, which studies the many failed attempts at the ancient problem of trisecting an angle with only a ruler and a compass. This problem is <a href="http://www.jimloy.com/geometry/trisect.htm">impossible</a>, yet that has not stopped some people from working day and night looking for a solution. Trying to find such a solution is an obsession for some; it&#8217;s almost like they have a malady that forces them to work on the problem. </p>
<p>
Today I plan on talking about other mathematical obsessions. They are like diseases that affect some, and make them feel they <i>have</i> to work on certain mathematical problems. Perhaps P=NP is one?<br />
<span id="more-3824"></span></p>
<p>
Dudley&#8217;s book is quite funny, in my opinion, although it does border on being a little bit unkind. As the title suggests, in &#8220;A Budget of Trisections,&#8221; he presents one attempt after another at a general method for trisecting any angle. For most he points out that when the angle is equal to some value what the exact error is. For others he adds a comment like:  	</p>
<blockquote><p>
This construction almost worked, if only the points <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7BB%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{B}' title='{B}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7BC%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{C}' title='{C}' class='latex' /> <i>had</i> really been co-linear it would have worked. Perhaps the author could move <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cdots%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\dots}' title='{\dots}' class='latex' />
</p></blockquote>
<p>His book is about the kind of mathematical problems that I will discuss today: problems that act almost like a real disease.</p>
<p>
I cannot resist a quote from Underwood that attacks bloggers. Note he uses &#8220;he&#8221; to refer to himself in this quote:  </p>
<blockquote><p>
He has done quite a bit of editing in his time&#8211;the College Mathematics Journal for five years, the Pi Mu Epsilon Journal for three, the Dolciani Mathematical Expositions book series (six years), and the New Mathematical Library book series (three years). As a result he has a complete grasp of the distinction between &#8220;that&#8221; and &#8220;which&#8221; (very rare) and the conviction that no writing, including this, should appear before the public before passing through the hands, eyes, and brain of an editor. <b>Take that, bloggers!</b>
</p></blockquote>
<p> (Bold added by me.) </p>
<p>
Oh well.</p>
<p>
<p><b> What Is a Mathematical Disease? </b></p>
<p><p>
This is the flu season in Atlanta, and many are getting it. I hope you either miss the bug, or if you are unfortunate enough to get it, get a mild case. </p>
<p>
There is another type of &#8220;bug&#8221; that affects mathematicians&#8212;the attempt to solve certain problems. These problems have been called &#8220;diseases,&#8221; which is a term coined by the great graph theorist Frank Harary. They include many famous problems from graph theory, some from algebra, some from number theory, some from complexity theory, and so on. </p>
<p>
The symptoms of the flu are well known&#8212;I hope again you stay away from fever, chills, and the aches&#8212;but the symptoms for a mathematical disease (MD) are less well established. There are some signs however that a problem is a MD. </p>
<ol>
<li> A problem must be easy to state to be a MD. This is not sufficient, but is required. Thus, the <a href="http://en.wikipedia.org/wiki/Hodge_conjecture">Hodge-Conjecture</a> will never be a disease. I have no clue what it is about.
<li> A problem must seem to be accessible, even to an amateur. This is a key requirement. When you first hear the problem your reaction should be: <i>that is open?</i> The problem must <i>seem</i> to be easy.
<li> A problem must also have been repeatedly &#8220;solved&#8221; to be a true MD. A good MD usually has been &#8220;proved&#8221; many times&#8212;often by the same person. If you see a paper in arXiv.org with many &#8220;updates&#8221; that&#8217;s a good sign that the problem is a MD.
</ol>
<p>
Unlike real diseases, MD&#8217;s have no known cure. Even the solution of the problem will not stop attempts by some to continue working on it. If the proof shows that something is impossible&#8212;like the situation with angle trisection&#8212;those with the MD will often still work hard on trying to get around the proof. Even when there is a fine proof, those with the disease may continue trying to find a simple proof. For example, Andrew Wiles&#8217; proof of Fermat&#8217;s Last Theorem has not stopped some from trying to find Pierre de Fermat&#8217;s &#8220;the truly marvellous proof.&#8221;</p>
<p>
<p><b> Some Mathematical Diseases </b></p>
<p><p>
Here are some of the best known MD&#8217;s along with a couple of lesser known ones. I would like to hear from you with additional suggestions. As I stated earlier Harary was probably the first to call certain problems MD&#8217;s. His original <a href="http://www.math.ilstu.edu/cve/speakers/Rosa-CVE-Talk.pdf">list</a> was restricted to graph problems, however. </p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <b>Graph Isomorphism:</b> This is the classic question of whether or not there is a polynomial time algorithm that can tell if two graphs are isomorphic. The problem seems so easy, but it has resisted all attempts so far. I admit to being mildly infected by this MD: in the 1970&#8217;s I worked on GI for special classes of graphs using a method I called the <a href="http://www.cs.yale.edu/publications/techreports/tr135.pdf">beacon set method</a>. </p>
<p>
There are some beautiful partial results: for example, the work of László Babai, Yu Grigoryev, and David Mount on the case where the graphs have bounded multiplicity of eigenvalues is one of my favorites. Also the <a href="http://ix.cs.uoregon.edu/~luks/iso.pdf">solution</a> by Eugene Luks of the bounded degree case is one of the major milestones. </p>
<p>
I would like to raise one question that I believe is open: Is there a polynomial time algorithm for the GI problem for <i>expander graphs</i>? I asked several people at the recent Theory Day and no one seem to know the answer. Perhaps you do.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <b>Group Isomorphism:</b> This problem is not as well known as the GI problem. The question is given two finite groups of size <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> are they isomorphic? The key is that the groups are presented by their multiplication tables. The best known result is that isomorphism can be done in time <img src='http://l.wordpress.com/latex.php?latex=%7Bn%5E%7B%5Clog+n+%2BO%281%29%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n^{\log n +O(1)}}' title='{n^{\log n +O(1)}}' class='latex' />. This result is due to Zeke Zalcstein and myself and independently Bob Tarjan. It is quite a simple observation based on the fact that groups always have generator sets of cardinality at most <img src='http://l.wordpress.com/latex.php?latex=%7B%5Clog+n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log n}' title='{\log n}' class='latex' />. </p>
<p>
I have been affected with this MD for decades. Like some kind of real diseases I get &#8220;bouts&#8221; where I think that I have a new idea, and I then work hard on the problem. It seems so easy, but is also like GI&#8212;very elusive. I would be personally excited by any improvement over the above bound. Note, the hard case seems to be the problem of deciding isomorphism for <img src='http://l.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' />-groups. If you can make progress on such groups, I believe that the general case might yield. In any event <img src='http://l.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' />-groups seem to be quite hard.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <b>Graph Reconstruction:</b> This is a famous problem due to Stanislaw Ulam. The <a href="http://en.wikipedia.org/wiki/New_digraph_reconstruction_conjecture">conjecture</a> is that the vertex deleted subgraphs of a graph determine the graph up to isomorphism, provided it has at least <img src='http://l.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' /> vertices. It is one of the best known problems in graph theory, and is one of the original diseases that Harary discussed. </p>
<p>
I somehow have been immune to this disease&#8212;I have never thought about it at all. The problem does seem to be solvable; how can all the subgraphs not determine a graph? My only thought has been that this problem somehow seems to be related to GI. But, I have no idea why I believe that is true.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <b>Jacobian Conjecture:</b> This is a famous problem about when a polynomial map has an inverse. Suppose that we consider the map that sends a pair of complex numbers <img src='http://l.wordpress.com/latex.php?latex=%7B%28x%2Cy%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{(x,y)}' title='{(x,y)}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%7B%28p%28x%2Cy%29%2Cq%28x%2Cy%29%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{(p(x,y),q(x,y))}' title='{(p(x,y),q(x,y))}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%7Bp%28x%2Cy%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p(x,y)}' title='{p(x,y)}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Bq%28x%2Cy%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{q(x,y)}' title='{q(x,y)}' class='latex' /> are both integer polynomials. The conjecture is that the mapping is 1-1 if and only if the mapping is locally 1-1. The reason it is called the Jacobian Conjecture is that the map is locally 1-1 if and only if the determinant of the matrix
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cleft%28+%7B%5Cbegin%7Barray%7D%7Bcc%7D+p_%7Bx%7D%28x%2Cy%29+%26+q_%7Bx%7D%28x%2Cy%29+%5C%5C+p_%7By%7D%28x%2Cy%29+%26+q_%7By%7D%28x%2Cy%29+%5C%5C+%5Cend%7Barray%7D+%7D+%5Cright%29+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \left( {\begin{array}{cc} p_{x}(x,y) &amp; q_{x}(x,y) \\ p_{y}(x,y) &amp; q_{y}(x,y) \\ \end{array} } \right) ' title='\displaystyle  \left( {\begin{array}{cc} p_{x}(x,y) &amp; q_{x}(x,y) \\ p_{y}(x,y) &amp; q_{y}(x,y) \\ \end{array} } \right) ' class='latex' /></p>
<p> is a non-zero constant. Note, <img src='http://l.wordpress.com/latex.php?latex=%7Bp_%7Bx%7D%28x%2Cy%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p_{x}(x,y)}' title='{p_{x}(x,y)}' class='latex' /> is the partial derivative of the polynomial with respect to <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' />. The above matrix is called the Jacobian of the map. </p>
<p>
This is a perfect example of a MD. I have worked some on it with one of the experts in the area&#8212;we proved a small result about the problem. During the time we started to work together, within a few months the full result was claimed twice. One of the claims was by a faculty member of a well known mathematics department. They even went as far to schedule a series of &#8220;special&#8221; talks to present the great proof. Another expert in the area had looked at their proof and announced that it was &#8220;correct.&#8221; Eventually, the talks were cancelled, since the proof fell apart.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <b>Crypto-Systems:</b> This is the quest to create new public key crypto-systems. While factoring, discrete logarithm, and elliptic curves seem to be fine existing public key systems, there is a constant interest in creating new ones that are based on other assumptions.</p>
<p>
Some of this work is quite technical, but it seems a bit like an MD to me. There are amateurs and professionals who both seem to always want to create a new system. Many times these systems are broken quite quickly&#8212;it is really hard to design a crypto-system. </p>
<p>
A recent example of this was the work of Sarah Flannery and David Flannery in creating a new system detailed in their book <a href="http://astore.amazon.com/sosmath/detail/1565123778">In Code</a>. The book gives the story of her discovery of her system, and its eventual collapse. </p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <b>P=NP:</b> You all know this problem. See <a href="http://www.win.tue.nl/~gwoegi/P-versus-NP.htm">this</a> for a nice list of attempts over the years to resolve the problem. Thanks to Gerhard Woeginger for maintaining the list. </p>
<p>
<p><b> Open Problems </b></p>
<p><p>
What are other MD&#8217;s? What is your favorite? Why do some problems become diseases? While others do not?</p>
<p>
I would love to see some progress made on group isomorphism&#8212;I guess I have a bad case of this disease. I promise that if you solve it I will stop thinking about it. Really.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3824/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3824/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3824/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3824/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3824/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3824/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3824/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3824/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3824/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3824/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3824&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/11/04/on-mathematical-diseases/feed/</wfw:commentRss>
		<slash:comments>34</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/11/images.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>
	</item>
		<item>
		<title>The Iceberg Effect in Theory Research</title>
		<link>http://rjlipton.wordpress.com/2009/10/31/the-iceberg-effect-in-theory-research/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/31/the-iceberg-effect-in-theory-research/#comments</comments>
		<pubDate>Sat, 31 Oct 2009 15:06:36 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[iceberg effect]]></category>
		<category><![CDATA[Karp]]></category>
		<category><![CDATA[optimal]]></category>
		<category><![CDATA[TSP]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3801</guid>
		<description><![CDATA[
The iceberg effect in research: how theorems can be lost  



Fred is my favorite name, when I need a &#8220;random&#8221; name. You might have also noticed that the picture is not even a picture of a person&#8212;it&#8217;s an iceberg. I will explain why in a moment.

Usually I talk about real people, but today I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3801&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em>The iceberg effect in research: how theorems can be lost  </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/images8.jpeg?w=111&#038;h=136" alt="images" title="images" width="111" height="136" class="alignright size-full wp-image-3802" /></p>
<p>
Fred is my favorite name, when I need a &#8220;random&#8221; name. You might have also noticed that the picture is not even a picture of a person&#8212;it&#8217;s an iceberg. I will explain why in a moment.</p>
<p>
Usually I talk about real people, but today I thought I would use phony names to protect the innocent&#8212;or at least protect my friends. </p>
<p>
I plan to talk about an issue that comes up in research:  	</p>
<blockquote><p>
Is the fact X a new result? Or is it a known result?
</p></blockquote>
<p><span id="more-3801"></span></p>
<p>
At FOCS, a great researcher, who I will call &#8220;Fred,&#8221; asked me if a certain theorem, he had just proved, was new. He did this in the hall right outside one of the conference rooms, which did not allow us much time or any blackboard for drawing pictures. His result concerns a clever algorithm that yields a neat approximation algorithm to a well studied problem. He asked me and others if it was &#8220;new.&#8221; My immediate response was that I knew a related theorem, but that Fred&#8217;s theorem seemed new to me. </p>
<p>
Moreover, Fred&#8217;s theorem seemed to generalize the known theorem, which made me excited so I began to write an entire post on his result. In the post I talked about his theorem, its proof, and how it differed from the old known result. In order to do the latter I had to Google until I found the old paper. This was not a completely trivial task, since the paper was so old. I try to be careful in how I cite papers and wanted to see the original. That is where I hit a snag: his &#8220;new&#8221; theorem was about <img src='http://l.wordpress.com/latex.php?latex=%7B32%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{32}' title='{32}' class='latex' /> years old. It was &#8220;known.&#8221; Or was it?</p>
<p>
<p><b> The Iceberg Effect </b></p>
<p><p>
The confusion about whether or not Fred&#8217;s theorem was new could be traced to what I will call the <i>iceberg effect</i>. Often an author may write up a paper that becomes famous for some theorem T. But also in the paper there are other theorems, which are not nearly as exciting as the main theorem T. Over time everyone knows T, we teach T to our students, we use T in our papers, and T becomes part of the fabric of theory.</p>
<p>
The problem is that T is like the visible part of an iceberg. We see the tip, T, that is above the water, but the part below, the other theorems, are soon forgotten. They may be quite clever or hard to prove, but people just know about T. This is where the iceberg effect hits. You prove a &#8220;new&#8221; theorem, which is not new&#8212;it&#8217;s part of the invisible theorems that are below the water line. </p>
<p>
This is exactly what happened to Fred. He lost a theorem, and I lost a post. Oh well.</p>
<p>
<p><b> Other Effects </b></p>
<p><p>
There are many other effects that make it hard to know for sure that something is new. I think that this problem is less of an issue in &#8220;hot&#8221; areas, since the results are fairly recent.</p>
<p>
For results that are older the iceberg effect and other effects can be a major problem. There are so many on-line papers, workshops, conferences, journals, and other sources of information that it is easy to overlook something. This is not unique to theory, but it is an issue that we need to work on.</p>
<p>
<p><b> The Theorem In Question </b></p>
<p><p>
Dick Karp proved a theorem about TSP for random points in the unit square. His theorem is based on a famous <a href="http://myweb.lsbu.ac.uk/~whittyr/MathSci/TheoremOfTheDay/OR/BHH/TotDBHH.pdf">result</a> of Jillian Beardwood, John Halton and John Hammersley who prove that the length of the optimal TSP tour for uniform independent random points in the unit square is <img src='http://l.wordpress.com/latex.php?latex=%7B%5COmega%28%5Csqrt+n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\Omega(\sqrt n)}' title='{\Omega(\sqrt n)}' class='latex' />. </p>
<p>
Karp proved his famous result in this <a href="http://rjlipton.files.wordpress.com/2009/10/karp77.pdf">paper</a>:</p>
<blockquote><p><b>Theorem: </b> <em> Let <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> <b>random</b> points be in the unit square, and let the length of their optimal tour be <img src='http://l.wordpress.com/latex.php?latex=%7BL%5E%7B%2A%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L^{*}}' title='{L^{*}}' class='latex' />. Then, there is a polynomial-time algorithm that given the points finds a tour of length <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> so that
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++L+%3D+L%5E%7B%2A%7D+%2B+o%28%5Csqrt+n%29.&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  L = L^{*} + o(\sqrt n).' title='\displaystyle  L = L^{*} + o(\sqrt n).' class='latex' /></p>
<p> </em></p></blockquote>
<p>Since the optimal tour is <img src='http://l.wordpress.com/latex.php?latex=%7B%5COmega%28%5Csqrt+n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\Omega(\sqrt n)}' title='{\Omega(\sqrt n)}' class='latex' />, this result shows that the tour found is close to the optimal length: the error term is low order. </p>
<p>
This is the T result. The &#8220;new&#8221; theorem was:</p>
<blockquote><p><b>Theorem: </b> <em> Let <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> <b>arbitrary</b> points be in the unit square, and let the length of their optimal tour be <img src='http://l.wordpress.com/latex.php?latex=%7BL%5E%7B%2A%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L^{*}}' title='{L^{*}}' class='latex' />. Then, there is a polynomial-time algorithm that given the points finds a tour of length <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> so that
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++L+%3D+L%5E%7B%2A%7D+%2B+o%28%5Csqrt+n%29.&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  L = L^{*} + o(\sqrt n).' title='\displaystyle  L = L^{*} + o(\sqrt n).' class='latex' /></p>
<p> </em></p></blockquote>
<p><p>
What Karp proved in his original paper is much stronger than his theorem on random points, which is the T result that we all remember. He actually proved the new result in one of his below-the-waterline theorems. He proved exactly the above theorem, but it was stated in slightly different language:</p>
<p>
<img src="http://rjlipton.files.wordpress.com/2009/10/th.png?w=500" alt="th" title="th" width="500" class="aligncenter size-full wp-image-3807" /></p>
<p>
For a more recent treatment of Karp&#8217;s result check out <a href="http://www.di.ens.fr/~vergnaud/algo0910/TSP.pdf">this</a>.</p>
<p>
<p><b> The Proof </b></p>
<p><p>
I will include a sketch of Karp&#8217;s theorem, with a weaker bound. Actually, this is how Fred sketched his &#8220;new&#8221; result, when he spoke to me in the hall. Check out Karp&#8217;s paper or the notes for the better bound. The key ideas are all here, however.</p>
<p>
Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> points are given that all lie in the unit square. <b>The points need not be random.</b></p>
<p>
The algorithm has three steps.</p>
<p>
 <b>Step 1:</b> Divide the square into <i>small square cells</i> of side <img src='http://l.wordpress.com/latex.php?latex=%7Bs+%3D+%5Cfrac%7B1%7D%7Bt%5Csqrt+n%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{s = \frac{1}{t\sqrt n}}' title='{s = \frac{1}{t\sqrt n}}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' /> is a slow growing function. Then, if a small cell has more than <img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> point in it, connect all the points together at their respective centroid. Now each small cell can be thought of as having at most <img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> point&#8212;call these points the <i>leaders</i>. The error introduced by this step is at most <img src='http://l.wordpress.com/latex.php?latex=%7BO%28n+s%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(n s)}' title='{O(n s)}' class='latex' />, which is <img src='http://l.wordpress.com/latex.php?latex=%7BO%28%5Csqrt+n%2Ft%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(\sqrt n/t)}' title='{O(\sqrt n/t)}' class='latex' />.</p>
<p>
<b>Step 2:</b> Next divide the square into <i>large square cells</i> of side <img src='http://l.wordpress.com/latex.php?latex=%7Bl+%3D+%5Cfrac%7Bt%7D%7B%5Csqrt+n%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{l = \frac{t}{\sqrt n}}' title='{l = \frac{t}{\sqrt n}}' class='latex' />. Each large cell contains at most <img src='http://l.wordpress.com/latex.php?latex=%7Bt%5E%7B4%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t^{4}}' title='{t^{4}}' class='latex' /> leaders, since a large cell contains at most <img src='http://l.wordpress.com/latex.php?latex=%7Bt%5E%7B4%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t^{4}}' title='{t^{4}}' class='latex' /> small cells. Use a brute force dynamic program to solve the TSP for each large cell exactly. This causes no error, but takes time at most,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++n+2+%5E%7Bt%5E%7B4%7D%7D.+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  n 2 ^{t^{4}}. ' title='\displaystyle  n 2 ^{t^{4}}. ' class='latex' /></p>
<p> If <img src='http://l.wordpress.com/latex.php?latex=%7Bt+%3D+%28%5Clog+n%29%5E%7B1%2F4%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t = (\log n)^{1/4}}' title='{t = (\log n)^{1/4}}' class='latex' />, then this runs in polynomial time.</p>
<p>
<b>Step 3:</b> Finally, connect all the large cells together by a simple path: the path snakes across each row of large cells. The total length of this path is linear in the number of rows of the large cells, <img src='http://l.wordpress.com/latex.php?latex=%7B1%2Fl%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/l}' title='{1/l}' class='latex' />. This path can be all error so the total error from each step is at most:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++O%28%5Csqrt+n%2Ft%29+%2B+0+%2B+1%2Fl.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  O(\sqrt n/t) + 0 + 1/l.' title='\displaystyle  O(\sqrt n/t) + 0 + 1/l.' class='latex' /></p>
<p> Since <img src='http://l.wordpress.com/latex.php?latex=%7Bl%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{l}' title='{l}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%7BO%28t%2F%5Csqrt+n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(t/\sqrt n)}' title='{O(t/\sqrt n)}' class='latex' />, the total error is bounded by <img src='http://l.wordpress.com/latex.php?latex=%7BO%28%5Csqrt+n%2Ft%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(\sqrt n/t)}' title='{O(\sqrt n/t)}' class='latex' />, which implies the theorem.</p>
<p>
<p><b> Open Problems </b></p>
<p><p>
I may be alone, but I often run into this question of what is known. The iceberg effect is one barrier that I have hit. But there are other issues. Can we do a better job of making sure that we do not re-invent what is known? </p>
<p>
Or was it really known if top researchers did not recall Karp&#8217;s &#8220;other&#8221; result? Perhaps the outcome here is good&#8212;I had forgotten that Karp&#8217;s ideas worked just fine for any collection of points. So perhaps the whole affair has worked out just fine.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3801/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3801/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3801/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3801/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3801/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3801/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3801/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3801/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3801/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3801/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3801&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/31/the-iceberg-effect-in-theory-research/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images8.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/th.png" medium="image">
			<media:title type="html">th</media:title>
		</media:content>
	</item>
		<item>
		<title>Highlights of FOCS Theory Day</title>
		<link>http://rjlipton.wordpress.com/2009/10/27/highlights-of-focs-theory-day/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/27/highlights-of-focs-theory-day/#comments</comments>
		<pubDate>Tue, 27 Oct 2009 13:16:07 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[Alon]]></category>
		<category><![CDATA[approximation]]></category>
		<category><![CDATA[Blum]]></category>
		<category><![CDATA[Factoring]]></category>
		<category><![CDATA[FOCS]]></category>
		<category><![CDATA[Karp]]></category>
		<category><![CDATA[matrix]]></category>
		<category><![CDATA[polynomial]]></category>
		<category><![CDATA[Problems]]></category>
		<category><![CDATA[Proof]]></category>
		<category><![CDATA[randomness]]></category>
		<category><![CDATA[spine]]></category>
		<category><![CDATA[Strassen]]></category>
		<category><![CDATA[Yannakakis]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3769</guid>
		<description><![CDATA[ 
 A summary of a great day of talks before FOCS at Theory Day 


Dick Karp was the leadoff speaker this Saturday at the FOCS Theory Day
 in Atlanta. He was followed by Mihalis Yannakakis, Noga Alon, and Manny Blum. Sounds to me like a lineup for a baseball team that is in the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3769&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p> <font color="”#0066cc?"><br />
<em> A summary of a great day of talks before FOCS at Theory Day </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/all.png?w=160&#038;h=172" alt="all" title="all" width="160" height="172" class="alignright size-full wp-image-3771" /></p>
<p>Dick Karp was the leadoff speaker this Saturday at the <a href="http://www.aco.gatech.edu/conference/focs-aco/">FOCS Theory Day</a><br />
 in Atlanta. He was followed by Mihalis Yannakakis, Noga Alon, and Manny Blum. Sounds to me like a lineup for a baseball team that is in the World Series. I can almost hear an announcer saying:</p>
<blockquote><p>
 	Now batting cleanup, Mannnnny Blummmmm.
 </p></blockquote>
<p> And the crowd goes wild.</p>
<p> Today I thought I would write a summary of what Dick and the others said Saturday. The talks were webcast, but perhaps not all of you watched them. In any event I will add some additional comments that I hope you enjoy.<br />
<span id="more-3769"></span></p>
<p> All of these speakers always give great talks, and this day was no exception. I did notice that Karp and Blum both have titles that ask a question. Is this meaningful?</p>
<p> I cannot resist one story about Dick. Years ago he visited Atlanta to give the keynote at a conference. Rich DeMillo was the program chair, and he and I went out to the airport to pick Karp up. We planned to eat lunch, and then get Dick to the venue for his address. During the lunch we had a wonderful time chatting with Karp. Finally, at one point Dick said that he had a small issue that he was worried about. We of course asked what it was&#8212;we were prepared to help him in anyway possible.</p>
<p> Dick explained that he made his slides on the long airplane ride from SFO to Atlanta. In those days talks were made by writing on plastic transparencies with colored pens; my personal favorite were  <a href="http://www.amazon.com/gp/product/B00006IFGW">vis-a-vis markers</a>. Dick said that he was concerned about the new type of pens that he had used. He did not use vis-a-vis, but he had used a new brand. We said that  was fine. But he added that he was a bit concerned how well the slides would project, since they looked pretty pale. The pens apparently were not as good as old reliable vis-a-vis.</p>
<p> After lunch we jumped in Rich&#8217;s car, raced over to the conference hotel, and quickly found the room where Dick&#8217;s talk was going to be held. We immediately checked out the projector and more importantly Dick&#8217;s slides. He was right&#8212;the colors were extremely pale and the slides were nearly impossible to read. But it was way too late to redo them, since the talk was just about to start. Some of the audience were already entering the room.</p>
<p> We went forward&#8212;there was no other option. Rich gave a wonderful introduction of Karp, there was a long applause, and then Dick started his talk. He gave a great presentation. Whether the slides are perfect or not, Dick has a way of speaking and presenting his ideas that transcends everything else. Whether the red on a slide, for example, was pale or dark was unimportant. When Dick finished there was again a long applause. His talk was a huge success, but I believe he did throw the new pens away.</p>
<p> Let&#8217;s move on to discuss the talks, which were all quite different styles. Yet each was a masterpiece given by one of the leaders of the field.</p>
<p> <b>What Makes an Algorithm Great?</b></p>
<p> Karp started by quoting me, twice. I was a bit embarrassed&#8212;and secretly pleased. The first quote was that &#8220;Algorithms are Tiny,&#8221; which I have <a href="http://rjlipton.wordpress.com/2009/02/13/algorithms-tiny-yet-powerful/">discussed</a> earlier. By the way the ideas in that discussion are joint with Lenore Blum.</p>
<ul>
<li> Karp then began to discuss great algorithms based on various<br />
 criteria. The first list was historical:</p>
<ul>
<li> The Positional Number system of <a href="http://wvegter.hivemind.net/abacus/CyberHeroes/Alkarismi.htm">Alkarismi</a>.
<li> The Chinese Remainder Theorem&#8212;which is really an algorithm.
<li> The Euclidean Algorithm.
<li> Gaussian Elimination.
</ul>
<p> Impossible to argue with any on his list. Without positional numbers, I believe, that one could make the case that almost no modern algorithm is possible. The Chinese Remainder Theorem is one of my favorites, and the other two are clearly great.</p>
<li> Karp then talked about several algorithms that he said were based on great ideas.<br />
 They were,</p>
<ul>
<li> Linear Programming
<li> Primality Testing
<li> Fast Matrix Product
<li> TSP
<li> Integer Factoring
</ul>
<p>The famous simplex method for LP is still one of the best ways to solve linear programs. Karp pointed out that the ellipsoid method was a great theoretic result, but the interior-point methods were the first to solve LP&#8217;s in a way which was both in polynomial-time and practical. Karp gave a nice survey of the work on the TSP: he pointed out that Nicos Christofides&#8217; wonderful <img src='http://l.wordpress.com/latex.php?latex=%7B1.5%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1.5}' title='{1.5}' class='latex' />-approximation algorithm for the metric TSP is still the best known. If you somehow do not know it <a href="http://en.wikipedia.org/wiki/Christofides_algorithm">check it out</a>. It was discovered in 1976&#8212;that&#8217;s a lot of FOCS&#8217;s ago. Dick also pointed out that Volker Strassen&#8217;s famous fast matrix product algorithm was a <a href="http://en.wikipedia.org/wiki/Strassen_algorithm">surprise</a> to the mathematical community. </p>
<li> Karp is an expert in so many things, but perhaps his roots are in combinatorial optimization. So he went into some detail on the great algorithms from that area.
<ul>
<li> Minimum Spanning Tree
<li> Shortest Path in Graphs
<li> For-Fulkerson Max-flow Algorithm
<li> Gale-Shapley Stable Marriage
<li> General Matching in Graphs
</ul>
<p> In 1965 Jack Edmonds&#8217; <a href="http://www3.ntu.edu.sg/home/YHLow/article/path-tree-flowers.pdf">paper</a> titled &#8220;Paths, Trees, and Flowers&#8221; not only showed how to do matching for non- bipartite graphs, but also laid out an informal basis of what later became P, NP, and co-NP. An incredible achievement for that point in time.</p>
<li> Karp next discussed randomized algorithms. I sometimes think we could not have a FOCS meeting if we disallowed the use of randomization.
<ul>
<li> Primality Testing
<li> Volume of Convex Bodies
<li> Counting Matchings
<li> Min-Cut of Graphs
<li> String Matching
<li> Hashing
</ul>
<p>All of these algorithms are great, and Karp spent some details on the volume algorithm of Alan Frieze and Ravi Kannan. He said nothing about the beautiful string matching algorithm that is due to Michael Rabin and himself. He was modest, but I am under no constraint. Their matching algorithm is one of the examples of the immense power of randomness. Their algorithm is theoretically fast and is practical. Actually it is more than practical, it is I believe the algorithm of choice for most packages. A wonderful algorithm, that would make my personal top ten.</p>
<li> Karp talked next about heuristic algorithms. These, he said, present an important challenge to theory. The central question, of course, is why do they work so well in practice?
<ul>
<li> Local Search
<li> Shotgun Sequencing Algorithm
<li> Simulated Annealing
</ul>
<p>Dick stated that Myers&#8217; great work on sequencing was critical to the effort that first sequenced the human genome at Celera&#8212;then a company in competition with the NIH funded labs. What is so neat about Myers&#8217; algorithm is that it required him to understand the structure of the human genome. The genome is <i>not</i> random, has structure, and that structure makes what Gene did so difficult.</p>
<li> Karp and complexity theory:
<ul>
<li> NLOG is Closed Under Complement
<li> Undirected Connectivity is in DLOG
<li> Space is More Powerful Than Time
</ul>
<p>Karp agreed with my earlier <a href="http://rjlipton.wordpress.com/2009/09/22/its-all-algorithms-algorithms-and-algorithms/">discussion</a> that even complexity theory is really all about algorithms. He selected the above as the some of the most outstanding ones. John Hopcroft, Wolfgang Paul, and Leslie Valiant proved that deterministic time <img src='http://l.wordpress.com/latex.php?latex=%7Bt%28n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t(n)}' title='{t(n)}' class='latex' /> is in deterministic space <img src='http://l.wordpress.com/latex.php?latex=%7Bo%28t%28n%29%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{o(t(n))}' title='{o(t(n))}' class='latex' />. Their proof relies on several algorithmic <a href="http://portal.acm.org/citation.cfm?id=322015">ideas</a>: the block-respecting method and the pebbling game strategy.</p>
<li> Karp only had time to list a number of great algorithms that show that theory can have impact on the &#8220;real-world.&#8221; All of the algorithms in his list have changed the world.
<ul>
<li> Fast Fourier Transform
<li> RSA Encryption
<li> Miller-Rabin Primality Test
<li> Reed-Solomon Codes
<li> Lempel-Ziv Compression
<li> Page Rank of Google
<li> Consistent Hashing of Akamai
<li> Viterbi and Hidden Markov Models
<li> Smith-Waterman
<li> Spectral Low Rank Approximation Algorithms
<li> Binary Decision Diagrams
</ul>
<p> Without these algorithms the world would stop: no web search, no digital music or movies, no security, <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cdots%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\dots}' title='{\dots}' class='latex' /> and on and on.</p>
<li> Karp finally gave a short list of algorithms from the area of information transfer.
<ul>
<li> Digital Fountain Codes
<li> CDMA
<li> Compressive Sensing of Images
</ul>
<p>The first is a breakthrough method of Michael Luby that is a near optimal erasure code. In many situations information may be lost, rather than corrupted. Since it is lost the decoder knows that it is missing, which allows for a whole different type of error correcting code. These codes are extremely powerful: they are easy to encode and decode, and need relatively few extra bits of redundancy. Compressive Sensing is a relatively recent idea that is based on deep mathematics, yet is rapidly changing image compression. The creators are David Donoho, Emmanuel  Candès,  Justin Romberg, and Terence Tao&#8212;see <a href="http://www.dsp.ece.rice.edu/cs">this</a> for many papers and articles from the area.</p>
</ul>
<p>The talk was wonderful. Wonderful. In one hour we were carried from ancient times, past the dawn of modern computational complexity, then to the modern era, and the present&#8212;with a peek at the future. A masterful talk by the master. Later in the meeting Alon said that he could not imagine doing what Karp did. I completely agree.</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5CS+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \S ' title='\displaystyle  \S ' class='latex' /></p>
<p> <b>Computational Aspects of Equilibria.</b></p>
<p>Yannakakis spoke about equilibria phenomena in game theory and economic theory. He started by saying that he had two hours worth of slides, but kindly finished nicely on time: close to the <i>von Neumann Bound</i>. John von Neumann once defined the perfect length of a talk as a micro-century, which works out to just about <img src='http://l.wordpress.com/latex.php?latex=%7B52%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{52}' title='{52}' class='latex' /> minutes.</p>
<ul>
<li> Yannakakis first explained how many systems can evolve over time to a stable state. He spoke at some length about the classic example of neural nets and a 1982 <a href="http://en.wikipedia.org/wiki/Hopfield_net">theorem</a> due to John Hopfield that proves that they always converge to a local minimum. The argument is based on showing that a certain potential function decreases and eventually reaches a local minimum.
<li> Yannakakis then switched to a discussion that was mostly about various types of games. He discussed in detail the famous result of John Nash, proved in 1950, that any non-zero sum game always has at least <i>one</i> mixed equilibria point.
<li>
 Yannakakis&#8217; key point was that Nash&#8217;s proof that mixed equilibria always exist was based on the famous Brouwer Fixed Point Theorem (BFPT). This theorem that is <a href="http://en.wikipedia.org/wiki/Brouwer_fixed_point_theorem">named</a> after Luitzen Brouwer is also, as Mihalis said, used to prove that a whole host of other game/economic systems have a stable equilibrium point.</p>
<li> Yannakakis then showed the power of modern complexity theory. Just because the BFPT is used to prove something does not rule out the possibility that there is a proof that avoids it. But, this is not the case. He introduced three complexity classes: <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmathsf%7BFIXP%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\mathsf{FIXP}}' title='{\mathsf{FIXP}}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmathsf%7BPPAD%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\mathsf{PPAD}}' title='{\mathsf{PPAD}}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmathsf%7BPLS%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\mathsf{PLS}}' title='{\mathsf{PLS}}' class='latex' />. They capture respectively: general <img src='http://l.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />-player games, <img src='http://l.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />-player games, and finally games that have pure equilibria. A beautiful point is that he could use these classes to show that BFPT is equivalent to Nash&#8217;s famous theorem for multiple player games, and thus it is <i>essential</i> for the proof. The power of modern complexity should not be underestimated. Without the crisp notion of a &#8220;complexity class&#8221; I cannot see how any result like this could even be stated&#8212;let alone proved. Terrific.
<li> Yannakakis also pointed out that solutions to multiple player games may not be rational numbers, in general. This was, by the way, known to Nash back in 1950. The problem with this is that then there may be no way to write down the exact solution to the problem. This plays havoc with complexity theory. Mihalis gave the analogy to the sum of square root problem: Is the following true,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Csum_%7Bi%3D1%7D%5E%7Bn%7D+%5Csqrt+%7Bd_%7Bi%7D%7D+%5Cle+b+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \sum_{i=1}^{n} \sqrt {d_{i}} \le b ' title='\displaystyle  \sum_{i=1}^{n} \sqrt {d_{i}} \le b ' class='latex' /></p>
<p> where the <img src='http://l.wordpress.com/latex.php?latex=%7Bd_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{d_{i}}' title='{d_{i}}' class='latex' /> are natural numbers and <img src='http://l.wordpress.com/latex.php?latex=%7Bb%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{b}' title='{b}' class='latex' /> is a rational number. It is a long standing open problem whether or not this can be solved in polynomial time. See this for a <a href="http://rjlipton.wordpress.com/2009/03/04/ron-graham-gives-a-talk/">discussion</a> that I had earlier on this topic.</p>
<li> Yannakakis finally gave us the usual news about the complexity classes: not much is known about their power. Oh well. The following diagram summarizes all that is known about them: (an arrow denotes inclusion)<br />
 <img src="http://rjlipton.files.wordpress.com/2009/10/classes1.png?w=375&#038;h=132" alt="classes" title="classes" width="375" height="132" class="aligncenter size-full wp-image-3793" /></p>
</ul>
<p>His talk summarized quite neatly the relationship between equilibria problems and certain complexity classes. I wish that Yannakakis had more time to give more details&#8212;perhaps another talk.</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5CS+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \S ' title='\displaystyle  \S ' class='latex' /></p>
<p> <b>Disjoint Paths, Isoperimetric Problems, and Graph Eigenvalues.</b></p>
<p>Alon started by pointing out that his first FOCS paper was at the <img src='http://l.wordpress.com/latex.php?latex=%7B%7B25%7D%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{25}^{th}}' title='{{25}^{th}}' class='latex' />. At that meeting, in 1984, he spoke on eigenvalues and expanders&#8212;he said he seemed to be stuck. He then pulled out a small piece of paper and proceeded to read,</p>
<blockquote><p>
 	I would like to thank the many FOCS program committees for their hard work and effort&#8212;<b>except</b> for the FOCS program committees of 1989, 2003, and 2006.
</p></blockquote>
<p>Even one of the greatest theorists in the world gets papers rejected from FOCS. This got a big laugh, yet I think it actually raises a serious question about the role of conferences. I know that is being discussed both on-line and off, but let&#8217;s move on to the rest of his talk and leave that discussion for another time and place.</p>
<p>Noga really gave highlights of two problems:</p>
<ul>
<li> Alon first discussed a kind of routing game. He imagines that you have some fixed <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> node degree <img src='http://l.wordpress.com/latex.php?latex=%7Bd%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{d}' title='{d}' class='latex' /> expander graph. The game is this: you will be given a series of requests of the form:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++s_%7B1%7D+%5Crightarrow+t_%7B1%7D%2C+%5Cdots%2C+s_%7Bm%7D+%5Crightarrow+t_%7Bm%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  s_{1} \rightarrow t_{1}, \dots, s_{m} \rightarrow t_{m} ' title='\displaystyle  s_{1} \rightarrow t_{1}, \dots, s_{m} \rightarrow t_{m} ' class='latex' /></p>
<p>where this means that you are to find a set of edge disjoint paths from <img src='http://l.wordpress.com/latex.php?latex=%7Bs_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{s_{i}}' title='{s_{i}}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%7Bt_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t_{i}}' title='{t_{i}}' class='latex' /> for each <img src='http://l.wordpress.com/latex.php?latex=%7Bi%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{i}' title='{i}' class='latex' />. He proves that on a special type of expander that not only can this be done for <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%7Bm+%3D+O%28n%2F%5Clog+n%29%7D%2C&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m = O(n/\log n)},' title='{m = O(n/\log n)},' class='latex' /> but that it can be done <i>on-line</i>. That is there is an algorithm that selects the path from <img src='http://l.wordpress.com/latex.php?latex=%7Bs_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{s_{i}}' title='{s_{i}}' class='latex' /> to <img src='http://l.wordpress.com/latex.php?latex=%7Bt_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t_{i}}' title='{t_{i}}' class='latex' /> without knowing the requests that will follow,</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++s_%7Bi%2B1%7D+%5Crightarrow+t_%7Bi%2B1%7D%2C+%5Cdots%2C+s_%7Bm%7D+%5Crightarrow+t_%7Bm%7D.+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  s_{i+1} \rightarrow t_{i+1}, \dots, s_{m} \rightarrow t_{m}. ' title='\displaystyle  s_{i+1} \rightarrow t_{i+1}, \dots, s_{m} \rightarrow t_{m}. ' class='latex' /></p>
<p>This seems amazing to me. See the <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4389521">paper</a> joint with Michael Capalbo for full details on how this works.</p>
<li> Alon then spoke on <i>spines</i>. Imagine <img src='http://l.wordpress.com/latex.php?latex=%7Bd%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{d}' title='{d}' class='latex' /> dimension space divided into unit cubes. You are to select a subset of each cube so that this &#8220;spine&#8221; does not allow any non-trivial path that moves from cube to cube. In a sense the spines are a kind of geometric separator. Since separators play a key role in many parts of theory, it may come as no surprise that spines also are very useful. A deep result of Guy Kindler, Ryan O&#8217;Donnell, Anup Rao, and Avi Wigderson shows that a spine can have surface area bounded by <img src='http://l.wordpress.com/latex.php?latex=%7BO%28%5Csqrt+d%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(\sqrt d)}' title='{O(\sqrt d)}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%7Bd%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{d}' title='{d}' class='latex' /> is the dimension of the space. However, the shape of their spine is essentially not known; it cannot be seen. The <a href="http://tcsmath.wordpress.com/2008/09/">discussion</a> here may help understand more about spines and their applications. Noga&#8217;s main result is a new uniform proof that solves this and other spine problems.
<li> Alon also had a simple piece of advice for all of us. He pointed out that we all use search engines to try to find mathematics that can help us in our research The trouble is that it is impossible to type in mathematical formulas into search engines. His suggestion is: <i>give all your lemmas meaningful names</i>. This may allow people to find your work, and then reference it. He gave an example of one of his lemmas:
<p>
 <b>Lemma:</b> <i>(A simple discrete vertex isoperimetric inequality on the<br />
Dirichlet boundary condition)</i></p>
</ul>
<p>When I Googled this &#8220;name&#8221; the second hit was his paper&#8212;not too bad.</p>
<p>
A wonderful talk, with light touches, and beautiful results; even though it contained many technical parts, I believe that most came away with the basic ideas that Noga wanted to get across.</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5CS+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \S ' title='\displaystyle  \S ' class='latex' /></p>
<p> <b>Can (Theoretical Computer) Science Come to Grips With Consciousness?</b></p>
<p>Blum spoke about a life long quest, that started when he was six years old, to understand consciousness. Blum has a record of looking at old problems in new ways, of looking at new problems in his own way, of creating whole fields of study, and thus we should take his ideas on what is &#8220;consciousness&#8221; very seriously. Let&#8217;s call this the: <i>What is consciousness problem.</i> (WICP).</p>
<p>It is hard to really do justice to his wonderful talk, but I think there are a few high level points that I can get right:</p>
<ul>
<li> Blum states that now may be the first time that real progress can be made on the WICP. This is due to the confluence of two events. The ability of researchers to use <a href="http://en.wikipedia.org/wiki/Functional_magnetic_resonance_imaging">FMRI</a> machines to watch a person&#8217;s brain as they think is relatively recent development. Second, is the maturing of a powerful theory called the <a href="http://cogweb.ucla.edu/CogSci/GWorkspace.html">Global Workspace Theory</a>, which was created by Bernie <a href="http://lifeboat.com/ex/bios.bernard.j.baars">Baars</a>. I will not explain the theory, since I do not understand it. But Blum says that it has a very computational flavor, which may mean that theory can make an important contribution to the WICP.
<li> Blum explained that when he was a graduate student he worked with some of the greats such as Warren McCulloch and Walter Pitts. They are famous for their notion of the McCulloch-Pitts <a href="http://en.wikipedia.org/wiki/Artificial_neuron">neuron</a>, which they proved could be the basis of a universal computing device. Today we study threshold functions and the complexity class <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmathsf%7BTC%7D%5E%7B0%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\mathsf{TC}^{0}}' title='{\mathsf{TC}^{0}}' class='latex' /> that are modern versions of their pioneering work.
<li> Blum pointed out a simple fact about the eye that I found fascinating: In the dark take a light source, and quickly move it around in front of you. You will see a path of light. This happens, Manny explained, because the eye responds only to motion. Then, keep the light source fixed and now move your head around. You will not see a path of light: you will see just a point of light. Somehow the eye and brain together can tell the difference between these two cases. Terrific.
<li> Blum then shifted into a more theory oriented part of his talk. He explained what he called <i>templates</i>. They are his way of modeling how we solve problems. His stated goal is quite ambitious; if we could understand WICP he thinks that we might be able to make robots/agents that learn. The template model is a kind of tree. For example, suppose that you are trying to prove a theorem. Then, the root of the tree would contain a &#8220;hint&#8221; or some high-level idea that gets you thinking in the right way about your theorem. Below that would be more precise yet still informal pieces of information. Eventually, the leaves of the tree would contain more formal pieces that make up the actual proof.
<li> Blum had a simple but great piece of advice to us all. If you are faced with a problem that you cannot solve, then modify the problem. Change it. Try another problem. He argued that this often may be interesting by itself, and sometimes may shed light on the original problem. Great advice.
<li> Blum ended with a pretty puzzle. Is there a irrational number <img src='http://l.wordpress.com/latex.php?latex=%7B%5Calpha%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\alpha}' title='{\alpha}' class='latex' /> so that
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Calpha+%5E%7B%5Calpha%7D+%3D+%5Cbeta&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \alpha ^{\alpha} = \beta' title='\displaystyle  \alpha ^{\alpha} = \beta' class='latex' /></p>
<p>for a rational <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbeta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\beta}' title='{\beta}' class='latex' />? The answer is yes, and he explained how his templates could guide you to the solution to his problem.</p>
</ul>
<p>A wonderful talk, from one of the great visionaries of the field.</p>
<p><b> Open Problems </b></p>
<p>Before turning to some open problems I want to thank the following people for making Theory day such a great success: Dani Denton, Milena Mihail, Dan Spielman, and Robin Thomas. Thanks for all your hard work.</p>
<p> Robin pointed out that it was also ACO&#8217;s <img src='http://l.wordpress.com/latex.php?latex=%7B%7B20%7D%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{20}^{th}}' title='{{20}^{th}}' class='latex' /> anniversary, and all of the talks should be <a href="http://www.aco.gatech.edu/conference/focs-aco/">online</a> later this week.</p>
<p>
Now some open problems that each speaker implicitly raised:</p>
<li> <strong>Karp:</strong> What are your favorite algorithms? Do you agree with Dick&#8217;s lists?
<li> <strong>Yannakakis:</strong> What are the relative powers of the complexity classes discussed? There is a <i>rumor</i> that part of this may be resolved&#8212;more on this soon.
<li> <strong>Alon:</strong> What does a spine really look like? Can the same bounds be found with spines that one can  actually &#8220;see?&#8221;
<li> <strong>Blum:</strong> What do you think about WICP? Can theory help us understand WICP? Is the Global Workspace Theory one we can contribute to?<br />
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3769/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3769/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3769/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3769/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3769/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3769/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3769/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3769/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3769/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3769/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3769&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/27/highlights-of-focs-theory-day/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/all.png" medium="image">
			<media:title type="html">all</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/classes1.png" medium="image">
			<media:title type="html">classes</media:title>
		</media:content>
	</item>
		<item>
		<title>Helping Wall Street Cheat With Theory</title>
		<link>http://rjlipton.wordpress.com/2009/10/22/helping-wall-street-cheat-with-theory/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/22/helping-wall-street-cheat-with-theory/#comments</comments>
		<pubDate>Thu, 22 Oct 2009 12:09:50 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[P=NP]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[approximation]]></category>
		<category><![CDATA[derivatives]]></category>
		<category><![CDATA[finance]]></category>
		<category><![CDATA[Problems]]></category>
		<category><![CDATA[random]]></category>
		<category><![CDATA[randomness]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3754</guid>
		<description><![CDATA[ 
 How to create &#8220;bad&#8221; derivatives that cannot be detected 



Sanjeev Arora is the one of the most creative theorists in the world. He helped create the concept of Probabilistically Checkable Proofs along with Uriel Feige, Shafi Goldwasser, Carsten Lund, László Lovász, Rajeev Motwani, Shmuel Safra, Madhu Sudan, and Mario Szegedy. On his own [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3754&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p> <font color="”#0066cc?"><br />
<em> How to create &#8220;bad&#8221; derivatives that cannot be detected </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/images7.jpeg?w=73&#038;h=117" alt="images" title="images" width="73" height="117" class="alignright size-full wp-image-3756" /></p>
<p>
Sanjeev Arora is the one of the most creative theorists in the world. He helped create the concept of <a href="http://en.wikipedia.org/wiki/PCP_theorem">Probabilistically Checkable Proofs</a> along with Uriel Feige, Shafi Goldwasser, Carsten Lund, László Lovász, Rajeev Motwani, Shmuel Safra, Madhu Sudan, and Mario Szegedy. On his own he found a wonderful approximation algorithm for the Euclidean TSP, and has done many other terrific things. I was at Princeton when we hired him, and it was clear back then that he was special.</p>
<p>
Today I had planned to talk about some of the papers of FOCS 2009, but will instead talk about a new <a href="http://www.cs.princeton.edu/~rongge/derivative.pdf">paper</a> of Arora with Boaz Barak, Markus Brunnermeier, and Rong Ge (ABBG) on finance. I really like their paper, and it seems to me to be one of those seminal papers that could launch a whole subfield.<br />
<span id="more-3754"></span></p>
<p>
Sanjeev is currently the PI of a large NSF project looking into intractability and P=NP. He is also the creator and current head of the theory committee that has and continues to try to work with NSF to get more support for theory. The previous head was Richard Karp who did an excellent job running the committee for a number of years.</p>
<p>
I feel like saying stop the presses. But this is not being printed so I guess that does not apply anymore. In one of my favorite movies, <a href="http://www.imdb.com/title/tt0110771/">The Paper</a>, there is a scene where the main character an editor of a newspaper, played by Michael Keaton, is told by one of his reporters, played by Randy Quaid, &#8220;you got to say it.&#8221; They are standing next to the high speed printing system. Papers are streaming by at a huge rate, and they want to stop the printing and change the lead story, since the the paper being printed has gotten it completely wrong. Quaid says again to Keaton you&#8217;ve got to say it. Finally, Keaton yells &#8220;stop the presses.&#8221; Of course nothing happens right away, next <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cdots%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\dots}' title='{\dots}' class='latex' /> See the movie if you want to know more. I really like this flick.</p>
<p>
As newspapers go out of business and news becomes electronic, what will we get to say in the future? Re-boot the servers?</p>
<p>
Well that&#8217;s what I am doing today. So let&#8217;s turn to the world of finance and the paper of ABBG.</p>
<p>
<p><b> Let&#8217;s Play Cards </b></p>
<p><p>
I am no expert on finance, so to explain ABBG I will use an analogy of playing a card game.</p>
<p>
Imagine that there are two players Alice and Bob who wish to play a simple game of cards. Alice has a standard deck of <img src='http://l.wordpress.com/latex.php?latex=%7B52%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{52}' title='{52}' class='latex' /> playing cards and she is the dealer. Suppose she shuffles the cards and then deals out a &#8220;hand&#8221; of one card to Bob. He wins if the hand is a red card; otherwise, he loses. If Alice shuffles fairly, Bob has exactly a <img src='http://l.wordpress.com/latex.php?latex=%7B50%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{50\%}' title='{50\%}' class='latex' /> chance of winning&#8212;it is a fair, if boring game. I could make the game more realistic, but that would change nothing except make the analysis more involved.</p>
<p>
The problem with the game is the simple phrase: <i>If Alice shuffles fairly</i>. What happens if Alice cheats? She could cheat in a number of ways; here are some that I have arranged in increasing order of skill required by Alice: </p>
<ul>
<li> <i>The deck might be a stacked deck</i>. Suppose that Alice has changed the deck so that it consists of only black cards. Then, Bob will have no chance of winning. Or she could have removed one red card and replaced it with a black one, then Bob&#8217;s chance of winning is reduced to <img src='http://l.wordpress.com/latex.php?latex=%7B25%2F52+%3C+0.5%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{25/52 &lt; 0.5}' title='{25/52 &lt; 0.5}' class='latex' />. Note, Alice requires no special skills; she can be shuffling away and let Bob cut the deck as much as he likes. The game is a still losing one for Bob.
<li> <i>The deck might be a marked deck</i>. In this case Alice cheats by having the deck contain all the usual cards, but the backs of the cards are &#8220;marked&#8221; so that she can tell whether or not a card is a red or a black card. Then, she just shuffles away until she sees that the top card is black, she then stops and deals the top card to Bob.
<li> <i>The deck might be a real deck</i>. In this case Alice cheats by not shuffling the deck properly. She could have originally arranged that the red cards so they are on the top of the deck. With a few careful shuffles of the right kind, she can vastly lower Bob&#8217;s chance of winning. Even worse if she is a really good &#8220;card mechanic&#8221; she can show Bob the deck&#8212;that is she can fan out the faces of the cards and show them to Bob. The cards will look to be in a random order. But if she can execute the correct type of shuffles, then she can force that Bob always gets a black card. Essentially, she undoes the &#8220;random&#8221; order of the cards by performing careful shuffles and moves a black card to the top. This requires quite a bit of skill: a word of advice, never play cards with magicians.
</ul>
<p>
<p><b> Let&#8217;s Play Derivatives </b></p>
<p><p>
Again there are two players who I will still call Alice and Bob. Alice is now creating financial derivatives and trying to sell them to Bob and to others. Before I even explain what a derivative is you probably can guess the high level insight:  Alice creating the derivative is the same as Alice dealing the cards. If Alice creates them fairly, then Bob is okay. However, just as Alice can cheat at cards, she can cheat at derivatives.  The reason Alice can cheat is the same as before: Alice may have extra information that she can exploit to cheat Bob. In a sense there is no surprise that Alice could cheat at cards, but that Alice could cheat at derivatives is the main result of ABBG.</p>
<p>
With all due respect I do not mean to diminish the brilliant insight of ABBG&#8212;I really like their paper. My goal is to explain that what is happening with derivatives is very much the same as cheating at cards.</p>
<p>
However, there are a number of interesting technical differences between cheating at cards and cheating at derivatives. One of them, by the way, is that the stakes are vastly larger. People play card games for stakes in the thousands or even hundred of thousand dollars, but banks &#8220;play derivatives&#8221; for billions.</p>
<p>
<p><b> The ABBG Result As A Card Game </b></p>
<p><p>
My usual disclaimer: read their paper for the details and full statements of their results. I would like to give a high level view of their basic model, what they assume, and what they prove.</p>
<p>
As I understand the key idea is that Alice has a collection of <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> assets <img src='http://l.wordpress.com/latex.php?latex=%7B%5C%7Ba_%7Bi%7D%2C%5Cdots%2Ca_%7Bn%7D%5C%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\{a_{i},\dots,a_{n}\}}' title='{\{a_{i},\dots,a_{n}\}}' class='latex' />. She then creates derivatives <img src='http://l.wordpress.com/latex.php?latex=%7BS_%7B1%7D%2C%5Cdots%2CS_%7Bm%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{S_{1},\dots,S_{m}}' title='{S_{1},\dots,S_{m}}' class='latex' /> where each <img src='http://l.wordpress.com/latex.php?latex=%7BS_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{S_{i}}' title='{S_{i}}' class='latex' /> is a random subset of size <img src='http://l.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' /> of the assets. Each asset will then either fail or succeed. The derivative <img src='http://l.wordpress.com/latex.php?latex=%7BS_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{S_{i}}' title='{S_{i}}' class='latex' /> pays off if at least one-half of the assets in it succeed; otherwise, it does not pay off. If Bob buys the derivative <img src='http://l.wordpress.com/latex.php?latex=%7BS_%7Bk%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{S_{k}}' title='{S_{k}}' class='latex' />, then he wins if a majority of his assets succeed; otherwise, he loses.</p>
<p>
Clearly, Alice and Bob are playing a type of game. Here is a way to model this as a card game. Alice has a deck of cards of the assets <img src='http://l.wordpress.com/latex.php?latex=%7B%5C%7Ba_%7Bi%7D%2C%5Cdots%2Ca_%7Bn%7D%5C%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\{a_{i},\dots,a_{n}\}}' title='{\{a_{i},\dots,a_{n}\}}' class='latex' /> and she deals Bob a hand that consists of <img src='http://l.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' /> cards. That is his derivative. Later on, it is revealed which assets fail and which succeed, and Alice and Bob can determine who won the game.</p>
<p>
In the derivative model Alice actually creates a number of derivatives at once. I must admit that I am not sure why this is done, but apparently it is the way that derivatives work. More on this later on.</p>
<p>
Let&#8217;s model this as a card type of game. We will use cards to stand for assets and hands to stand for derivatives. Each card has a label <img src='http://l.wordpress.com/latex.php?latex=%7Ba_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{a_{i}}' title='{a_{i}}' class='latex' /> on the back and a color red/black on the front. From Alice&#8217;s perspective, red stands for failure and black for success. Thus, for Alice, a winning hand is one with a majority of black cards. We will also assume that there are an equal number of red and black cards in the deck.</p>
<p>
Alice has <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> copies of a special deck of cards. Each deck has <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> cards. All of these decks are identical. Alice shuffles the first one and deals out a hand of <img src='http://l.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' /> cards&#8212;this is the first derivative. She then takes the second deck and deals out another hand, and so on until all <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> hands have been dealt.</p>
<p>
These hands define the <img src='http://l.wordpress.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{m}' title='{m}' class='latex' /> derivatives. Note, it is important that assets can be in more than one derivative, since Alice may wish to sell more than <img src='http://l.wordpress.com/latex.php?latex=%7Bn%2FD%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n/D}' title='{n/D}' class='latex' /> derivatives. Thus, it is critical that Alice has multiple copies of the deck.</p>
<p>
If Alice has no idea which asset with fail or succeed in the future, this is a fine game. However, if Alice knows that some of the assets are more likely to fail than others, then she can cheat. The ABBG paper calls these <i>lemons</i>. Note, since the back of the cards are marked with the name of the asset <img src='http://l.wordpress.com/latex.php?latex=%7Ba_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{a_{i}}' title='{a_{i}}' class='latex' /> it is like the cards are marked.</p>
<p>
Even if Alice does not have a perfect mapping from an asset to its color, Alice could possibly cheat. She can do that if she has some partial information about the color of the cards. She does this by the way she deals out the cards to make the hands. Her goal is to get more black cards into hands, so she will win those hands. Even some partial information here is enough to greatly increase the odds in her favor.</p>
<p>
<p><b> The ABBG Result </b></p>
<p><p>
In the ABBG paper the type of theorems they prove rely on two key points about the abilities of Alice and Bob.</p>
<p>
The first concerns Alice. She must have some partial information about the lemons, the color of the cards, if she is going to cheat Bob. The exact number of lemons and how much they affect the cost to Bob is a technical point. I suggest you read their paper to get the exact results.</p>
<p>
The second concerns Bob. If he is going to be unable to detect that Alice is cheating he must be <i>computationally limited</i>. In any cheating scam the person being cheated must be limited in their ability to detect the cheating, or it will be discovered. In the examples of ways that Alice can cheat at cards, Bob must not notice certain things. In one case, he must not notice that the deck is all black cards, in another that the deck has been marked, and in the last that the deck is in a pre-arranged order. His ability to detect Alice cheating in these ways gets harder: the first is easy&#8212;just look at the deck, while the last is quite hard&#8212;it is hard to notice that she is executing perfect shuffles. Notice that the more skill Alice uses to cheat, the harder it is for Bob to detect it. In similar ways, ABBG demonstrates clever skillful ways of cheating that makes it extremely hard for Bob to detect.</p>
<p>
 The ABBG results depend on Bob being limited in his computational ability. In particular, Bob must not be able to detect that Alice has dealt the hands in a non-random manner. The only way that Bob can detect this is their main theorem; in order for Bob to discover that Alice has cheated him in constructing the derivatives he must be able to discover that a graph has a dense subgraph. The intuition behind this result is that if Alice deals the hands, that is creates the derivatives, in a non-random manner the structure of the sets will define a graph that deviates enough from randomness to leave a proof of her cheating. However, the key insight is that detecting this is an intractable problem.</p>
<p>
In summary, the ABBG phenomena is: </p>
<p><em>
<ol>
<li> With enough lemons Alice can create non-random derivatives that will generate extra cost for Bob;
<li> With limited computational ability Bob will be unable to detect that Alice has acted in a non-random manner.
</ol>
<p></em></p>
<p>The exact statement of this result is technical, but I believe that the above captures the key insights that they have made. Here is an example, in their terminology of the type of results that they prove: </p>
<blockquote><p><b>Theorem: </b> <em> When <img src='http://l.wordpress.com/latex.php?latex=%7Bd-b+%3E+3%5Csqrt+D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{d-b &gt; 3\sqrt D}' title='{d-b &gt; 3\sqrt D}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%7Bn%2FN+%5Cll+d%2FD%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n/N \ll d/D}' title='{n/N \ll d/D}' class='latex' />, an <img src='http://l.wordpress.com/latex.php?latex=%7B%28m%2Cn%2Cd%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{(m,n,d)}' title='{(m,n,d)}' class='latex' /> subgraph will generate an extra lemon cost of at least <img src='http://l.wordpress.com/latex.php?latex=%7B%281-2p-o%281%29%29mV+%5Capprox+n%5Csqrt%7BN%2FM%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{(1-2p-o(1))mV \approx n\sqrt{N/M}}' title='{(1-2p-o(1))mV \approx n\sqrt{N/M}}' class='latex' />. </em></p></blockquote>
<p> What this is saying is: Alice can cheat and cause a certain amount of extra loss over fair play. For the meaning of the symbols see ABBG, but the symbols denote the main parameters of their model: the number of assets, the size of the derivative, the number of lemons, and so on.</p>
<p>
<p><b> Is ABBG Really A Problem? </b></p>
<p><p>
The last thing Wall Street seems to need is advice from theorists on how to cheat. They seem to have done quite well on their own in the past and in recent times. Just say <a href="http://en.wikipedia.org/wiki/Bernard_Madoff">Bernie Madoff</a>&#8212;if you type &#8220;bernie&#8221; into Google he immediately comes up&#8212;I feel sorry for anyone named Bernie.</p>
<p>
The ABBG paper has nevertheless caused a quite stir among finance bloggers. For example look at <a href="http://www.boingboing.net/2009/10/15/complex-derivatives.html">this</a>, <a href="http://www.dailykos.com/story/2009/10/18/14636/845">this</a>, and <a href="http://bonds.about.com/od/derivativesandexotics/a/CDO.htm">this</a>.</p>
<p>
The core of the cheating described in ABBG, in our terminology, is that Alice deals out more black cards than she would if the dealing was completely random. Thus, Bob can only detect that Alice is cheating if he is able to discover that she is dealing in this way. The clever insight of the ABBG paper is that this detection is equivalent to detecting a dense subgraph in a graph.</p>
<p>
The conventional wisdom is that this is a hard problem. You all know that I have some doubts about this, but even with those aside this is a reasonable issue. </p>
<ul>
<li> A problem can be hard in theory, but might be easy in practice. One needs to know the size of the graphs that are used in practice, since even worst case algorithms could do well.
<li> Another possibility is that there might be good approximation algorithms that work on the graphs ABBG uses.
<li> Finally, the problems they create are &#8220;planted&#8221; versions of the dense subgraph problem. I think that this version of the subgraph problem could be simpler.
</ul>
<p>
Overall I think that I would not want to have to solve these problems. But the simple claim that they are &#8220;intractable&#8221; is one that the authors, I am sure, would agree is a claim that needs to be carefully checked.</p>
<p>
<p><b> Solutions? </b></p>
<p><p>
One thing we know is that when there is a trust problem in a digital system often we can overcome the trust issue. Sometimes this requires quite simple ideas, sometimes this requires modern cryptography, and sometimes something in between. I think there could be a number of ways to make it so Alice cannot cheat when creating the derivatives.</p>
<p>
One idea that comes to mind is some protocol that forces her to really use proper random bits in constructing the derivatives. You could imagine that Bob is protected in this way. Another approach might be to let Bob select his own random derivative. This would not allow Alice to cheat, but would allow Bob to cheat. If he was the one with knowledge about the assets, then he could cheat Alice.</p>
<p>
The key insight seems to be that this is a crypto problem that we have seen before: having two or more parties agree on a random string. I think that this can be done. Perhaps the most interesting question is can some special properties of derivatives be used to make the protocol more efficient.</p>
<p>
<p><b> Open Problems </b></p>
<p><p>
Are there other types of ways to cheat in markets? Also one neat question might be how to develop a protocol that will allow both parties to be sure that all is well? I expect that we will see many more papers on this topic.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3754/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3754/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3754/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3754/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3754/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3754/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3754/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3754/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3754/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3754/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3754&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/22/helping-wall-street-cheat-with-theory/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images7.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>
	</item>
		<item>
		<title>Happy Fiftieth Birthday FOCS</title>
		<link>http://rjlipton.wordpress.com/2009/10/18/happy-fiftieth-birthday-focs/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/18/happy-fiftieth-birthday-focs/#comments</comments>
		<pubDate>Sun, 18 Oct 2009 18:08:39 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[depth first]]></category>
		<category><![CDATA[FOCS]]></category>
		<category><![CDATA[FSA]]></category>
		<category><![CDATA[matching]]></category>
		<category><![CDATA[matrix]]></category>
		<category><![CDATA[Problems]]></category>
		<category><![CDATA[SWAT]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3731</guid>
		<description><![CDATA[
 A look back at some the early days of the FOCS conference 



Alvy Ray Smith is a famous researcher who has won many awards for his seminal work in computer graphics. He is a theorist also: he has had several papers published at FOCS, for example.

Today I thought, since FOCS&#8217;s  anniversary is soon, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3731&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em> A look back at some the early days of the FOCS conference </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/images-2.jpeg?w=74&#038;h=74" alt="images-2" title="images-2" width="74" height="74" class="alignright size-full wp-image-3732" /></p>
<p>
Alvy Ray Smith is a famous researcher who has won many awards for his seminal work in computer graphics. He is a theorist also: he has had several papers published at FOCS, for example.</p>
<p>
Today I thought, since FOCS&#8217;s <img src='http://l.wordpress.com/latex.php?latex=%7B%7B50%7D%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{{50}^{th}}' title='{{50}^{th}}' class='latex' /> anniversary is soon, that it would be fun to look back at the first conferences and see what researchers were doing. Below is a singing tribute to FOCS:<br />
<span id="more-3731"></span></p>
<p><span style="text-align:center; display: block;"><a href="http://rjlipton.wordpress.com/2009/10/18/happy-fiftieth-birthday-focs/"><img src="http://img.youtube.com/vi/dePMU8R131s/2.jpg" alt="" /></a></span></p>
<p>
The first observation is that they were not attending FOCS. The conference we call FOCS has been around for fifty years, but it has gone through two other names before becoming FOCS. The name was <a href="http://en.wikipedia.org/wiki/Symposium_on_Switching_and_Automata_Theory">changed twice</a>: In 1960 until 1965 it was the &#8220;Symposium on Switching Circuit Theory and Logical Design;&#8221; then, from 1966 to 1974 it was the &#8220;Symposium on Switching and Automata Theory;&#8221; and finally it switched to the &#8220;Foundations of Computer Science.&#8221; Hopefully, this is the last name, but who knows?</p>
<p>
When the name was: <i>Switching and Automata Theory</i>, it had the &#8220;bad&#8221; abbreviation as SWAT. I recall the business meeting where we discussed changing the name to something better. I believe&#8212;I could be wrong&#8212;that Emily Friedman made two key contributions. </p>
<p>First, she suggested <img src="http://rjlipton.files.wordpress.com/2009/10/images-31.jpeg?w=60" alt="images-3" title="images-3" width="60" class="alignright size-full wp-image-3736" />the name, FOCS, and also she helped design the &#8220;fox&#8221; logo. Emily was very active in the 1970&#8217;s and did some quite nice work on program schema and various aspects of language theory. Alvy Smith designed the famous <a href="http://www.alvyray.com/Art/Synapse.htm">cover</a> that has used for years on the proceedings. </p>
<p>
<img src="http://rjlipton.files.wordpress.com/2009/10/images6.jpeg?w=107&#038;h=127" alt="images" title="images" width="107" height="127" class="aligncenter size-full wp-image-3740" /></p>
<p>
Speaking of covers of proceedings, when I was the program chair for STOC I was very excited about the honor. I especially liked the idea that the program chair got to select the <i>color</i> of the proceedings, since I thought this would be fun. Finally the big day came, we had selected the papers, the accept and reject letters had been sent, and it was time to pick the color for the cover. The print person at ACM called me to ask about my choice of colors. I told him that I wanted the cover to be black with white letters, which I thought would be dramatic and unique. He told me that he would select a <i>color</i> and hung up the phone&#8212;really. I did not get to choose. Oh well.</p>
<p>
Let&#8217;s now turn to look at what the field was up to years ago.</p>
<p>
<p><b> Symposium on Switching Circuit Theory and Logical Design </b></p>
<p><p>
Here are the papers from the 1963 conference:</p>
<p><ul>
<li> <i>Infinite sequences and finite machines:</i> Muller, David E.</p>
<li> <i>Two-sided finite-state transductions:</i> by Elgot, C. C.; Mezei, J. E.
<li> <i>On computability by certain classes of restricted turing machines:</i> by Fischer, Patrick C.
<li> <i>Bilateral threshold nets:</i> by Frazer, W. D.
<li> <i>Threshold gate realizations of logical functions with don&#8217;t cares:</i> by Coates, C. L.; Lewis, P. M.
<li> <i>On the analysis of functional symmetry:</i> Arnold, R. F.; Lawler, E. L.
<li> <i>The minimal synthesis of tree structures:</i> by Lawler, E. L.
<li> <i>Determining the best ordering of variables in cascade switching circuits:</i> by Levien, Roger E.
<li> <i>Sequential circuit synthesis using input delays:</i> by Eichelberger, Edward B.
<li> <i>Finite automata and badly timed elements:</i> by McNaughton, Robert
<li> <i>Demonstrating Hazards in sequential relay circuits:</i> by Booth, Theodore M.
<li> <i>Logical design theory of NOR gate networks with no complemented inputs:</i> McCluskey, E. J.
<p><li> <i>A survey of asynchronous logic: Comparing various definitions and models for asynchronous switching circuits:</i> Miller, R. E.
</ul>
<p>
<p><b> Symposium on Switching and Automata Theory </b></p>
<p><p>
Here are the papers from the 1966 conference:</p>
<ul>
<li> <i>Context-free language processing in time <img src='http://l.wordpress.com/latex.php?latex=%7Bn%5E%7B3%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n^{3}}' title='{n^{3}}' class='latex' />:</i> by Daniel Younger.
<li> <i>Syntax directed transduction:</i> by Lewis, P. M.; Stearns, R. E.
<li> <i>Simple deterministic languages:</i> by Korenjak, A. J.; Hopcroft, J. E
<li> <i>One-way stack automata (Extended abstract):</i> by Ginsburg, Seymour; Greibach, Sheila A.; Harrison, Michael A.
<li> <i>Real-time computation by n-dimensional iterative arrays of finite-state machines:</i> by Cole, Stephen N.
<li> <i>The recognition problem for the set of perfect squares:</i> by Cobham, Alan
<li> <i>Roots of star events:</i> by Brzozowski, J. A.
<li> <i>Subdirect decompositions of transformation graphs:</i> by Yoeli, M.; Ablow, C. M
<li> <i>Pair algebra and its application:</i> by Liu, C. L.
<li> <i>Graphs of affine transformations, with applications to sequential circuits:</i> by Gill, Arthur
<li> <i>A method for the combined row-column reduction of flow tables:</i> by Grasselli, A.; Luccio, F.
<li> <i>Standard minimum transition time secondary assignments for asynchronous circuits:</i> by Epley, Donald L.
<li> <i>A row assignment for delay-free realizations of flow tables without essential hazards:</i> by Unger, Stephen H.
<li> <i>Synthesis of multiple sequential machines:</i> by Smith, Edward J.; Kohavi, Zvi
<li> <i>Realization of sequential machined with threshold elements:</i> by Hadlock, F. O.; Coates, C. L
<li> <i>Fast Simulation of Nondeterministic Turing Machines:</i> by Reading, R. U.
<li> <i>The application of threshold logic to the design of sequential machines:</i> by Masters, Gilbert M.; Mattson, Richard L.
<li> <i>Minimization and convexity in threshold logic:</i> by Dertouzos, Michael L.; Fluhr, Zachary C.
<li> <i>On minimal modulo 2 sums of products for switching functions:</i> by Even, S.; Kohavi, I.; Paz, A
<li> <i>Conjunctive encoding of Boolean matrices:</i> by Stover, D. R.; Epley, D. L.
<li> <i>Asynchronous propagation-limited logic:</i> by Goldberg, J.; Stone, H. S.
<li> <i>The synthesis of multipurpose logic devices:</i> by King, W. Frank
<li> <i>The universal logic block (ULB) and its application to logic design:</i> by Forslund, D. C.
<li> <i>Statistical properties of random digital sequences:</i> by Booth, Taylor L.
<li> <i>Reconsider the state minimization problem for stochastic finite state systems:</i> by Ott, Gene H.
<li> <i>An application of coding algebra to the design of a digital multiplexing system using linear sequential circuits:</i> by Helm, H. A.
<li> <i>Automorphism groups and quotients of strongly connected automata and monadic algebras:</i> by Bayer, R.
<li> <i>On the automorphism group of a reduced automaton:</i> by Paul, Manfred
</ul>
<p>
<p><b> Some Observations </b></p>
<p><p>
One issue that earlier researchers worried about was the copyright issue. You may see that papers are called &#8220;extended abstracts&#8221; or &#8220;preliminary version.&#8221; This was so that journals would allow the authors to publish a final version again. There were relatively few papers, and there were no parallel sessions. </p>
<p>
The topics of course were of course different from what we think about today. Yet not too far from things that we are interested in today. It is very clear that exact results were the goal. I think almost no papers were about approximations&#8212;in any sense of the word. The field had not discovered that one way to make progress on hard problems was to lower the bar. Since getting the answer was often too hard, try to get an approximation. This lesson would not arrive for a few years.</p>
<p>
The papers also just looked different, since they were typed not typeset. Take a look at some of them. You will see handwritten symbols:<br />
<img src="http://rjlipton.files.wordpress.com/2009/10/picture-1.png?w=344&#038;h=98" alt="Picture 1" title="Picture 1" width="344" height="98" class="aligncenter size-full wp-image-3749" /></p>
<p>
<p><b> SWAT 1971 </b></p>
<p><p>
Here are a few selected papers from the 1971 conference. You will notice that now the topics are very close to current ones. For example, in FOCS 2009 there is a <a href="http://www.cs.cmu.edu/~ryanw/focs09.pdf">paper</a> by Nikhil Bansal and Ryan Williams that is on boolean matrix product, which is related to the Fischer and Meyer paper below. </p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>Matching:</i> This is the famous paper of John Hopcroft and Richard Karp called: <a href="http://rjlipton.files.wordpress.com/2009/10/match1971.pdf">A <img src='http://l.wordpress.com/latex.php?latex=%7Bn%5E%7B5%2F2%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n^{5/2}}' title='{n^{5/2}}' class='latex' /> Algorithm For Maximum Matching In Bipartite Graphs</a>. The special symbols in this and other papers were written in by hand: in the abstract the <img src='http://l.wordpress.com/latex.php?latex=%7B+%5Csqrt+%7B%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{ \sqrt {}}' title='{ \sqrt {}}' class='latex' /> symbol was left out and the running time was claimed to be order <img src='http://l.wordpress.com/latex.php?latex=%7B%28m%2Bn%29+%5Cquad+n+%5Cquad+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{(m+n) \quad n \quad }' title='{(m+n) \quad n \quad }' class='latex' /> instead of <img src='http://l.wordpress.com/latex.php?latex=%7B%28m%2Bn%29%5Csqrt+n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{(m+n)\sqrt n}' title='{(m+n)\sqrt n}' class='latex' />: note the extra space around the last <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />. The things we had to do before <img src='http://l.wordpress.com/latex.php?latex=%5CTeX&#038;bg=ffffff&#038;fg=333333&#038;s=0' alt='\TeX' title='\TeX' class='latex' />.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>DFS:</i> This is the famous paper by Robert Tarjan, no Endre as he added later on, called: <a href="//rjlipton.files.wordpress.com/2009/10/dfs1971.pdf">Depth-First Search And Linear Graph Algorithms</a>. Tarjan used a special font on his typed papers that we could all tell was from Stanford. Note also he calls it a &#8220;working paper.&#8221;</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>Boolean Matrix Product:</i> This is a neat paper by Mike Fischer and Albert Meyer called: <a href="http://rjlipton.files.wordpress.com/2009/10/matrix1971.pdf">Boolean Matrix Multiplication and Transitive Closure</a>. In this paper they showed that fast matrix multiplication can be used to compute boolean matrix products: also they count bit operations not arithmetic operations. </p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>Description Size:</i> This is another neat paper by Mike and Albert, although this time it&#8217;s a &#8220;Meyer and Fischer&#8221; paper called: <a href="http://rjlipton.files.wordpress.com/2009/10/sizefsa1971.pdf">Economy of Description By Automata, Grammars, And Formal Systems</a>. The main results show that more powerful devices can tremendously affect the size of the description of a set. For example, there is an exponential gap between deterministic finite state automata and non-deterministic automata; even more impressive there is a <i>double</i> exponential gap between deterministic finite automata and pushdown automata. I believe I was asked about this earlier and this is the place to look for such results.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>Real Computation:</i> This is a paper that perhaps should be better known. It is by Fred Abramson and is called: <a href="http://rjlipton.files.wordpress.com/2009/10/reals1971.pdf">Effective Computation Over The Real Numbers</a>. Fred extends Turing Machines to have values that are real numbers. This paper was probably a bit ahead of its time; I will discuss the <i>real</i> P=NP question of Lenore Blum, Michael Shub and Stephen Smale in the future. </p>
<p>
<p><b> Open Problems </b></p>
<p><p>
I hope the FOCS meeting continues to be an important meeting for another fifty years and then fifty more. I do wonder what topics will be important for FOCS 2019 or FOCS 2029? Here are possible papers that could appear in the FOCS 2019:</p>
<p><ol>
<li> <i>Cryptography Without Assumptions:</i> by Alice Brown, Bob Green, John Lime, Hila Mustard, Lincoln Violet, Chuck Yellow, <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cdots%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\dots}' title='{\dots}' class='latex' />
<li> <i>An <img src='http://l.wordpress.com/latex.php?latex=%7B+%5Csqrt+n+%28%5Clog+n+%2F%5Clog%5Clog%5Clog+n%29%5E%7B3.1+%2B+%5Cepsilon%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{ \sqrt n (\log n /\log\log\log n)^{3.1 + \epsilon}}' title='{ \sqrt n (\log n /\log\log\log n)^{3.1 + \epsilon}}' class='latex' /> Space <img src='http://l.wordpress.com/latex.php?latex=%7B2+%2B+%5Cepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2 + \epsilon}' title='{2 + \epsilon}' class='latex' /> Approximation Algorithm in the River-Streaming Model for the iBrain Optimization Problem:</i> by Carol Smith and Fred Thomas;
<li> <i>The Extended Generalized Riemann Hypothesis Implies The Scott Quantum Hierarchy Collapses:</i> by Ryan Browodski and Sue Lin-Jun.
</ol>
<p> What do you think? What are your predictions?</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3731/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3731/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3731/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3731/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3731/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3731/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3731/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3731/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3731/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3731/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3731&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/18/happy-fiftieth-birthday-focs/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images-2.jpeg" medium="image">
			<media:title type="html">images-2</media:title>
		</media:content>

		<media:content url="http://img.youtube.com/vi/dePMU8R131s/2.jpg" medium="image" />

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images-31.jpeg" medium="image">
			<media:title type="html">images-3</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images6.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/picture-1.png" medium="image">
			<media:title type="html">Picture 1</media:title>
		</media:content>
	</item>
		<item>
		<title>Multi-Party Protocols and FOCS 2009</title>
		<link>http://rjlipton.wordpress.com/2009/10/15/multi-party-protocols-and-focs-2009/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/15/multi-party-protocols-and-focs-2009/#comments</comments>
		<pubDate>Thu, 15 Oct 2009 22:29:07 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[amplification]]></category>
		<category><![CDATA[concentration mean]]></category>
		<category><![CDATA[FOCS]]></category>
		<category><![CDATA[multi-party]]></category>
		<category><![CDATA[protocols]]></category>
		<category><![CDATA[random]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3692</guid>
		<description><![CDATA[
 Multi-party protocols, bounded width programs, concentration theorems, and FOCS 2009 



Ashok Chandra is a famous theorist who has done seminal work in the many different areas. His early work was in program schema, an area that was central to theory in the 1970&#8217;s. Chandra then moved into other areas of theory and among other [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3692&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em> Multi-party protocols, bounded width programs, concentration theorems, and FOCS 2009 </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/images5.jpeg?w=82&#038;h=103" alt="images" title="images" width="82" height="103" class="alignright size-full wp-image-3695" /></p>
<p>
Ashok Chandra is a famous theorist who has done seminal work in the many different areas. His early work was in <a href="http://portal.acm.org/citation.cfm?id=803879">program schema</a>, an area that was central to theory in the 1970&#8217;s. Chandra then moved into other areas of theory and among other great things co-invented the notion of Turing Machine Alternation. </p>
<p>
Later in his career, while still an active researcher, he began to manage research groups: first a small one at IBM Yorktown, then a much larger one at IBM Almaden, and later groups at other major companies including Microsoft. As a manager, one example of his leadership was the invention by his group at Almaden of that little trackpoint device that plays the role of a mouse. The trackpointer eventually appeared on the keyboards of IBM&#8217;s Thinkpad laptops&#8212;now called Lenovo laptops&#8212;a wonderful example of successful technology transfer.</p>
<p>
Today I want to talk about mutli-party protocols, an area that Chandra helped create in the 1980&#8217;s. Then, I will connect this work with a new paper that is about to appear in FOCS in days. This is the <img src='http://l.wordpress.com/latex.php?latex=%7B50%5E%7Bth%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{50^{th}}' title='{50^{th}}' class='latex' /> FOCS and I am looking forward to attending it, even though I will not get any Delta miles for this one&#8212;it is in Atlanta.<br />
<span id="more-3692"></span></p>
<p>
I still remember vividly meeting Ashok for the first time: it was at the Seattle airport in 1974 at the taxi stand. We were both on our way to the sixth STOC conference&#8212;it was my first STOC. I knew of Ashok, since I had read his beautiful papers on program schema theory. Somehow we figured out that we were both going to the same hotel and we shared a cab. Our conversation in the cab made me feel I was welcome as a new member of the theory community: I have never forgotten his kindness.</p>
<p>
At that STOC Chandra presented some of his recent work on program schema, &#8220;Degrees of Translatability and Canonical Forms in Program Schemas: Part I.&#8221; Also presented at the <a href="http://www.informatik.uni-trier.de/~ley/db/conf/stoc/stoc74.html">conference</a> were these results, among others: </p>
<ul>
<li> Leslie Valiant: <i>The Decidability of Equivalence for Deterministic Finite-Turn Pushdown Automata.</i>
<li> John Gill: <i>Computational Complexity of Probabilistic Turing Machines.</i>
<li> Vaughan Pratt, Michael Rabin, Larry Stockmeyer: <i>A Characterization of the Power of Vector Machines.</i>
<li> Stephen Cook, Robert Reckhow: <i>On the Lengths of Proofs in the Propositional Calculus.</i>
<li> Robert Endre Tarjan: <i>Testing Graph Connectivity.</i>
<li> Allan Borodin, Stephen Cook: <i>On the Number of Additions to Compute Specific Polynomials.</i>
<li> David Dobkin, Richard Lipton: <i>On Some Generalizations of Binary Search.</i>
</ul>
<p> The topics then were very different from today: then, there was no quantum theory, no game theory, no learning theory, and many complexity classes had yet to be invented. Yet many of the topics are still being studied today&#8212;perhaps a discussion for another day.</p>
<p>
Let&#8217;s now turn to multi-party protocols and eventually to concentration theorems. </p>
<p>
<p><b> Multi-Party Protocols </b></p>
<p><p>
Chandra, Merrick Furst, and I discovered the basic idea of multi-party protocols because we <a href="http://rjlipton.wordpress.com/2009/09/27/surprises-in-mathematics-and-theory/">guessed wrong</a>. We were working on proving lower bounds on bounded width branching programs, when we saw that a lower bound on multi-party protocols would imply a non-linear lower bound on the length of such programs.</p>
<p>
Let me back up, one summer Merrick Furst and I were invited to spend some time at IBM Yorktown by Ashok. The three of us started to try to prove a lower bound on the size of bounded width programs. </p>
<p>
A <i>bounded width program</i> operates as follows. Suppose that the input is <img src='http://l.wordpress.com/latex.php?latex=%7Bx_%7B1%7D%2C%5Cdots%2Cx_%7Bn%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x_{1},\dots,x_{n}}' title='{x_{1},\dots,x_{n}}' class='latex' />. I like to think of a bounded width program as a series of &#8220;boxes.&#8221; Each one takes a state from the left, reads an input <img src='http://l.wordpress.com/latex.php?latex=%7Bx_%7Bi%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x_{i}}' title='{x_{i}}' class='latex' /> which is the same for the box each time, and passes a state to the right. The first box is special and gets a start state from the left; the last box is special too and passes the final state to the right. The total number of states is fixed independent of the length of the computation. Note, also while the boxes read the same input each time, an input can be read by different boxes. The length, the number of boxes, is the size of the bounded width program.</p>
<p>
The reason this model is called bounded width is that the number of states is fixed independent of the number of inputs. The power of the model comes from the ability to read inputs more than once. Note, with that ability the model could not compute even some very simple functions. </p>
<p>
We guessed that there should be simple problems that required length that is super-linear in the number of inputs, <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' />. This guess was right. What we got completely wrong, like most others, was that we thought the length might even be super-polynomial for some simple functions. David Barrington&#8217;s famous <a href="http://www.cs.umass.edu/~barring/publications/bwbp.pdf">theorem</a> proved that this was false,</p>
<blockquote><p><b>Theorem: </b> <em> For any boolean function in <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmathsf%7BNC%7D%5E%7B1%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\mathsf{NC}^{1}}' title='{\mathsf{NC}^{1}}' class='latex' />, the length of a bounded width program for that function is polynomial. </em></p></blockquote>
<p><p>
The connection between bounded width programs and protocols is simple. Suppose that there are <img src='http://l.wordpress.com/latex.php?latex=%7B2n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2n}' title='{2n}' class='latex' /> boxes, which we wish to show is too few to compute some function. Divide the boxes into three groups: <img src='http://l.wordpress.com/latex.php?latex=%7BL%2CM%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{L,M}' title='{L,M}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%7BR%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{R}' title='{R}' class='latex' />. The boxes in <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' /> are the first <img src='http://l.wordpress.com/latex.php?latex=%7B2n%2F3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2n/3}' title='{2n/3}' class='latex' /> boxes, the boxes in <img src='http://l.wordpress.com/latex.php?latex=%7BM%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{M}' title='{M}' class='latex' /> are the next <img src='http://l.wordpress.com/latex.php?latex=%7B2n%2F3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2n/3}' title='{2n/3}' class='latex' /> boxes, and <img src='http://l.wordpress.com/latex.php?latex=%7BR%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{R}' title='{R}' class='latex' /> are the last <img src='http://l.wordpress.com/latex.php?latex=%7B2n%2F3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2n/3}' title='{2n/3}' class='latex' />&#8212;<em>I fixed a typo here thanks to Andy D</em>. Think of three players <img src='http://l.wordpress.com/latex.php?latex=%7BL%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{L}' title='{L}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%7BM%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{M}' title='{M}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%7BR%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{R}' title='{R}' class='latex' />. They each do not have enough bits to determine the function if it depends on all the bits. Yet any two players together have enough bits, possibly, to determine the function: this follows because,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++2n%2F3+%2B+2n%2F3+%3E+n.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  2n/3 + 2n/3 &gt; n.' title='\displaystyle  2n/3 + 2n/3 &gt; n.' class='latex' /></p>
<p> Thus, we needed to somehow argue that they needed to move more than a fixed number of bits from one to another to compute the given function. This was the hard part. </p>
<p>
IBM Yorktown is in the middle of nowhere. Well it is in the middle of a beautiful area, but then there were no lunch places nearby. So every day we ate in the IBM cafeteria, which was a fine place to eat. However, one day Merrick and I decided to take off and have lunch somewhere else. We had a long lunch &#8220;hour.&#8221; A really long hour. When we got back to IBM Ashok was a bit upset: where have we been? Luckily a theory talk was just about to start and we all went to hear the talk. </p>
<p>
Talks are a great place to think. There is no interruption, which was especially true back then since there were no wireless devices. I used the time to try to put together all the pieces that we had discovered. Suddenly I saw a new idea that might be the step we were missing. </p>
<p>
As soon as the talk was over I outlined the idea that I had to Ashok and Merrick. They were excited, and to my relief all was forgotten about the long lunch. Working that afternoon we soon had a complete outline of the proof. There were, as usual, lots of details to work out, but the proof would eventually all work. I still think we made progress on this result because of the long lunch and the quiet time of the talk. The <a href="http://portal.acm.org/citation.cfm?id=808737&amp;dl=GUIDE&amp;coll=GUIDE&amp;CFID=56446803&amp;CFTOKEN=27116187">result</a> appeared in STOC 1983.</p>
<p>
We were excited that we had a lower bound, but we missed two boats. Missing one is pretty bad, but two was tough. The first was our protocol lower bound : this was later vastly improved by much better technology, and the second was that we missed thinking about upper bounds. I do not know if we would have found Barrington&#8217;s beautiful result; however, we never even thought about upper bounds. Oh well.</p>
<p>
<p><b> Concentration Results </b></p>
<p><p>
Let&#8217;s jump forward to the present. There is a terrific book, published this year, by Devdatt Dubhashi and Alessandro Panconesi called <a href="http://www.dsi.uniroma1.it/~ale/Papers/master.pdf">Concentration of Measure for the Analysis of Randomized Algorithms</a>. I just got a copy and it is beautifully written, contains all the major concentration results, and is a must to have on your desk. </p>
<p>
Of course research stops for no one, and there are two beautiful papers that contain new concentration theorems that will appear in the next FOCS. I am sure they will be included in the second edition of Dubhashi and Panconesi. </p>
<p>
The first paper is: <a href="http://arxiv.org/abs/0809.2477">A Probability inequality using typical moments and Concentration Results</a> by Ravi Kannan. The second paper is: <a href="http://www.eccc.uni-trier.de/report/2009/078/">A Probabilistic Inequality with Applications to Threshold Direct-product Theorems</a> by Falk Unger. </p>
<p>
The theme of both these papers, and the book, is to prove theorems that show that a sum of random variables is likely to be near its mean. More precisely, if
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++X_%7B1%7D%2C+%5Cdots%2C+X_%7Bn%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  X_{1}, \dots, X_{n} ' title='\displaystyle  X_{1}, \dots, X_{n} ' class='latex' /></p>
<p> are random variables, a <i>concentration theorem</i> tries to get sharp bounds on
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cmathsf%7BProb%7D+%5Cleft+%5B+%5Cmid+%5Csum_%7Bi%3D1%7D%5E%7Bn%7D+%28X_%7Bi%7D+-+E%28X_%7Bi%7D%29%29+%5Cmid+%5Cge+t+%5Cright+%5D.+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \mathsf{Prob} \left [ \mid \sum_{i=1}^{n} (X_{i} - E(X_{i})) \mid \ge t \right ]. ' title='\displaystyle  \mathsf{Prob} \left [ \mid \sum_{i=1}^{n} (X_{i} - E(X_{i})) \mid \ge t \right ]. ' class='latex' /></p>
<p>
In general a sum of random variables will not have concentration unless there is some additional property. To see this just consider the case where they all are <img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> if a fair coin is heads and <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> otherwise. Clearly, this sum is <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' /> or <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> and does not satisfy any meaningful concentration theorem. </p>
<p>
One approach to get a concentration theorem is to assume that the random variables are independent, but this is often too restrictive for many applications. Look at their book for the myriad number of ways to get concentration theorems. </p>
<p>
Another approach is to assume that the random variables satisfy certain constraints on the conditional expectations. This is what Ravi is able to improve, since the usual theorems assume &#8220;worst-case&#8221; bounds on these conditional expectations. Ravi can weaken these to &#8220;average&#8221; bounds. I think this is a brilliant insight&#8212;one that could have far reaching consequences. For example, Ravi can prove: (It is a corollary in his paper.) </p>
<blockquote><p><b>Theorem: </b> <em> Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7BX_%7B1%7D%2CX_%7B2%7D%2C%5Cdots%2CX_%7Bn%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{X_{1},X_{2},\dots,X_{n}}' title='{X_{1},X_{2},\dots,X_{n}}' class='latex' /> are real-valued random variables satisfying the SNC condition. Let <img src='http://l.wordpress.com/latex.php?latex=%7Bt%3E0%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{t&gt;0}' title='{t&gt;0}' class='latex' />. Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7B%5Csigma%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\sigma}' title='{\sigma}' class='latex' /> is a positive real with
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++E%5BX_%7Bi%7D%5E%7B2l%7D+%5Cmid+X_%7B1%7D+%2B+%5Ccdots+X_%7Bi-1%7D%5D+%5Cle+%5Csigma%5E%7B2l%7D%28n%2Fp%29%5E%7Bl-1%7Dl%5E%7B.9l%7D+&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  E[X_{i}^{2l} \mid X_{1} + \cdots X_{i-1}] \le \sigma^{2l}(n/p)^{l-1}l^{.9l} ' title='\displaystyle  E[X_{i}^{2l} \mid X_{1} + \cdots X_{i-1}] \le \sigma^{2l}(n/p)^{l-1}l^{.9l} ' class='latex' /></p>
<p>
for all <img src='http://l.wordpress.com/latex.php?latex=%7Bl%3D1%2C2%2C%5Cdots%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{l=1,2,\dots}' title='{l=1,2,\dots}' class='latex' />. There is an absolute positive constant <img src='http://l.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' /> such that
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++Pr+%5Cleft+%28+%5Cmid+%5Csum_%7Bi%3D1%7D%5E%7Bn%7D+X_i+%5Cmid+%5Cge+t+%5Cright+%29+%5Cle+4e%5E%7B-w%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  Pr \left ( \mid \sum_{i=1}^{n} X_i \mid \ge t \right ) \le 4e^{-w}' title='\displaystyle  Pr \left ( \mid \sum_{i=1}^{n} X_i \mid \ge t \right ) \le 4e^{-w}' class='latex' /></p>
<p> where <img src='http://l.wordpress.com/latex.php?latex=%7Bw+%3D+%5Cfrac%7Bt%5E%7B2%7D%7D%7Bcn%5Csigma%5E%7B2%7D%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{w = \frac{t^{2}}{cn\sigma^{2}}}' title='{w = \frac{t^{2}}{cn\sigma^{2}}}' class='latex' />. </em></p></blockquote>
<p> See his full paper for what SNC is and the role of <img src='http://l.wordpress.com/latex.php?latex=%7Bp%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p}' title='{p}' class='latex' /> in that condition.</p>
<p>
<p><b> Multi-Party Protocols Again </b></p>
<p><p>
Unger&#8217;s concentration paper is quite interesting also. I will not be able to do it justice so again please take a look at the full paper. His result has to do with results that are quite close to multi-party protocols. In particular, he has an explicit application to this area. </p>
<p>
His theorem on protocols is quite technical, but the flavor is an amplification type result. Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7Bq%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{q}' title='{q}' class='latex' /> players can compute some boolean function <img src='http://l.wordpress.com/latex.php?latex=%7Bf%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{f(x)}' title='{f(x)}' class='latex' /> with probability <img src='http://l.wordpress.com/latex.php?latex=%7Bp%3D1%2F2%2B%5Cepsilon%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p=1/2+\epsilon}' title='{p=1/2+\epsilon}' class='latex' />. Then, he can bound how well they can compute multiple independent copies
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++f%28x%5E%7B%281%29%7D%29%2Cf%28x%5E%7B%282%29%7D%29%2C%5Cdots%2Cf%28x%5E%7B%28m%29%7D%29&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  f(x^{(1)}),f(x^{(2)}),\dots,f(x^{(m)})' title='\displaystyle  f(x^{(1)}),f(x^{(2)}),\dots,f(x^{(m)})' class='latex' /></p>
<p> where the inputs are all independent uniform random bits. Then, he can get exponential bounds on how well a protocol can do even if it only has to get a majority of the functions right.</p>
<blockquote><p><b>Theorem: </b> <em> Let <img src='http://l.wordpress.com/latex.php?latex=%7Bf%3A%5C%7B0%2C1%5C%7D%5E%7Bn+%5Ctimes+q%7D+%5Crightarrow+%5C%7B-1%2C%2B1%5C%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{f:\{0,1\}^{n \times q} \rightarrow \{-1,+1\}}' title='{f:\{0,1\}^{n \times q} \rightarrow \{-1,+1\}}' class='latex' /> be a function such that no <img src='http://l.wordpress.com/latex.php?latex=%7Bq%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{q}' title='{q}' class='latex' />-party protocol which also uses <img src='http://l.wordpress.com/latex.php?latex=%7Bq%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{q}' title='{q}' class='latex' /> bits of communication can compute <img src='http://l.wordpress.com/latex.php?latex=%7Bf%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{f}' title='{f}' class='latex' /> better than with probability <img src='http://l.wordpress.com/latex.php?latex=%7B1%2F2+%2B+%5Cepsilon%2F2%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{1/2 + \epsilon/2}' title='{1/2 + \epsilon/2}' class='latex' />, for <img src='http://l.wordpress.com/latex.php?latex=%7B0+%5Cle+%5Cepsilon+%5Cle+1%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{0 \le \epsilon \le 1}' title='{0 \le \epsilon \le 1}' class='latex' />, when the inputs are chosen uniformly at random. Chose <img src='http://l.wordpress.com/latex.php?latex=%7B%5Clambda%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\lambda}' title='{\lambda}' class='latex' /> such that <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cepsilon+%5Cle+%5Clambda%5E%7B2%5E%7Bq%7D%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\epsilon \le \lambda^{2^{q}}}' title='{\epsilon \le \lambda^{2^{q}}}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7B%5Clambda+%5Cle+1%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\lambda \le 1}' title='{\lambda \le 1}' class='latex' />. Then for any <img src='http://l.wordpress.com/latex.php?latex=%7Bq%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{q}' title='{q}' class='latex' />-party protocol <img src='http://l.wordpress.com/latex.php?latex=%7B%5Ccal+P%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\cal P}' title='{\cal P}' class='latex' /> with uniformly random inputs <img src='http://l.wordpress.com/latex.php?latex=%7Bx%5E%7B1%2C1%7D%2C%5Cdots%2Cx%5E%7Bq%2Ck%7D+%5Cin+%5C%7B0%2C1%5C%7D%5E%7Bn%7D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{x^{1,1},\dots,x^{q,k} \in \{0,1\}^{n}}' title='{x^{1,1},\dots,x^{q,k} \in \{0,1\}^{n}}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> output bits, which uses <img src='http://l.wordpress.com/latex.php?latex=%7Bc%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{c}' title='{c}' class='latex' /> bits of communication, it holds: The probability that the output of <img src='http://l.wordpress.com/latex.php?latex=%7B%5Ccal+P%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\cal P}' title='{\cal P}' class='latex' /> agrees with <img src='http://l.wordpress.com/latex.php?latex=%7Bf%28x%5E%7B1%2C1%7D%2C%5Cdots%2Cx%5E%7Bq%2C1%7D%29%5Cdots+f%28x%5E%7B1%2Ck%7D%2C%5Cdots%2Cx%5E%7Bq%2Ck%7D%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{f(x^{1,1},\dots,x^{q,1})\dots f(x^{1,k},\dots,x^{q,k})}' title='{f(x^{1,1},\dots,x^{q,1})\dots f(x^{1,k},\dots,x^{q,k})}' class='latex' /> on at least <img src='http://l.wordpress.com/latex.php?latex=%7B%281%2F2+%2B+%5Clambda%2F2+%29k%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{(1/2 + \lambda/2 )k}' title='{(1/2 + \lambda/2 )k}' class='latex' /> positions is at most
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++2%5E%7Bc%7De%5E%7B-w%7D.&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  2^{c}e^{-w}.' title='\displaystyle  2^{c}e^{-w}.' class='latex' /></p>
<p> </em></p></blockquote>
<p> Here the term <img src='http://l.wordpress.com/latex.php?latex=%7Bw%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{w}' title='{w}' class='latex' /> is order <img src='http://l.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> times certain other terms. The point is that their protocol cannot easily compute the function multiple times without using many bits of communication. </p>
<p>
<p><b> Open Problems </b></p>
<p><p>
One open question is to find additional applications of these new concentration theorems. Can the average idea of Kannan work in other situations? Are there further concentration theorems yet to be discovered?</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3692/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3692/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3692/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3692/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3692/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3692/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3692/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3692/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3692/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3692/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3692&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/15/multi-party-protocols-and-focs-2009/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images5.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>
	</item>
		<item>
		<title>Magical Results and P=NP</title>
		<link>http://rjlipton.wordpress.com/2009/10/11/magical-results-and-pnp/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/11/magical-results-and-pnp/#comments</comments>
		<pubDate>Sun, 11 Oct 2009 20:03:47 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[P=NP]]></category>
		<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[cryptography]]></category>
		<category><![CDATA[Kruskal Count]]></category>
		<category><![CDATA[zero knowledge]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3678</guid>
		<description><![CDATA[
 Which is more likely: intractable problems or magical results?



Martin Kruskal was a famous mathematician who won many awards for his work in diverse areas of applied mathematics. He is perhaps best known, among his peers, for his important joint work with Norman Zabusky; work which helped start the soliton revolution. He is also known [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3678&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em> Which is more likely: intractable problems or magical results?</em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/kruskal.jpg?w=100" alt="kruskal" title="kruskal" width="100" class="alignright size-full wp-image-3680" /></p>
<p>
Martin Kruskal was a famous mathematician who won many awards for his work in diverse areas of applied mathematics. He is perhaps best known, among his peers, for his important joint work with Norman Zabusky; work which helped start the <a href="http://www.fas.org/sgp/othergov/doe/lanl/pubs/00285946.pdf">soliton revolution</a>. He is also known among magicians, for a brilliant illusion that is called the &#8220;Kruskal Count.&#8221; See this <a href="http://www.math.rutgers.edu/docs/kruskal/">memorial page</a> about Kruskal for a list of his accomplishments.</p>
<p>
Today I want to talk about some other magical aspects of mathematics. One of the great things about mathematics is that often there are results that seem to be &#8220;impossible.&#8221; Magical.<br />
<span id="more-3678"></span></p>
<p>
I was at Princeton&#8217;s Computer Science department at the same time that Kruskal was in their Mathematics department. I still remember making a &#8220;small&#8221; contribution to Princeton&#8217;s attempt to lure Michael Rabin away from Harvard. As is often the case when recruiting a senior super-star like Rabin, they had a group dinner at a quite good local Chinese restaurant. Since I knew Michael well I was invited to join them. </p>
<p>
We had our own private room at the restaurant. It was a lovely meal: delicious food and great conversation. When the dinner ended, after we all had read our fortune cookies, Michael and others stood up and began to say their goodnights to each other.</p>
<p>
I was still sitting next to Martin, who was in charge of our dinner. He proceeded to pull out a hand-calculator, and started to figure out what we each owed as our share of the bill. I was a bit surprised, since in computer science the tradition was to have the department pay for the whole dinner&#8212;after all it was an official recruiting function. But each department has its own rules. </p>
<p>
Martin started to calculate, after tip, what each of the group of twelve owed for their dinner. I started to get my wallet out so I could pay my share, when I realized that Martin had included Rabin as one who would be paying. I gently suggested to Martin that perhaps we could treat our &#8220;guest&#8221; and divide not by 12 but by 11. Martin liked this idea very much, and re-calculated the new figure that we all owed.</p>
<p>
Rabin never did accept an offer to come to Princeton, but I like to think that the switch to dividing by 11 instead of 12 could not have hurt. </p>
<p>
<p><b> Kruskal&#8217;s Count </b></p>
<p><p>
Here is the trick, or the illusion as my magician friends prefer to say.</p>
<p>
The <a href="http://threesixty360.wordpress.com/2007/12/04/mind-reading-and-card-tricks-the-kruskal-count/">trick</a> uses an ordinary deck of cards. Each card has a value: an ace is <img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />, all picture cards are <img src='http://l.wordpress.com/latex.php?latex=%7B5%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{5}' title='{5}' class='latex' />, and the rest have their face value. Thus, a king of diamonds is <img src='http://l.wordpress.com/latex.php?latex=%7B5%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{5}' title='{5}' class='latex' /> and a three of spades is <img src='http://l.wordpress.com/latex.php?latex=%7B3%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{3}' title='{3}' class='latex' />. The deck is shuffled, really shuffled. No trick there.</p>
<p>
You are the magician and let Paul be the person you are trying to impress with your mind reading abilities. You tell Paul to pick a secret number <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> that is between <img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7B10%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{10}' title='{10}' class='latex' /> and to not tell you the number. Then, as you deal the cards, one by one, you ask Paul to do a simple count: For example, if his secret number was <img src='http://l.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' />, then as you show the cards Paul should count to <img src='http://l.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> and see what the value of the second card is. That is his new secret number. He then does the same thing as the next cards are revealed. At all times he is counting and replacing his secret with a new secret. Paul does all of this in his head, keeping his secrets to himself.</p>
<p>
When the deck runs out, you think hard. Then, you announce the secret that is in Paul&#8217;s head. Amazing.</p>
<p>
<p><b> How It Works </b></p>
<p><p>
The critical part of the illusion is that you cannot guess what Paul&#8217;s original number <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> was. That would be real mind-reading. If you can do that, then you should head directly to Las Vegas and play high-stakes poker. </p>
<p>
The key is that you, the magician, pick a number <img src='http://l.wordpress.com/latex.php?latex=%7By%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{y}' title='{y}' class='latex' />, and do the same process as Paul does. The brilliant insight of Kruskal is that there is a good chance that at some point you and Paul will select the exact same card. Once this happens, then you and Paul are in synch and will continue to remain in synch. So at the end you just announce what your secret is and it will be the same as Paul&#8217;s with about <img src='http://l.wordpress.com/latex.php?latex=%7B85%5C%25%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{85\%}' title='{85\%}' class='latex' /> probability. Very neat.</p>
<p>
<p><b> Magical Results </b></p>
<p><p>
There are many other magical results throughout theory. I plan a longer post on a list of my favorite ones. For now I want to discuss one of the great discoveries of Computer Science, that is one can turn &#8220;lemons into lemonade.&#8221; If there are hard functions, &#8220;the lemons,&#8221; that cannot be computed efficiently, then magical things can happen, the &#8220;lemonade.&#8221; Turning lemons into lemonade is one of the key insights of modern cryptography. </p>
<p>
I will not even begin to attempt to give a survey of the many great results in this area, but I do have a point that is related to my favorite question: &#8220;can P=NP be true?&#8221;</p>
<p>
Close you eyes for a moment, and then take a deep breath. I am going to ask you to imagine two different scenarios:</p>
<p>
<b>In the first one</b>, suppose Alicex is an alien from Mars and has just arrived on Earth. She knows some mathematics, but no modern complexity theory or cryptography&#8212;Mars has not yet been able to develop it. She is told that the following two things are possible here on Earth: </p>
<ol>
<li> She can send a perfectly secret message back to her friends on Mars; even though Alicex has no secret bits in common with her friends back there. On Mars they have one-time pads, but nothing better for sending secret messages. <i>Alicex is surprised</i>.
<li> She is also told that we, I mean Andrew Wiles, has been able to prove Fermat&#8217;s Last Theorem:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle+x%5E%7Bn%7D+%2B+y%5E%7Bn%7D+%3D+z%5E%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle x^{n} + y^{n} = z^{n}' title='\displaystyle x^{n} + y^{n} = z^{n}' class='latex' /></p>
<p> with <img src='http://l.wordpress.com/latex.php?latex=%7Bn%3E2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n&gt;2}' title='{n&gt;2}' class='latex' /> implies that <img src='http://l.wordpress.com/latex.php?latex=%7Bxyz%3D0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{xyz=0}' title='{xyz=0}' class='latex' />. Of course Alicex calls this problem something else: the &#8220;Gozzetx Problem.&#8221; It is named after the famous Martian mathematician Fredx Gozzetx, whose elegant proof that every planar graph has a four coloring was one of the great achievements of early Martian mathematics. Every Martian high school student knows his famous proof.</p>
<p>
This is a welcome news. But then Alicex is told that she can convince her friends back home on Mars that she knows a proof of this great theorem. And she can do this without giving away the actual proof. This is very neat, since there is a <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cnot%7Bo%7D1%2C000%2C000%2C000%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\not{o}1,000,000,000}' title='{\not{o}1,000,000,000}' class='latex' /> prize in Martian money for a proof. She trusts her friends, but that&#8217;s a lot of money; even with their recent inflation <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cnot%7Bo%7D100%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\not{o}100}' title='{\not{o}100}' class='latex' /> is still enough to buy what we call a mocha latte. <i>Alicex is surprised</i>.
</ol>
<p>
<b>In the second one</b>, suppose that Bobx is also an alien from Mars and has just arrived on Earth. Bobx like Alicex knows some mathematics, but no modern complexity theory or cryptography. We tell Bobx that the following is possible here on Earth: there is an algorithm that can factor any integer in polynomial time. <i>Bobx is surprised</i>. </p>
<p>
<p><b> Open Problems </b></p>
<p><p>
Which alien is more surprised? Alicex or Bobx?</p>
<p>
One of the reasons I have doubts about P<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cneq%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\neq}' title='{\neq}' class='latex' />NP is simple. I often wonder which is more surprising? That there could be an intelligent algorithm to solve a &#8220;hard&#8221; problem like factoring, or that there are magical results of modern cryptography? This is one reason I find the P=NP question so puzzling. Why is it so easy to believe that there is no clever algorithm for SAT, but that it is possible to do: </p>
<ul>
<li> Public-Key encryption?
<li> Signatures?
<li> Zero-knowledge proofs?
<li> And on and on?
</ul>
<p> These seems like magic to me. What do you think?</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3678/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3678/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3678/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3678/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3678/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3678/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3678/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3678/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3678/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3678/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3678&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/11/magical-results-and-pnp/feed/</wfw:commentRss>
		<slash:comments>25</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/kruskal.jpg" medium="image">
			<media:title type="html">kruskal</media:title>
		</media:content>
	</item>
		<item>
		<title>The Surprising Power of Rational Functions</title>
		<link>http://rjlipton.wordpress.com/2009/10/07/the-surprising-power-of-rational-functions/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/07/the-surprising-power-of-rational-functions/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 02:25:47 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[People]]></category>
		<category><![CDATA[Proofs]]></category>
		<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[closure]]></category>
		<category><![CDATA[matrix]]></category>
		<category><![CDATA[Newman]]></category>
		<category><![CDATA[polynomial]]></category>
		<category><![CDATA[PP]]></category>
		<category><![CDATA[rational function]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3655</guid>
		<description><![CDATA[
Newman&#8217;s theorem on rational approximations and complexity theory 



Donald Newman was not a theorist, but was a mathematician who worked on many topics during his career. One of his results is a lovely theorem that shows that the approximation of continuous functions by rational functions can be very different from the approximation by polynomials. 

Today [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3655&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em>Newman&#8217;s theorem on rational approximations and complexity theory </em><br />
<font color="”#000000?"></p>
<p><img src="http://rjlipton.files.wordpress.com/2009/10/images3.jpeg?w=149&#038;h=105" alt="images" title="images" width="149" height="105" class="alignright size-full wp-image-3658" /></p>
<p>
Donald Newman was not a theorist, but was a mathematician who worked on many topics during his career. One of his results is a lovely theorem that shows that the approximation of continuous functions by rational functions can be very different from the approximation by polynomials. </p>
<p>
Today I want to talk about this famous theorem, and discuss some applications to theory. If you do not know this theorem you may enjoy seeing it. Even if you do know it, you may enjoy seeing some thoughts I have on a potential application of this theorem.<br />
<span id="more-3655"></span></p>
<p>
I never had the honor of meeting Newman, but I just read a nice <a href="http://www.math.technion.ac.il/hat/newman.pdf">tribute</a> to him, while preparing this, which contained a number of interesting stories. One explained how he once introduced the great Paul Erd&ouml;s to a seminar audience in a &#8220;novel&#8221; way&#8212;you will have to read it yourself. Another was on Newman&#8217;s performance on the Putnam mathematics competition&#8212;he was clearly a very fast and powerful problem solver. I quote: </p>
<blockquote><p>
 &#8220;Ordinarily, six or seven solutions are sufficient for a student to win the competition. In his freshman year at CCNY, Newman won the Putnam with nine correct solutions. As a sophomore, he again won the competition with an unprecedented perfect 12 solutions, a fete (<em>sic</em>) he repeated in his junior year. He didn&#8217;t bother taking the exam as a senior.&#8221;
</p></blockquote>
<p>  I recall taking the Putnam exam as a freshman&#8212;I did okay&#8212;nothing like Newman. I did worse as a sophomore, even worse as a junior, and finally was thrown off the team as a senior. For me, the more mathematics I learned the worse I got. Oh well.</p>
<p>
I will <i>not</i> discuss whether or not the &#8220;world is digital or not.&#8221; I think the last post generated a lot of fun comments&#8212;many at my expense. I make mistakes and perhaps that post was one. If so I apologize to you all; I hope that the many comments, while most disagreed with me, were interesting to you. I somehow did not get my point across. So I will fold up my tent on that problem for a bit&#8212;no pun intended&#8212;and move on to Newman&#8217;s classic theorem. Wait, I now see how to explain my point <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cdots%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\dots}' title='{\dots}' class='latex' /> stop, I better not go there today.</p>
<p>
<p><b> Newman&#8217;s Theorem </b></p>
<p><p>
It has long been known that any continuous function on the interval <img src='http://l.wordpress.com/latex.php?latex=%7B%5B-1%2C1%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{[-1,1]}' title='{[-1,1]}' class='latex' />, or any compact interval will do, can be approximated arbitrarily well by a polynomial. There are many proofs of this fundamental fact, many refinements, and many generalizations. </p>
<p>
The function <img src='http://l.wordpress.com/latex.php?latex=%7B%7Cx%7C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{|x|}' title='{|x|}' class='latex' />, the absolute value of <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' />, while continuous on the interval <img src='http://l.wordpress.com/latex.php?latex=%7B%5B-1%2C1%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{[-1,1]}' title='{[-1,1]}' class='latex' /> is not easy to approximate by a polynomial. To get to an error bound of roughly <img src='http://l.wordpress.com/latex.php?latex=%7B1%2Ft%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/t}' title='{1/t}' class='latex' /> requires a polynomial of degree <img src='http://l.wordpress.com/latex.php?latex=%7B%5COmega%28t%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\Omega(t)}' title='{\Omega(t)}' class='latex' />. The reason, at least the intuition, is that the function <img src='http://l.wordpress.com/latex.php?latex=%7B%7Cx%7C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{|x|}' title='{|x|}' class='latex' /> has a sharp corner: at zero its derivative is undefined. This behavior makes it hard for a low degree polynomial to &#8220;fit&#8221; the absolute value function.</p>
<p>
What Newman did in 1962&#8212;the <a href="http://projecteuclid.org/DPubS?service=UI&amp;version=1.0&amp;verb=Display&amp;handle=euclid.mmj/1028999029">paper</a> was published in 1964&#8212;is to prove that one can do <i>exponentially</i> better with rational functions instead of polynomials in approximating the absolute value function. I am not an expert in approximation theory, but I believe that this result came as a surprise. See Herbert Stahl&#8217;s <a href="http://www.mrlonline.org/bull/1993-28-01/S0273-0979-1993-00351-3/S0273-0979-1993-00351-3.pdf">paper</a>, for instance. Another <a href="http://rjlipton.wordpress.com/2009/09/27/surprises-in-mathematics-and-theory/">surprise</a> in mathematics. </p>
<p>
Let&#8217;s now turn to Newman&#8217;s Theorem. We need some notation first. Let <img src='http://l.wordpress.com/latex.php?latex=%7Ba+%3D+%5Cexp%28-1%2F%7B%5Csqrt+n%7D%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{a = \exp(-1/{\sqrt n})}' title='{a = \exp(-1/{\sqrt n})}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Bp%28x%29+%3D+%5CPi_%7Bk%3D0%7D%5E%7Bn-1%7D%28a%5E%7Bk%7D%2Bx%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{p(x) = \Pi_{k=0}^{n-1}(a^{k}+x)}' title='{p(x) = \Pi_{k=0}^{n-1}(a^{k}+x)}' class='latex' />. Then, </p>
<blockquote><p><b>Theorem: </b> <em> For all <img src='http://l.wordpress.com/latex.php?latex=%7Bx+%5Cin+%5B-1%2C1%5D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{x \in [-1,1]}' title='{x \in [-1,1]}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Bn+%5Cge+5%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n \ge 5}' title='{n \ge 5}' class='latex' />,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cmid+%7Cx%7C+-+r_%7Bn%7D%28x%29+%5Cmid+%5Cle+3%5Cexp%28-%7B%5Csqrt+n%7D%29&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  \mid |x| - r_{n}(x) \mid \le 3\exp(-{\sqrt n})' title='\displaystyle  \mid |x| - r_{n}(x) \mid \le 3\exp(-{\sqrt n})' class='latex' /></p>
<p> where <img src='http://l.wordpress.com/latex.php?latex=%7Br_%7Bn%7D%28x%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{r_{n}(x)}' title='{r_{n}(x)}' class='latex' /> is the rational function,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++x+%5Cfrac%7Bp%28x%29-p%28-x%29%7D%7Bp%28x%29%2Bp%28-x%29%7D.&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  x \frac{p(x)-p(-x)}{p(x)+p(-x)}.' title='\displaystyle  x \frac{p(x)-p(-x)}{p(x)+p(-x)}.' class='latex' /></p>
<p> </em></p></blockquote>
<p>The theorem is remarkable in a number of respects: First, it shows that the degree of the approximating rational function can be much smaller than the degree of any approximating polynomial. Previously, to get an error bound of order <img src='http://l.wordpress.com/latex.php?latex=%7B1%2Ft%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/t}' title='{1/t}' class='latex' /> a polynomial had to have degree order <img src='http://l.wordpress.com/latex.php?latex=%7Bt%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{t}' title='{t}' class='latex' />; now the degrees of the numerator and denominator are only order <img src='http://l.wordpress.com/latex.php?latex=%7B%5Clog%5E%7B2%7D+t%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\log^{2} t}' title='{\log^{2} t}' class='latex' />. This is a huge decrease. Second, the rational function is not just constructive, but is expressible in a simple closed form. Very neat.</p>
<p>
Both of these are wonderful for applications&#8212;as we will discuss in the next section.</p>
<p>
<p><b> Applications </b></p>
<p><p>
I knew the Newman theorem back when I was a graduate student, yet I never could find an application of it to some theory problem. The theorem seemed to be perfect: great error bound and constructive. Yet I never found any &#8220;use&#8221; of his theorem.</p>
<p>
It took a great insight from Richard Beigel, Nick Reingold, and Daniel Spielman to realize that Newman&#8217;s theorem could be used to solve an important open problem from complexity theory. They showed in a brilliant <a href="http://knight.cis.temple.edu/~beigel/papers/brs-pp-jcss.html">paper</a> that the complexity class PP is closed under intersection and union. This is one of the great proofs in complexity theory. It is very sad that Reingold <a href="http://www.research.att.com/~dsj/algs/NickObit3.htm">passed</a> away about a year ago.</p>
<p>
Later Mitsunori Ogihara used similar ideas to prove that the PL &#8220;hierarchy&#8221; collapses. There are now other uses of Newman&#8217;s theorem: for example, it is used in parts of learning theory. However, I will not attempt an exhaustive survey of its applications.</p>
<p>
The PP result solved a well known open problem that had been raised as soon as PP was defined as a complexity class. Recall PP is the complexity class that contains sets that are accepted by a polynomial-time bounded probabilistic Turing machine that accepts with probability <img src='http://l.wordpress.com/latex.php?latex=%7B%3E0.5%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{&gt;0.5}' title='{&gt;0.5}' class='latex' />. It trivial that this class is closed under complements, but closure under intersection/union is not obvious at all. </p>
<p>
<p><b> The PP Proof </b></p>
<p><p>
Here is a high level view of their proof. They need to detect the sign of an integer: the sign of <img src='http://l.wordpress.com/latex.php?latex=%7B-11%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-11}' title='{-11}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%7B-1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{-1}' title='{-1}' class='latex' /> and the sign of <img src='http://l.wordpress.com/latex.php?latex=%7B123%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{123}' title='{123}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%7B1%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1}' title='{1}' class='latex' />. The problem is that they need to do this with a low degree polynomial, which is not obvious how to do it. Newman&#8217;s theorem comes to the rescue.</p>
<p>
Suppose that there is a rational function <img src='http://l.wordpress.com/latex.php?latex=%7Br%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{r(x)}' title='{r(x)}' class='latex' /> that approximates <img src='http://l.wordpress.com/latex.php?latex=%7B%7Cx%7C%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{|x|}' title='{|x|}' class='latex' /> well, which of course there is by Newman&#8217;s theorem. Then, for <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> not too near <img src='http://l.wordpress.com/latex.php?latex=%7B0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{0}' title='{0}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%7Br%28x%29%2Fx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{r(x)/x}' title='{r(x)/x}' class='latex' /> must well approximate the sign of <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' />. The problem is that their Turing machine can &#8220;compute&#8221; polynomials, but not rational functions&#8212;they cannot divide in their model. So they use the following cool, but trivial, trick:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++a%2Fb+%3E+0+%5Ctext%7B+if+and+only+if+%7D+ab+%3E+0.&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  a/b &gt; 0 \text{ if and only if } ab &gt; 0.' title='\displaystyle  a/b &gt; 0 \text{ if and only if } ab &gt; 0.' class='latex' /></p>
<p>
There is much more to their proof, but this is the key use of Newman&#8217;s theorem. They do need to modify his function to one that works over the integers in an exponential size range. Here are the polynomials they use; note, as is often the case in computer science not only degree but the size of coefficients is also important. In particular they use the following polynomials:
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cbegin%7Barray%7D%7Brcl%7D++%09P_%7Bn%7D%28x%29+%26%3D%26+%28x-1%29%5Cdisplaystyle%7B%5Cprod_%7Bi%3D1%7D%5E%7Bn%7D%28x-2%5E%7Bi%7D%29%5E%7B2%7D%7D%2C+%5C%5C+%09S_%7Bn%7D%28x%29+%26%3D%26+%5Cfrac%7BP_%7Bn%7D%28-x%29+-+P_%7Bn%7D%28x%29%7D%7BP_%7Bn%7D%28-x%29+%2B+P_%7Bn%7D%28x%29%7D%2C+%5C%5C+%09A_%7Bn%7D%28x%2Cy%29+%26%3D%26+S_%7Bn%7D%28x%29+%2B+S_%7Bn%7D%28y%29+-1.+%5Cend%7Barray%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{array}{rcl}  	P_{n}(x) &amp;=&amp; (x-1)\displaystyle{\prod_{i=1}^{n}(x-2^{i})^{2}}, \\ 	S_{n}(x) &amp;=&amp; \frac{P_{n}(-x) - P_{n}(x)}{P_{n}(-x) + P_{n}(x)}, \\ 	A_{n}(x,y) &amp;=&amp; S_{n}(x) + S_{n}(y) -1. \end{array} ' title='\displaystyle  \begin{array}{rcl}  	P_{n}(x) &amp;=&amp; (x-1)\displaystyle{\prod_{i=1}^{n}(x-2^{i})^{2}}, \\ 	S_{n}(x) &amp;=&amp; \frac{P_{n}(-x) - P_{n}(x)}{P_{n}(-x) + P_{n}(x)}, \\ 	A_{n}(x,y) &amp;=&amp; S_{n}(x) + S_{n}(y) -1. \end{array} ' class='latex' /></p>
<p> Some of the key properties are: </p>
<ol>
<li> The degree of <img src='http://l.wordpress.com/latex.php?latex=%7BA_%7Bn%7D%28x%2Cy%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A_{n}(x,y)}' title='{A_{n}(x,y)}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%7BO%28n%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{O(n)}' title='{O(n)}' class='latex' />;
<li> Each coefficient of <img src='http://l.wordpress.com/latex.php?latex=%7BA_%7Bn%7D%28x%2Cy%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A_{n}(x,y)}' title='{A_{n}(x,y)}' class='latex' /> is <img src='http://l.wordpress.com/latex.php?latex=%7B2%5E%7BO%28n%5E%7B2%7D%29%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2^{O(n^{2})}}' title='{2^{O(n^{2})}}' class='latex' />;
<li> If <img src='http://l.wordpress.com/latex.php?latex=%7Bx%2Cy%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x,y}' title='{x,y}' class='latex' /> are integers in <img src='http://l.wordpress.com/latex.php?latex=%7B%5B1%2C2%5E%7Bn%7D%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{[1,2^{n}]}' title='{[1,2^{n}]}' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%7BA_%7Bn%7D%28x%2Cy%29+%3E+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A_{n}(x,y) &gt; 0}' title='{A_{n}(x,y) &gt; 0}' class='latex' />;
<li> If <img src='http://l.wordpress.com/latex.php?latex=%7Bx%2Cy%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{x,y}' title='{x,y}' class='latex' /> are integers in <img src='http://l.wordpress.com/latex.php?latex=%7B%5B1%2C2%5E%7Bn%7D%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{[1,2^{n}]}' title='{[1,2^{n}]}' class='latex' />, then <img src='http://l.wordpress.com/latex.php?latex=%7BA_%7Bn%7D%28-x%2Cy%29+%3C+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A_{n}(-x,y) &lt; 0}' title='{A_{n}(-x,y) &lt; 0}' class='latex' />, <img src='http://l.wordpress.com/latex.php?latex=%7BA_%7Bn%7D%28x%2C-y%29+%3C+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A_{n}(x,-y) &lt; 0}' title='{A_{n}(x,-y) &lt; 0}' class='latex' />, and <img src='http://l.wordpress.com/latex.php?latex=%7BA_%7Bn%7D%28-x%2C-y%29+%3C+0%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A_{n}(-x,-y) &lt; 0}' title='{A_{n}(-x,-y) &lt; 0}' class='latex' />.
</ol>
<p>
<p><b> Matrices </b></p>
<p><p>
I have several open problems that concern the behavior of matrices. I noticed that the following simple theorem can be proved using Newman&#8217;s theorem. This is likely to be known, but I will present the statement here with an outline of the proof. </p>
<blockquote><p><b>Lemma: </b> <em> Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmid+f%28x%29+-g%28x%29+%5Cmid+%5Cle+%5Cdelta+%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{\mid f(x) -g(x) \mid \le \delta }' title='{\mid f(x) -g(x) \mid \le \delta }' class='latex' /> for all <img src='http://l.wordpress.com/latex.php?latex=%7Bx%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{x}' title='{x}' class='latex' /> in the interval <img src='http://l.wordpress.com/latex.php?latex=%7B%5B-1%2C1%5D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{[-1,1]}' title='{[-1,1]}' class='latex' />, where <img src='http://l.wordpress.com/latex.php?latex=%7Bf%28x%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{f(x)}' title='{f(x)}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Bg%28x%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{g(x)}' title='{g(x)}' class='latex' /> are polynomials. Suppose also that <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is an <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> matrix with all its eigenvalues in <img src='http://l.wordpress.com/latex.php?latex=%7B%5B-1%2C1%5D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{[-1,1]}' title='{[-1,1]}' class='latex' />. Then,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cmid+%5Cmathsf%7Btrace%7D%28f%28A%29%29+-+%5Cmathsf%7Btrace%7D%28g%28A%29%29+%5Cmid+%5Cle+n%5Cdelta.&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  \mid \mathsf{trace}(f(A)) - \mathsf{trace}(g(A)) \mid \le n\delta.' title='\displaystyle  \mid \mathsf{trace}(f(A)) - \mathsf{trace}(g(A)) \mid \le n\delta.' class='latex' /></p>
<p> </em></p></blockquote>
<p> Recall, <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cmathsf%7Btrace%7D%28A%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\mathsf{trace}(A)}' title='{\mathsf{trace}(A)}' class='latex' /> is the sum of the diagonal entries of the matrix <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />. </p>
<p>
What is lemma says is: if an inequality holds for an interval that contains the eigenvalues of a matrix, then a related inequality holds for the traces of the matrices. </p>
<p>
The proof in the case that <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is diagonalizable is fairly straightforward. Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7BA+%3D+S%5E%7B-1%7DDS%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A = S^{-1}DS}' title='{A = S^{-1}DS}' class='latex' /> where <img src='http://l.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' /> is a diagonal matrix. Then, the key insight is that
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cbegin%7Barray%7D%7Brcl%7D++%09%5Cmathsf%7Btrace%7D%28f%28A%29%29+%26%3D%26+%5Cmathsf%7Btrace%7D%28f%28S%5E%7B-1%7DDS%29%29+%5C%5C+%09%09%09%09%09%09%09%26%3D%26+%5Cmathsf%7Btrace%7D%28S%5E%7B-1%7Df%28D%29S%29+%5C%5C+%09%09%09%09%09%09%09%26%3D%26+%5Cmathsf%7Btrace%7D%28f%28D%29%29.+%5Cend%7Barray%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \begin{array}{rcl}  	\mathsf{trace}(f(A)) &amp;=&amp; \mathsf{trace}(f(S^{-1}DS)) \\ 							&amp;=&amp; \mathsf{trace}(S^{-1}f(D)S) \\ 							&amp;=&amp; \mathsf{trace}(f(D)). \end{array} ' title='\displaystyle  \begin{array}{rcl}  	\mathsf{trace}(f(A)) &amp;=&amp; \mathsf{trace}(f(S^{-1}DS)) \\ 							&amp;=&amp; \mathsf{trace}(S^{-1}f(D)S) \\ 							&amp;=&amp; \mathsf{trace}(f(D)). \end{array} ' class='latex' /></p>
<p>The last equality follows from a basic property of the <a href="http://en.wikipedia.org/wiki/Trace_(linear_algebra)">trace</a> of a matrix.</p>
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cmathsf%7Btrace%7D%28f%28A%29%29+-+%5Cmathsf%7Btrace%7D%28g%28A%29%29+%3D+%5Cmathsf%7Btrace%7D%28f%28D%29%29+-+%5Cmathsf%7Btrace%7D%28g%28D%29%29.+&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='\displaystyle  \mathsf{trace}(f(A)) - \mathsf{trace}(g(A)) = \mathsf{trace}(f(D)) - \mathsf{trace}(g(D)). ' title='\displaystyle  \mathsf{trace}(f(A)) - \mathsf{trace}(g(A)) = \mathsf{trace}(f(D)) - \mathsf{trace}(g(D)). ' class='latex' /></p>
<p> The lemma then follows by applying the inequality on <img src='http://l.wordpress.com/latex.php?latex=%7Bf%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{f(x)}' title='{f(x)}' class='latex' /> and <img src='http://l.wordpress.com/latex.php?latex=%7Bg%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{g(x)}' title='{g(x)}' class='latex' /> to each entry of the diagonal matrix <img src='http://l.wordpress.com/latex.php?latex=%7BD%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{D}' title='{D}' class='latex' />; the bound is clearly <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> times <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cdelta%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\delta}' title='{\delta}' class='latex' />.</p>
<p>
The proof in the case that <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is <i>not</i> diagonalizable follows by a trick that I believe is due to Richard Bellman. His insight is that given any <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> there is a <img src='http://l.wordpress.com/latex.php?latex=%7BA%5E%7B%28k%29%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A^{(k)}}' title='{A^{(k)}}' class='latex' /> that is diagonalizable and is within <img src='http://l.wordpress.com/latex.php?latex=%7B1%2Fk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{1/k}' title='{1/k}' class='latex' /> of <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />. Then, one lets <img src='http://l.wordpress.com/latex.php?latex=%7Bk%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{k}' title='{k}' class='latex' /> tend to <img src='http://l.wordpress.com/latex.php?latex=%7B%5Cinfty%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\infty}' title='{\infty}' class='latex' /> and concludes the result by continuity. This trick, for example, gives a very short proof of the fact that any matrix satisfies its own characteristic polynomial. First prove this for diagonalizable matrices, which is simple, and then use the Bellman trick.</p>
<blockquote><p><b>Theorem: </b> <em> Suppose that <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' /> is an <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=%7Bn%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n}' title='{n}' class='latex' /> matrix with all its eigenvalues in the interval <img src='http://l.wordpress.com/latex.php?latex=%7B%5B-1%2C1%5D%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{[-1,1]}' title='{[-1,1]}' class='latex' />, and with <img src='http://l.wordpress.com/latex.php?latex=%7Bn+%5Cge+5%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{n \ge 5}' title='{n \ge 5}' class='latex' />. Let <img src='http://l.wordpress.com/latex.php?latex=%7Br_%7Bn%7D%28x%29%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{r_{n}(x)}' title='{r_{n}(x)}' class='latex' /> be the rational function from Newman&#8217;s theorem. Then,
<p align="center"><img src='http://l.wordpress.com/latex.php?latex=%5Cdisplaystyle++%5Cmid+%5Cmathsf%7Btrace%7D%28r_%7Bn%7D%28A%29%29+-+S+%5Cmid+%5Cle+3n%5Cexp%28-%7B%5Csqrt+n%7D%29&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='\displaystyle  \mid \mathsf{trace}(r_{n}(A)) - S \mid \le 3n\exp(-{\sqrt n})' title='\displaystyle  \mid \mathsf{trace}(r_{n}(A)) - S \mid \le 3n\exp(-{\sqrt n})' class='latex' /></p>
<p> where <img src='http://l.wordpress.com/latex.php?latex=%7BS%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{S}' title='{S}' class='latex' /> is the sum of the absolute values of the eigenvalues of <img src='http://l.wordpress.com/latex.php?latex=%7BA%7D&#038;bg=e8e8e8&#038;fg=000000&#038;s=0' alt='{A}' title='{A}' class='latex' />. </em></p></blockquote>
<p><p>
The proof follows from the lemma and Newman&#8217;s theorem using <img src='http://l.wordpress.com/latex.php?latex=%7Bf+%3D+r_n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{f = r_n}' title='{f = r_n}' class='latex' /> and the absolute value function for <img src='http://l.wordpress.com/latex.php?latex=%7Bg%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{g}' title='{g}' class='latex' />.</p>
<p>
<p><b> Open Problems </b></p>
<p><p>
An obvious open problem is to find other applications of Newman&#8217;s theorem. Can you use the matrix version? I had an idea to use the matrix version on a learning problem, but that has not worked out yet. So I will get back to that in the future.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3655/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3655/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3655/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3655/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3655/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3655/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3655/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3655/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3655/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3655/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3655&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/07/the-surprising-power-of-rational-functions/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images3.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>
	</item>
		<item>
		<title>The World is Digital</title>
		<link>http://rjlipton.wordpress.com/2009/10/04/the-world-is-digital/</link>
		<comments>http://rjlipton.wordpress.com/2009/10/04/the-world-is-digital/#comments</comments>
		<pubDate>Mon, 05 Oct 2009 01:29:57 +0000</pubDate>
		<dc:creator>rjlipton</dc:creator>
				<category><![CDATA[People]]></category>
		<category><![CDATA[Algorithms]]></category>
		<category><![CDATA[digital]]></category>
		<category><![CDATA[Turing Award]]></category>
		<category><![CDATA[world]]></category>

		<guid isPermaLink="false">http://rjlipton.wordpress.com/?p=3585</guid>
		<description><![CDATA[
 One of the biggest insights of computer science is that the world is digital 



Alan Kay is a brilliant computer scientist who has won the Turing Award, along with many other honors. He is famous for creating the Dynabook concept&#8212;the idea that became the laptop, and for creating many programming languages, such as Smalltalk.

Today [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3585&subd=rjlipton&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font color="”#0066cc?"><br />
<em> One of the biggest insights of computer science is that the world is digital </em><br />
<font color="”#000000?"><br />
<img src="http://rjlipton.files.wordpress.com/2009/10/images2.jpeg?w=91&#038;h=100" alt="images" title="images" width="91" height="100" class="alignright size-full wp-image-3591" /></p>
<p><p>
Alan Kay is a brilliant computer scientist who has won the Turing Award, along with many other honors. He is famous for creating the <a href="http://en.wikipedia.org/wiki/Dynabook">Dynabook</a> concept&#8212;the idea that became the laptop, and for creating many programming languages, such as <a href="http://en.wikipedia.org/wiki/Smalltalk">Smalltalk</a>.</p>
<p>
Today I want to talk about one of the biggest results in all of Computer Science. No, it is not the Halting Problem, it is not the P=NP question, it is not that Linear Programming is in P, nor any of the many other beautiful results of theory. It is, in my opinion, the realization that the <i>world is digital</i> (WID).<br />
<span id="more-3585"></span></p>
<p>
This summer I was at an NSF workshop on <a href="http://www.cs.cmu.edu/afs/cs/usr/wing/www/publications/Wing06.pdf">Computational Thinking</a>&#8212;an idea that Jeannette Wing is behind. During the workshop I said:  </p>
<blockquote><p>
In my opinion, one of the key insights in all of Computer Science, is that the world is digital.
</p></blockquote>
<p>I also added that this was a relatively recent result, and it has had a tremendous impact on the world.</p>
<p>
As soon as I said this, Bill Wulf and Alan Kay both jumped all over me, and continued their comments into the next coffee break. They must have thought that I was crazy or worse&#8212;perhaps they still do. Their position was that WID had been known forever&#8212;back to the Greeks&#8212;therefore it was not new, nor was it a deep insight. I disagreed. It&#8217;s hard to argue with the past president of the National Academy of Engineering, Bill; and with a Turing Award winner, Alan. But I tried. I did not get them to budge, not at all.</p>
<p>
I still think I am right. I will explain why and I hope that you might agree with me, but either way let me know what you think.</p>
<p>
<p><b> The World is Digital </b></p>
<p>
<blockquote><p>
 <i>The more original a discovery, the more obvious it seems afterwards</i>&#8212;Arthur Koestler, journalist, novelist, social philosopher, and science writer.
</p></blockquote>
<p>
The fact that all things can be represented by sequences of zeroes and ones sounds like a simple idea, but I claim that it is deep and profound. </p>
<p>
Look around you. You may see, like I do, beautiful trees outside in the backyard. You may hear wonderful sounds: birds chirping; music playing classical or rock or hip hop or another genre. You may be reading a book or watching a show on a HD-TV and seeing images of people that are almost lifelike. </p>
<p>
Yet no matter how beautiful the images are, no matter how complex the sounds are, they all can be reduced to sequences of bits. The world truly is digital. In a sense, 0 and 1 represent computer science&#8217;s basic atomic particles. Information consists of arrangements of 0&#8217;s and 1&#8217;s. Nothing else is needed. Computer science differs from modern physics, while Physics uses hundreds of &#8220;basic particles&#8221; to build the world&#8212;we only need two. </p>
<p>
I believe that the discovery that all things can be viewed as digital is as revolutionary as the notion that the world is made of atomic particles. But, the discovery that the WID took half a century longer to discover than even the simplest atomic particles. </p>
<p>
Electrons, for example, were discovered by Joseph Thomson in 1897, long before we knew that the WID. Some may scoff and say that it&#8217;s obvious that the WID. But, I think I can argue that is not so. Yes, particle physics is a deep theory; however, Computer Science&#8217;s insight that everything can be represented digitally is profound. </p>
<p>
<p><b> Who Knew What When? </b></p>
<p><p>
One way to argue, I believe, that WID is profound is that it is a recent discovery. I believe that if it was &#8220;obvious,&#8221; then it would have been known for a long time. </p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>The Ancient Greeks:</i> Greats like Euclid did not know the WID. In all of his famous <a href="http://en.wikipedia.org/wiki/Euclid's_Elements">Elements</a>, there is not one section on boolean functions. </p>
<p>
Yet, Alan and Bill said to me that the ancient Greeks already knew that the WID. Of course, the Greeks knew that the square root of <img src='http://l.wordpress.com/latex.php?latex=%7B2%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{2}' title='{2}' class='latex' /> was not a rational number. Somehow, Alan and Bill felt that this showed that they knew that the WID. I think the Greeks knew many wonderful things but that they were in the dark with respect to the WID&#8212;at least that is my opinion.</p>
<p>
A kind of <a href="http://en.wikipedia.org/wiki/Atomic_theory">atomic theory</a> was known, without proof, to the Greeks. That the world is made of discrete atoms is not the same I would argue as the WID. They did not make the key jump: that is they did not realize that any object of any kind could be represented as a binary string. It seems obvious to us today, but it was not centuries ago.</p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>The Classic Mathematicians:</i> The great mathematicians such as Augustin Cauchy, Joseph Fourier, Carl Gauss, David Hilbert, Pierre-Simon Laplace, and Isaac Newton did not know the WID. They mainly studied continuous systems, and when they did work on discrete systems, they worked with the integers. But I do not believe they ever thought about boolean functions, and nor<br />
do I think that they knew the WID.</p>
<p><p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>Lewis Carroll:</i> Greats like Lewis Carroll&#8212;Charles Dodgson&#8212;did not know that the WID. While he is famous for his <i>Alice In Wonderland</i> series of books, during his later years he spent a great deal of time working on <i>symbolic logic</i>. He was especially interested in the theory of <i>syllogisms</i>: a set of premises that lead to a conclusion. Here is one of his examples:  </p>
<blockquote><p>
Babies are illogical<br />
Nobody is despised who can manage a crocodile<br />
Illogical persons are despised
</p></blockquote>
<p>He then concluded that  </p>
<blockquote><p>
Babies cannot manage crocodiles
</p></blockquote>
<p>
Lewis Carroll used a version of <a href="http://en.wikipedia.org/wiki/Venn_diagram">Venn diagrams</a> to help his readers understand his logical examples; these diagrams are named after the mathematician John Venn who introduced them in 1880. Carroll worked out a method of squares within squares that worked up to <img src='http://l.wordpress.com/latex.php?latex=%7B8%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{8}' title='{8}' class='latex' /> letters&#8212;where a letter is a &#8220;proposition.&#8221;</p>
<p>
What I find interesting is that Carroll did not see that he was working with boolean logic. His examples and notation are fairly complex, and show, I believe, again that the central nature of bits and boolean values was not understood by him nor his contemporaries. </p>
<p><p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>G&ouml;del:</i> Greats like Kurt G&ouml;del did not know the WID. If he had, he would not have needed to invent his elaborate scheme for <a href="http://en.wikipedia.org/wiki/Godel_number">numbering proofs</a>. If he had realized that proofs were only a finite collection of symbols, he could have assigned each symbol a unique binary string. That&#8217;s all he needed to do: no primes, no fancy facts about primes, just binary strings. Indeed, I never present his complex numbering system when I teach G&ouml;del&#8217;s Theorem; I just explain that proofs and formulas are binary strings. </p>
<p>
<img src='http://l.wordpress.com/latex.php?latex=%7B%5Cbullet+%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{\bullet }' title='{\bullet }' class='latex' /> <i>Markov:</i> Greats like the Russian mathematician Andrey Markov did not know the WID. He invented a model of computation that he called &#8220;normal algorithms.&#8221; Unlike Turing Machines, his model was based on string rewriting rules. I remember reading his 1950 book on his theory of algorithms. One of the striking parts of the book, to me, was the introduction. His model needed to restrict the set of symbols to a finite set of symbols&#8212;otherwise, his rewriting rules would not be finite. Today, since we know that the WID this is obviously no issue. However, when he wrote his book he was concerned about the restriction to a finite alphabet. He talked. at length, about the infinite number of ways that one might be able to write a script letter such an &#8220;a&#8221;:</p>
<p>
<img src="http://rjlipton.files.wordpress.com/2009/10/pic2.png?w=92&#038;h=90" alt="pic" title="pic" width="92" height="90" class="aligncenter size-full wp-image-3619" /></p>
<p>
But, he reasoned that while there might be an infinite number of script letter &#8220;a&#8221;s, the restriction to a fixed size alphabet was just fine.</p>
<p>
That Markov felt, in 1950, that he had to discuss the restriction to a finite alphabet shows me that he was unaware that the WID. Today if I had to argue why a finite alphabet suffices to describe any script letter, I would point out that we can just represent the letter as a digital image. Let the image be <img src='http://l.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> by <img src='http://l.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> pixels, where <img src='http://l.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> is some huge number&#8212;if <img src='http://l.wordpress.com/latex.php?latex=%7BN%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0' alt='{N}' title='{N}' class='latex' /> is large enough I would argue that if a human can tell two script letters apart, then they would have different digital representations.</p>
<p>
<p><b> How I Came to Know the World is Digital </b></p>
<p><p>
I remember when it finally began to dawn on me that the world is digital. I would like to say that I always knew, or at least that it came to me in a flash one day. But that would be a lie. It took me years to realize it. It&#8217;s one of those ideas or notions that came slowly to me. However, once I saw it, I wondered how I could have not always known it.</p>
<p>
At Carnegie Mellon, where I got my Ph.D., we were never taught that the WID. My professors, like Albert Meyer and Alan Perlis, never told me. Meyer was then, and is now a brilliant researcher&#8212;he never told me the WID. Perlis was, at the time, the chair of the entire department of Computer Science, yet he never told me either. </p>
<p>
When I was an assistant professor of Computer Science at Yale University in 1976, I got my first inkling that the world could be viewed as digital. I was on the phone with a friend, Bob Sedgewick, who at the time was an assistant professor at Brown University. He was telling me how a newly purchased printer worked. </p>
<p>
Now you have to go back 30 years to recall that printers were once very different from the beautiful laser/ink jet printers of today. In those days a printer was a typewriter on steroids. In case you have never used one, here is how a typewriter worked. There was a keyboard like today&#8217;s laptops. But when you pressed a key, a piece of metal hit an ink ribbon and then hit your paper. In this way each letter was printed. </p>
<p>
Printers did the same&#8212;again each letter had a piece of metal that was pressed onto the paper. The user was limited to the font and symbols which were built into the printer. Some printers had clever arrangements, so the printer could go fast. Some had the metal letters arranged on wheels; others had them arranged in even more complex configurations. In any event, typewriters and printers used the impact of a piece of metal onto an ink ribbon to leave a mark on paper. The only difference between printers and typewriters was speed. </p>
<p>
Back to my story: Bob was telling me about a new kind of printer that used dots to represent each letter or symbol on the page. He explained how the page was coded as an array of 0&#8217;s and 1&#8217;s. Where a 1 occurred, the printer placed ink on the page; a 0 instructed the printer to leave a blank on the spot. The way the ink was placed onto the paper was not laser-based or ink-jet based by the way. This early printer used a more primitive technology; however, the page was represented as an array of bits&#8212;it was digital. </p>
<p>
I have to say, I was surprised. I knew about G&ouml;del numbers. I knew about binary numbers. I knew a lot of computer science&#8212;I thought. However, I somehow was surprised that the printer was just placing bits onto a piece of paper. I did not see that printed words and images could be represented solely by dots. Really. </p>
<p>
What Bob explained was that he had written a program that created an array of 0&#8217;s and 1&#8217;s. Then, his program sent the array to the printer which created a printed page. He explained, further, that the program could create text or images or anything in this manner. This was a major shift from pieces of metal striking a ribbon and putting ink onto a page. </p>
<p>
Typewriters and printers had been limited to print only the images that were built into the metal keys or balls. Now with the digital approach of using bits as a way to control the printing process, a whole new vista opened up. Any picture, any image, any font could, in principle, be represented, or printed, without changing the hardware.</p>
<p>
This was a huge breakthrough.</p>
<p>
<p><b> The Chicken and The Egg Argument </b></p>
<p><p>
I always show these posts to Subruk before I post them; otherwise, they would be a mess. This time he raised an interesting point that we decided that he should make&#8212;that I would not change my thoughts to conform to his insight. I am doing this partly because I do not completely agree with his point, and partly because I think it is a great insight that he should make himself. The plan is that he will add a comment to this once it is out there for you to read. </p>
<p>
I would, however, like to address an issue that is related to his point, but please read his comment.</p>
<p>
Mathematics is driven by practice and also drives practice. There are many examples of deep mathematics that was created to solve very practical problems. For example, the theory of Fourier Series was driven by the need to solve certain differential equations. Many other examples come to mind: geometry&#8212;construction of temples, calculus&#8211;astronomy, probability theory&#8212;life insurance tables, and on and on. </p>
<p>
On the other hand, many times mathematics has been ahead of practice. Godfrey Hardy was actually proud that his favorite area, number theory, was <a href="http://en.wikipedia.org/wiki/A_Mathematician's_Apology">pure</a>&#8212;that it had no possible applications. Of course we can only speculate what he would think of modern cryptography&#8217;s use of number theory as a basis of encryption. Other areas of mathematics were also developed because they were beautiful. Many examples come to mind: abstract measure theory, set theory, algebraic geometry, infinite dimensional geometry&#8212;I am sure the list could go on and on. Sometimes these areas have eventually been used to solve real problems, and sometimes they remain beautiful areas that seem to be without any practical application.</p>
<p>
My point is simple. Even before computers existed mathematicians <i>could</i> have worked on binary strings, on boolean functions, and on many of the areas that we work on today. They did not, in my opinion, because they did not understand the power of bits; they did not know that the WID.</p>
<p>
A final point. If they had worked on boolean functions years ago, one wonders where theory would be today? Curiously, the difficult foundational issues that plagued early analysis, for example, would have been completely moot. Early mathematicians struggled with the basic question of &#8220;what is a real number?&#8221; If they had worked on boolean problems, these difficult problems would have been completely avoided.</p>
<p>
<p><b> Open Problems </b></p>
<p><p>
Is the &#8220;world is digital&#8221; one of the great insights of Computer Science? Or do you agree with Kay and Wulf? </p>
<p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rjlipton.wordpress.com/3585/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rjlipton.wordpress.com/3585/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rjlipton.wordpress.com/3585/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rjlipton.wordpress.com/3585/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rjlipton.wordpress.com/3585/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rjlipton.wordpress.com/3585/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rjlipton.wordpress.com/3585/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rjlipton.wordpress.com/3585/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rjlipton.wordpress.com/3585/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rjlipton.wordpress.com/3585/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rjlipton.wordpress.com&blog=6472207&post=3585&subd=rjlipton&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://rjlipton.wordpress.com/2009/10/04/the-world-is-digital/feed/</wfw:commentRss>
		<slash:comments>65</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/afaa664a052777146630095c596117b2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lipton</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/images2.jpeg" medium="image">
			<media:title type="html">images</media:title>
		</media:content>

		<media:content url="http://rjlipton.files.wordpress.com/2009/10/pic2.png" medium="image">
			<media:title type="html">pic</media:title>
		</media:content>
	</item>
	</channel>
</rss>