<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments for Self-Aware Systems</title>
	<atom:link href="http://selfawaresystems.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://selfawaresystems.com</link>
	<description>Wisdom Technology for Humanity</description>
	<pubDate>Thu, 08 Jan 2009 01:01:31 +0000</pubDate>
	<generator>http://wordpress.org/?v=MU</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Paper on the Nature of Self-Improving Artificial Intelligence by omohundro</title>
		<link>http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-27</link>
		<dc:creator>omohundro</dc:creator>
		<pubDate>Fri, 26 Dec 2008 03:08:56 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-27</guid>
		<description>Wei Dai, 

Thanks for the comments. You are absolutely right that arbitrage arguments require that the system has rich enough dynamics to exploit any vulnerabilities. An agent can remain irrational without negative consequences if it is not possible for an adversary to take the actions necessary to exploit it. However, there are usually many ways to exploit an irrationality and an agent takes a huge risk by remaining irrational. Biologically, it creates a "niche" for any adversary which discovers how to exploit the irrationality. Intelligent systems which are able to modify themselves and which have any uncertainty about the laws of physics or about adversaries they may face in the future will have a huge incentive to eliminate their vulnerabilities. 

In your example, an agent which changes its mind about its preferences is exploitable. Say you prefer apples to bananas today, but bananas to apples tommorrow. At 11:59PM I sell you an apple in return for some money and a banana. Two minutes later at 12:01AM, I sell you the banana in return for some more money and the apple. It's 2 minutes later and you are poorer but otherwise in the same state. 

For the bid/ask spread example, you may not have that choice. Think of an animal who has to decide whether to fight or flee from a predator based on its assessment of the two distributions that arise. It has to pick one of the choices whether it likes it or not. The option of not acting isn't part of the dynamics of that situation. 

In general, preferences are over complete future time histories including your present assessment of your future actions. If a rational agent discovers that he has an internal mechanism that will cause him to change his preferences in the future, the rational response is to try to disable that mechanism, change himself, or set up an external commitment structure to keep his present preferences. Imagine yourself as a lover of books and to your horror you discover that you have a gene which will kick in in one year that will cause you to become an arsonist who will go around burning down libraries. What actions would you currently take? 

Best wishes,
Steve</description>
		<content:encoded><![CDATA[<p>Wei Dai, </p>
<p>Thanks for the comments. You are absolutely right that arbitrage arguments require that the system has rich enough dynamics to exploit any vulnerabilities. An agent can remain irrational without negative consequences if it is not possible for an adversary to take the actions necessary to exploit it. However, there are usually many ways to exploit an irrationality and an agent takes a huge risk by remaining irrational. Biologically, it creates a &#8220;niche&#8221; for any adversary which discovers how to exploit the irrationality. Intelligent systems which are able to modify themselves and which have any uncertainty about the laws of physics or about adversaries they may face in the future will have a huge incentive to eliminate their vulnerabilities. </p>
<p>In your example, an agent which changes its mind about its preferences is exploitable. Say you prefer apples to bananas today, but bananas to apples tommorrow. At 11:59PM I sell you an apple in return for some money and a banana. Two minutes later at 12:01AM, I sell you the banana in return for some more money and the apple. It&#8217;s 2 minutes later and you are poorer but otherwise in the same state. </p>
<p>For the bid/ask spread example, you may not have that choice. Think of an animal who has to decide whether to fight or flee from a predator based on its assessment of the two distributions that arise. It has to pick one of the choices whether it likes it or not. The option of not acting isn&#8217;t part of the dynamics of that situation. </p>
<p>In general, preferences are over complete future time histories including your present assessment of your future actions. If a rational agent discovers that he has an internal mechanism that will cause him to change his preferences in the future, the rational response is to try to disable that mechanism, change himself, or set up an external commitment structure to keep his present preferences. Imagine yourself as a lover of books and to your horror you discover that you have a gene which will kick in in one year that will cause you to become an arsonist who will go around burning down libraries. What actions would you currently take? </p>
<p>Best wishes,<br />
Steve</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on the Nature of Self-Improving Artificial Intelligence by Wei Dai</title>
		<link>http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-26</link>
		<dc:creator>Wei Dai</dc:creator>
		<pubDate>Fri, 12 Dec 2008 10:38:56 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-26</guid>
		<description>Hi, I found your paper by way of http://www.overcomingbias.com/2008/12/two-visions-of/comments/page/2/#comment-142322542.

I think unfortunately the derivation in chapter 10 of expected utility maximization from the need to avoid vulnerabilities, especially section 10.9, doesn't work, because there are ways to avoid being Dutch booked, other than being an expected utility maximizer. For example, I may prefer a mixture of L1 and L2 to both L1 and L2, and as soon as the alpha-coin is flipped, change my preferences so that I now have the highest preference for either L1 or L2 depending on the outcome of the coin.

To give a real-world example, suppose I come home and my SO asks me "Do you want chicken or pork for dinner?" I say "Surprise me." Then whatever dinner turns out to be is what I want. I don't go in circles and say "I'd like to exchange that for another surprise, please."

Another way to avoid being Dutch booked is to have an ask/bid spread. Why should it be that for any mixture of L1 and L2, I must have a single price at which I am willing to both buy and sell that mixture? If there's a difference between the price that I'm willing to buy at, and the price that I'm willing to sell at, then that leaves me some room to violate expected utility maximization without being exploited.

Or I may have a vulnerability, but morality, customs, law, or high transaction costs prevent anyone from making a profit exploiting it.

I suppose the first objection is the most serious one (i.e. exploitable circularity can be avoided by changing preferences). The others, while showing that expected utility maximization doesn't have to be followed exactly, leaves open that it should be approximated.</description>
		<content:encoded><![CDATA[<p>Hi, I found your paper by way of <a href="http://www.overcomingbias.com/2008/12/two-visions-of/comments/page/2/#comment-142322542" rel="nofollow">http://www.overcomingbias.com/2008/12/two-visions-of/comments/page/2/#comment-142322542</a>.</p>
<p>I think unfortunately the derivation in chapter 10 of expected utility maximization from the need to avoid vulnerabilities, especially section 10.9, doesn&#8217;t work, because there are ways to avoid being Dutch booked, other than being an expected utility maximizer. For example, I may prefer a mixture of L1 and L2 to both L1 and L2, and as soon as the alpha-coin is flipped, change my preferences so that I now have the highest preference for either L1 or L2 depending on the outcome of the coin.</p>
<p>To give a real-world example, suppose I come home and my SO asks me &#8220;Do you want chicken or pork for dinner?&#8221; I say &#8220;Surprise me.&#8221; Then whatever dinner turns out to be is what I want. I don&#8217;t go in circles and say &#8220;I&#8217;d like to exchange that for another surprise, please.&#8221;</p>
<p>Another way to avoid being Dutch booked is to have an ask/bid spread. Why should it be that for any mixture of L1 and L2, I must have a single price at which I am willing to both buy and sell that mixture? If there&#8217;s a difference between the price that I&#8217;m willing to buy at, and the price that I&#8217;m willing to sell at, then that leaves me some room to violate expected utility maximization without being exploited.</p>
<p>Or I may have a vulnerability, but morality, customs, law, or high transaction costs prevent anyone from making a profit exploiting it.</p>
<p>I suppose the first objection is the most serious one (i.e. exploitable circularity can be avoided by changing preferences). The others, while showing that expected utility maximization doesn&#8217;t have to be followed exactly, leaves open that it should be approximated.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on The Basic AI Drives by Wie kann eine ungefährliche KI gefährlich werden? &#171; Weissbier &#38; Wissenschaft</title>
		<link>http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/#comment-23</link>
		<dc:creator>Wie kann eine ungefährliche KI gefährlich werden? &#171; Weissbier &#38; Wissenschaft</dc:creator>
		<pubDate>Tue, 02 Sep 2008 11:22:05 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/#comment-23</guid>
		<description>[...] 2, 2008 von phi1ipp    Darum gehts in diesem Paper; &#8220;The Basic AI Drives&#8221; von Stephen Omohundro. Die Grundidee ist folgende: Egal wie [...]</description>
		<content:encoded><![CDATA[<p>[...] 2, 2008 von phi1ipp    Darum gehts in diesem Paper; &#8220;The Basic AI Drives&#8221; von Stephen Omohundro. Die Grundidee ist folgende: Egal wie [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on The Basic AI Drives by The Singularity Institute Blog : Blog Archive : Singularity Institute Q1 2008 Update</title>
		<link>http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/#comment-16</link>
		<dc:creator>The Singularity Institute Blog : Blog Archive : Singularity Institute Q1 2008 Update</dc:creator>
		<pubDate>Sat, 03 May 2008 03:54:50 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/#comment-16</guid>
		<description>[...] Omohundro&#8217;s paper on &#8220;The Basic AI Drives,&#8221; and the following interviews on the challenges of artificial intelligence: SIAI Interview [...]</description>
		<content:encoded><![CDATA[<p>[...] Omohundro&#8217;s paper on &#8220;The Basic AI Drives,&#8221; and the following interviews on the challenges of artificial intelligence: SIAI Interview [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on The Basic AI Drives by Accelerating Future &#187; A Non-Half-Assed Response to &#8220;One Half a Manifesto&#8221;</title>
		<link>http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/#comment-15</link>
		<dc:creator>Accelerating Future &#187; A Non-Half-Assed Response to &#8220;One Half a Manifesto&#8221;</dc:creator>
		<pubDate>Wed, 30 Apr 2008 03:07:08 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/11/30/paper-on-the-basic-ai-drives/#comment-15</guid>
		<description>[...] and uses it to modify its environment, then sure, it could do a lot of good or evil. According to Steve Omohundro, even an AI with harmless goals could engage in harmful behaviors. I see it as best to assume that [...]</description>
		<content:encoded><![CDATA[<p>[...] and uses it to modify its environment, then sure, it could do a lot of good or evil. According to Steve Omohundro, even an AI with harmless goals could engage in harmful behaviors. I see it as best to assume that [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on the Nature of Self-Improving Artificial Intelligence by Future Current &#187; Blog Archive &#187; Fostering Wisdom Technology</title>
		<link>http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-14</link>
		<dc:creator>Future Current &#187; Blog Archive &#187; Fostering Wisdom Technology</dc:creator>
		<pubDate>Thu, 24 Apr 2008 00:23:05 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-14</guid>
		<description>[...] paper I just wrote called &#8220;The Nature of Artificial Intelligence&#8221; presents a detailed argument that aside from limitations of computational power we know [...]</description>
		<content:encoded><![CDATA[<p>[...] paper I just wrote called &#8220;The Nature of Artificial Intelligence&#8221; presents a detailed argument that aside from limitations of computational power we know [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on What is rational behavior? by Warren D. Smith</title>
		<link>http://selfawaresystems.com/2007/10/13/what-is-rational-behavior/#comment-13</link>
		<dc:creator>Warren D. Smith</dc:creator>
		<pubDate>Wed, 23 Apr 2008 16:34:26 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/13/what-is-rational-behavior/#comment-13</guid>
		<description>Hey Steve, long time no see.

Somebody brought your stuff to my attention and pretty fascinating.

I wrote a paper "mathematical definition of intelligence" I thought was going to be key to future progress in AI.  See paper #93 at
http://www.math.temple.edu/~wds/homepage/works.html
(#94 also interesting).  Turns out my work here was pretty much a rediscovery of Marcus Hutter's work, but we each knew stuff the other did not.

Now it then seemed to me that somebody should start following up on my/Hutter's work.  And nobody did.  But lo, I see YOUR work, which sounds like it IS following up on us, despite the fact you did not know about my paper you kind of have been following up on it anyhow.   You might be better off explicitly realizing that, though, and doing some of the explicit things I discuss, like building the "platform for intelligence testing" I recommend, and building an
AI that genuinely *is* an AI (by my defn).  Those things probably
would be pretty easy for you to do.

One more thing.  I saw a video of you speaking on self-improving AI at Stanford and getting grief from some guy about the "fact" that there is no way to measure utility, maybe utility does not exist etc (economics fruitiness).   Well, I have also gotten such grief in
a different context - voting systems.

My work on "Bayesian Regret" of voting systems, leading to conclusion "range voting" is the best of the usual commonly
proposed systems, involves utility.  HOWEVER, all my conclusions
about BR of voting systems work REGARDLESS of whether
any human utility for anything can ever be measured or not.
Mm?  That is the key thing.  It does not matter whether it can be measured.  My BR work still goes thru fine.   So I think that is a very 
strong reply to such hecklers, when it is usable.

You should get all your future-wacko pals to "endorse range voting".
You browse to 
http://rangevoting.org 
push the "endorse" button, and fill out the form.  May be necessary to have RV save the world.

Warren D. Smith.
Hope this message actually reaches you.</description>
		<content:encoded><![CDATA[<p>Hey Steve, long time no see.</p>
<p>Somebody brought your stuff to my attention and pretty fascinating.</p>
<p>I wrote a paper &#8220;mathematical definition of intelligence&#8221; I thought was going to be key to future progress in AI.  See paper #93 at<br />
<a href="http://www.math.temple.edu/~wds/homepage/works.html" rel="nofollow">http://www.math.temple.edu/~wds/homepage/works.html</a><br />
(#94 also interesting).  Turns out my work here was pretty much a rediscovery of Marcus Hutter&#8217;s work, but we each knew stuff the other did not.</p>
<p>Now it then seemed to me that somebody should start following up on my/Hutter&#8217;s work.  And nobody did.  But lo, I see YOUR work, which sounds like it IS following up on us, despite the fact you did not know about my paper you kind of have been following up on it anyhow.   You might be better off explicitly realizing that, though, and doing some of the explicit things I discuss, like building the &#8220;platform for intelligence testing&#8221; I recommend, and building an<br />
AI that genuinely *is* an AI (by my defn).  Those things probably<br />
would be pretty easy for you to do.</p>
<p>One more thing.  I saw a video of you speaking on self-improving AI at Stanford and getting grief from some guy about the &#8220;fact&#8221; that there is no way to measure utility, maybe utility does not exist etc (economics fruitiness).   Well, I have also gotten such grief in<br />
a different context - voting systems.</p>
<p>My work on &#8220;Bayesian Regret&#8221; of voting systems, leading to conclusion &#8220;range voting&#8221; is the best of the usual commonly<br />
proposed systems, involves utility.  HOWEVER, all my conclusions<br />
about BR of voting systems work REGARDLESS of whether<br />
any human utility for anything can ever be measured or not.<br />
Mm?  That is the key thing.  It does not matter whether it can be measured.  My BR work still goes thru fine.   So I think that is a very<br />
strong reply to such hecklers, when it is usable.</p>
<p>You should get all your future-wacko pals to &#8220;endorse range voting&#8221;.<br />
You browse to<br />
<a href="http://rangevoting.org" rel="nofollow">http://rangevoting.org</a><br />
push the &#8220;endorse&#8221; button, and fill out the form.  May be necessary to have RV save the world.</p>
<p>Warren D. Smith.<br />
Hope this message actually reaches you.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on the Nature of Self-Improving Artificial Intelligence by Foresight Vision Talk: Self-Improving AI and Designing 2030 &#171; Self-Aware Systems</title>
		<link>http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-11</link>
		<dc:creator>Foresight Vision Talk: Self-Improving AI and Designing 2030 &#171; Self-Aware Systems</dc:creator>
		<pubDate>Fri, 30 Nov 2007 16:10:12 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-11</guid>
		<description>[...] Paper on the Nature of Self-Improving Artificial&#160;Intelligence  [...]</description>
		<content:encoded><![CDATA[<p>[...] Paper on the Nature of Self-Improving Artificial&nbsp;Intelligence  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on the Nature of Self-Improving Artificial Intelligence by People Database blog &#187; Blog Archive &#187; Self-Improving AI: Designing 2030</title>
		<link>http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-10</link>
		<dc:creator>People Database blog &#187; Blog Archive &#187; Self-Improving AI: Designing 2030</dc:creator>
		<pubDate>Fri, 30 Nov 2007 06:04:57 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-10</guid>
		<description>[...] will briefly go through the argument. There is a full paper on it on my website: www.selfawaresystems.com. Let me first say what rational behavior is in this [...]</description>
		<content:encoded><![CDATA[<p>[...] will briefly go through the argument. There is a full paper on it on my website: <a href="http://www.selfawaresystems.com" rel="nofollow">http://www.selfawaresystems.com</a>. Let me first say what rational behavior is in this [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Paper on the Nature of Self-Improving Artificial Intelligence by Standford Computer Systems Colloquium: Self-Improving AI and the Future of Computing &#171; Self-Aware Systems</title>
		<link>http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-8</link>
		<dc:creator>Standford Computer Systems Colloquium: Self-Improving AI and the Future of Computing &#171; Self-Aware Systems</dc:creator>
		<pubDate>Thu, 01 Nov 2007 16:09:04 +0000</pubDate>
		<guid isPermaLink="false">http://selfawaresystems.com/2007/10/05/paper-on-the-nature-of-self-improving-artificial-intelligence/#comment-8</guid>
		<description>[...] Paper on the Nature of Self-Improving Artificial&#160;Intelligence  [...]</description>
		<content:encoded><![CDATA[<p>[...] Paper on the Nature of Self-Improving Artificial&nbsp;Intelligence  [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
