The Inside View #3–Evan Hubinger—Takeoff speeds, Risks from learned optimization & Interpretability

The Inside View

Transcript: https://www.alignmentforum.org/posts/NFfZsWrzALPdw54NL/the-inside-view-3-evan-hubinger-homogeneity-in-takeoff

Outline:
00:00 Evan’s background @ MIRI & OpenAI
3:29 Coconut & functional programming
6:56 Homogeneity in AI takeoff
14:08 Reproducing SoTA & openness in multipolar scenarios
29:16 Quantilizers & operationalizing strategy stealing
42:16 Risks from learned optimization & evolution
52:29 Learned optimization in Machine Learning
1:04:51 Clarifying Inner AI Alignment terminology
1:13:33 Transparency & Interpretability
1:25:26 11 proposals for safe advanced AI
1:39:13 Underappreciated problems in AI Alignment & surprising advances in AI

<iframe></p> <p><a href="https://www.youtube.com/watch?v=uQN0wqzy164">Source</a></p> <div class="be1e40beae42d993bafb8643f4ddde8b" data-index="3" style="float: none; margin:10px 0 10px 0; text-align:center;"> <script async src="https://pagead2.googlesyndication.com/pagead/js/adsbygoogle.js"></script> <ins class="adsbygoogle" style="display:block; text-align:center;" data-ad-layout="in-article" data-ad-format="fluid" data-ad-client="ca-pub-9244112244416304" data-ad-slot="4549240677"></ins> <script> (adsbygoogle = window.adsbygoogle || []).push({}); </script> </div> <div style="font-size: 0px; height: 0px; line-height: 0px; margin: 0; padding: 0; clear: both;"></div> </div> </article> <div class="clearfix"></div> <ul class="default-theme-post-navigation"> <li class="theme-nav-previous"><a href="https://theengineeringofconsciousexperience.com/melvin-ps-strategy-is-structured-to-fit-the-new-reality-is-ours/" rel="prev"><span class="meta-nav">←</span> Melvin P's Strategy Is Structured To Fit The New Reality – Is Ours?</a></li> <li class="theme-nav-next"><a href="https://theengineeringofconsciousexperience.com/photography-to-another-level-amazing-photo-effects-%e2%96%b6-12/" rel="next">Photography To Another Level | Amazing Photo Effects ▶ 12 <span class="meta-nav">→</span></a></li> </ul> <div class="clearfix"></div> <h3 class='comment-reply-title'>Similar Posts</h3> <div class="mb-related-posts mb-simple-featured-posts mb-simple-featured-posts-wrap row"> <article class="mb-featured-article col-md-4 px-lg-3 post"> <a class="post-thumbnail" href="https://theengineeringofconsciousexperience.com/3doodler-2020-product-unveil-introducing-the-pro-3d-printing-pen-for-creative-professionals/" aria-hidden="true" tabindex="-1"> <img width="400" height="300" src="https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/11/1604277523_hqdefault.jpg" class="attachment-magazinebook-featured-image-medium size-magazinebook-featured-image-medium wp-post-image" alt="" decoding="async" loading="lazy" srcset="https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/11/1604277523_hqdefault.jpg 480w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/11/1604277523_hqdefault-300x225.jpg 300w" sizes="auto, (max-width: 400px) 100vw, 400px" /> </a> <span class="cat-links"><a href="https://theengineeringofconsciousexperience.com/category/gpt-3/" rel="category tag">GPT 3</a></span> <header class="entry-header"> <h3 class="entry-title"><a href="https://theengineeringofconsciousexperience.com/3doodler-2020-product-unveil-introducing-the-pro-3d-printing-pen-for-creative-professionals/" rel="bookmark">3Doodler 2020 Product Unveil – Introducing the PRO+ 3D Printing Pen for Creative Professionals</a></h3> <div class="entry-meta"> <span class="posted-on"><i class="far fa-calendar-alt"></i><a href="https://theengineeringofconsciousexperience.com/3doodler-2020-product-unveil-introducing-the-pro-3d-printing-pen-for-creative-professionals/" rel="bookmark"><time class="entry-date published updated" datetime="2020-10-28T07:32:38-07:00">October 28, 2020</time></a></span><span class="byline"><i class="far fa-user-circle"></i><span class="author vcard"><a class="url fn n" href="https://theengineeringofconsciousexperience.com/author/4b5fe7d537ab5ac5640bc4d7066b2a06/">3Doodler</a></span></span> </div> </header> </article> <article class="mb-featured-article col-md-4 px-lg-3 post"> <a class="post-thumbnail" href="https://theengineeringofconsciousexperience.com/gpt-3-why-is-it-dangerous-explained-in-simple-sentence/" aria-hidden="true" tabindex="-1"> <img width="501" height="282" src="https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/09/1599150164_maxresdefault.jpg" class="attachment-magazinebook-featured-image-medium size-magazinebook-featured-image-medium wp-post-image" alt="" decoding="async" loading="lazy" srcset="https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/09/1599150164_maxresdefault.jpg 1280w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/09/1599150164_maxresdefault-300x169.jpg 300w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/09/1599150164_maxresdefault-1024x576.jpg 1024w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/09/1599150164_maxresdefault-768x432.jpg 768w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/09/1599150164_maxresdefault-520x293.jpg 520w" sizes="auto, (max-width: 501px) 100vw, 501px" /> </a> <span class="cat-links"><a href="https://theengineeringofconsciousexperience.com/category/gpt-3/" rel="category tag">GPT 3</a></span> <header class="entry-header"> <h3 class="entry-title"><a href="https://theengineeringofconsciousexperience.com/gpt-3-why-is-it-dangerous-explained-in-simple-sentence/" rel="bookmark">GPT-3 : WHY IS IT DANGEROUS ?? [Explained in Simple Sentence]</a></h3> <div class="entry-meta"> <span class="posted-on"><i class="far fa-calendar-alt"></i><a href="https://theengineeringofconsciousexperience.com/gpt-3-why-is-it-dangerous-explained-in-simple-sentence/" rel="bookmark"><time class="entry-date published updated" datetime="2020-08-21T04:19:01-07:00">August 21, 2020</time></a></span><span class="byline"><i class="far fa-user-circle"></i><span class="author vcard"><a class="url fn n" href="https://theengineeringofconsciousexperience.com/author/4eca778cbc3d3373d95927ece49a58db/">The Panda Explores</a></span></span> </div> </header> </article> <article class="mb-featured-article col-md-4 px-lg-3 post"> <a class="post-thumbnail" href="https://theengineeringofconsciousexperience.com/what-its-like-to-be-a-computer-an-interview-with-gpt-3/" aria-hidden="true" tabindex="-1"> <img width="501" height="282" src="https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/10/1602896971_maxresdefault.jpg" class="attachment-magazinebook-featured-image-medium size-magazinebook-featured-image-medium wp-post-image" alt="" decoding="async" loading="lazy" srcset="https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/10/1602896971_maxresdefault.jpg 1280w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/10/1602896971_maxresdefault-300x169.jpg 300w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/10/1602896971_maxresdefault-1024x576.jpg 1024w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/10/1602896971_maxresdefault-768x432.jpg 768w, https://theengineeringofconsciousexperience.com/wp-content/uploads/2020/10/1602896971_maxresdefault-520x293.jpg 520w" sizes="auto, (max-width: 501px) 100vw, 501px" /> </a> <span class="cat-links"><a href="https://theengineeringofconsciousexperience.com/category/gpt-3/" rel="category tag">GPT 3</a></span> <header class="entry-header"> <h3 class="entry-title"><a href="https://theengineeringofconsciousexperience.com/what-its-like-to-be-a-computer-an-interview-with-gpt-3/" rel="bookmark">What It's Like To be a Computer: An Interview with GPT-3</a></h3> <div class="entry-meta"> <span class="posted-on"><i class="far fa-calendar-alt"></i><a href="https://theengineeringofconsciousexperience.com/what-its-like-to-be-a-computer-an-interview-with-gpt-3/" rel="bookmark"><time class="entry-date published updated" datetime="2020-09-18T17:27:56-07:00">September 18, 2020</time></a></span><span class="byline"><i class="far fa-user-circle"></i><span class="author vcard"><a class="url fn n" href="https://theengineeringofconsciousexperience.com/author/9a46b827146f1547cd32f0f51a46ff84/">Eric Elliott</a></span></span> </div> </header> </article> </div> <div id="comments" class="comments-area"> <h5 class="comments-title"> One thought on “<span>The Inside View #3–Evan Hubinger—Takeoff speeds, Risks from learned optimization & Interpretability</span>” </h5> <ol class="comment-list"> <li id="comment-238867" class="comment even thread-even depth-1"> <article id="div-comment-238867" class="comment-body"> <footer class="comment-meta"> <div class="comment-author vcard"> <b class="fn"><a href="https://www.youtube.com/channel/UCb9F9_uV24PGj6x63PhXEVw" class="url" rel="ugc external nofollow">The Inside View</a></b> <span class="says">says:</span> </div> <div class="comment-metadata"> <a href="https://theengineeringofconsciousexperience.com/the-inside-view-3-evan-hubinger-takeoff-speeds-risks-from-learned-optimization-interpretability/#comment-238867"><time datetime="2021-05-27T11:53:02-07:00">May 27, 2021 at 11:53 am</time></a> </div> </footer> <div class="comment-content"> <p><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=00m00s">00:00</a> Evan's background @ MIRI & OpenAI<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=3m29s">3:29</a> Coconut & functional programming<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=6m56s">6:56</a> Homogeneity in AI takeoff<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=14m08s">14:08</a> Reproducing SoTA & openness in multipolar scenarios<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=29m16s">29:16</a> Quantilizers & operationalizing strategy stealing<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=42m16s">42:16</a> Risks from learned optimization & evolution<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=52m29s">52:29</a> Learned optimization in Machine Learning<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=1h04m51s">1:04:51</a> Clarifying Inner AI Alignment terminology<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=1h13m33s">1:13:33</a> Transparency & Interpretability<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=1h25m26s">1:25:26</a> 11 proposals for safe advanced AI<br /><a href="https://www.youtube.com/watch?v=uQN0wqzy164&t=1h39m13s">1:39:13</a> Underappreciated problems in AI Alignment & surprising advances in AI</p> </div> </article> </li> </ol> <p class="no-comments">Comments are closed.</p> </div> </main> </div> <div class="col-md-3 px-lg-3 "> </div> </div> </div> </div> <footer id="colophon" class="site-footer"> <div class="container"> <div class="row"> <div class="col-md-12 text-center"> <div class="site-info"> <span> Powered By: <a href="https://wordpress.org/" target="_blank">WordPress</a> </span> <span class="sep"> | </span> <span> Theme: <a href="https://odiethemes.com/themes/magazinebook/" target="_blank">MagazineBook</a> By OdieThemes </span> </div> </div> </div> </div> </footer> </div> <script>(function(){var advanced_ads_ga_UID="UA-88163215-1",advanced_ads_ga_anonymIP=!!1;window.advanced_ads_check_adblocker=function(){var t=[],n=null;function e(t){var n=window.requestAnimationFrame||window.mozRequestAnimationFrame||window.webkitRequestAnimationFrame||function(t){return setTimeout(t,16)};n.call(window,t)}return e((function(){var a=document.createElement("div");a.innerHTML=" ",a.setAttribute("class","ad_unit ad-unit text-ad text_ad pub_300x250"),a.setAttribute("style","width: 1px !important; height: 1px !important; position: absolute !important; left: 0px !important; top: 0px !important; overflow: hidden !important;"),document.body.appendChild(a),e((function(){var e,o,i=null===(e=(o=window).getComputedStyle)||void 0===e?void 0:e.call(o,a),d=null==i?void 0:i.getPropertyValue("-moz-binding");n=i&&"none"===i.getPropertyValue("display")||"string"==typeof d&&-1!==d.indexOf("about:");for(var c=0,r=t.length;c<r;c++)t[c](n);t=[]}))})),function(e){"undefined"==typeof advanced_ads_adblocker_test&&(n=!0),null!==n?e(n):t.push(e)}}(),(()=>{function t(t){this.UID=t,this.analyticsObject="function"==typeof gtag;var n=this;return this.count=function(){gtag("event","AdBlock",{event_category:"Advanced Ads",event_label:"Yes",non_interaction:!0,send_to:n.UID})},function(){if(!n.analyticsObject){var e=document.createElement("script");e.src="https://www.googletagmanager.com/gtag/js?id="+t,e.async=!0,document.body.appendChild(e),window.dataLayer=window.dataLayer||[],window.gtag=function(){dataLayer.push(arguments)},n.analyticsObject=!0,gtag("js",new Date)}var a={send_page_view:!1,transport_type:"beacon"};window.advanced_ads_ga_anonymIP&&(a.anonymize_ip=!0),gtag("config",t,a)}(),this}advanced_ads_check_adblocker((function(n){n&&new t(advanced_ads_ga_UID).count()}))})();})();</script><script type="speculationrules"> {"prefetch":[{"source":"document","where":{"and":[{"href_matches":"/*"},{"not":{"href_matches":["/wp-*.php","/wp-admin/*","/wp-content/uploads/*","/wp-content/*","/wp-content/plugins/*","/wp-content/themes/magazinebook/*","/*\\?(.+)"]}},{"not":{"selector_matches":"a[rel~=\"nofollow\"]"}},{"not":{"selector_matches":".no-prefetch, .no-prefetch a"}}]},"eagerness":"conservative"}]} </script> <div style="clear:both;width:100%;text-align:center; font-size:11px; "><a target="_blank" title="WP2Social Auto Publish" href="https://xyzscripts.com/wordpress-plugins/facebook-auto-publish/compare" >WP2Social Auto Publish</a> Powered By : <a target="_blank" title="PHP Scripts & Programs" href="http://www.xyzscripts.com" >XYZScripts.com</a></div> <script data-category="functional"> window['gtag_enable_tcf_support'] = false; window.dataLayer = window.dataLayer || []; function gtag(){dataLayer.push(arguments);} gtag('js', new Date()); gtag('config', '', { cookie_flags:'secure;samesite=none', }); </script> <script type="text/javascript" id="wpsi-search-navigation-js-extra"> /* <![CDATA[ */ var wpsi_search_navigation = {"ajaxurl":"https://theengineeringofconsciousexperience.com/wp-admin/admin-ajax.php","token":"66665e525e"}; //# sourceURL=wpsi-search-navigation-js-extra /* ]]> */ </script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/plugins/wp-search-insights/assets/js/search-navigation.js?ver=2.1" id="wpsi-search-navigation-js"></script> <script type="text/javascript" id="linkprefetcher-js-before"> /* <![CDATA[ */ window.LP_CONFIG = {"activeOnDesktop":true,"behavior":"mouseHover","hoverDelay":60,"instantClick":false,"activeOnMobile":true,"mobileBehavior":"viewport","ignoreKeywords":"#,?","isMobile":false} //# sourceURL=linkprefetcher-js-before /* ]]> */ </script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/plugins/bluehost-wordpress-plugin/vendor/newfold-labs/wp-module-performance/build/assets/link-prefetch.min.js?ver=4.14.0" id="linkprefetcher-js" defer></script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/themes/magazinebook/js/navigation.js?ver=1.0.9" id="magazinebook-navigation-js"></script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/themes/magazinebook/js/skip-link-focus-fix.js?ver=1.0.9" id="magazinebook-skip-link-focus-fix-js"></script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/themes/magazinebook/js/jquery.easy-ticker.js?ver=3.1.0" id="magazinebook-news-ticker-js"></script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/themes/magazinebook/js/splide.min.js?ver=2.3.1" id="splide-js-js"></script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/themes/magazinebook/js/theme.js?ver=1.0.9" id="magazinebook-theme-js-js"></script> <script type="text/javascript" src="https://theengineeringofconsciousexperience.com/wp-content/plugins/advanced-ads/admin/assets/js/advertisement.js?ver=2.0.17" id="advanced-ads-find-adblocker-js"></script> <script id="wp-emoji-settings" type="application/json"> {"baseUrl":"https://s.w.org/images/core/emoji/17.0.2/72x72/","ext":".png","svgUrl":"https://s.w.org/images/core/emoji/17.0.2/svg/","svgExt":".svg","source":{"concatemoji":"https://theengineeringofconsciousexperience.com/wp-includes/js/wp-emoji-release.min.js?ver=620c2d504e17f91f58efd04f9ffbb519"}} </script> <script type="module"> /* <![CDATA[ */ /*! This file is auto-generated */ const a=JSON.parse(document.getElementById("wp-emoji-settings").textContent),o=(window._wpemojiSettings=a,"wpEmojiSettingsSupports"),s=["flag","emoji"];function i(e){try{var t={supportTests:e,timestamp:(new Date).valueOf()};sessionStorage.setItem(o,JSON.stringify(t))}catch(e){}}function c(e,t,n){e.clearRect(0,0,e.canvas.width,e.canvas.height),e.fillText(t,0,0);t=new Uint32Array(e.getImageData(0,0,e.canvas.width,e.canvas.height).data);e.clearRect(0,0,e.canvas.width,e.canvas.height),e.fillText(n,0,0);const a=new Uint32Array(e.getImageData(0,0,e.canvas.width,e.canvas.height).data);return t.every((e,t)=>e===a[t])}function p(e,t){e.clearRect(0,0,e.canvas.width,e.canvas.height),e.fillText(t,0,0);var n=e.getImageData(16,16,1,1);for(let e=0;e<n.data.length;e++)if(0!==n.data[e])return!1;return!0}function u(e,t,n,a){switch(t){case"flag":return n(e,"\ud83c\udff3\ufe0f\u200d\u26a7\ufe0f","\ud83c\udff3\ufe0f\u200b\u26a7\ufe0f")?!1:!n(e,"\ud83c\udde8\ud83c\uddf6","\ud83c\udde8\u200b\ud83c\uddf6")&&!n(e,"\ud83c\udff4\udb40\udc67\udb40\udc62\udb40\udc65\udb40\udc6e\udb40\udc67\udb40\udc7f","\ud83c\udff4\u200b\udb40\udc67\u200b\udb40\udc62\u200b\udb40\udc65\u200b\udb40\udc6e\u200b\udb40\udc67\u200b\udb40\udc7f");case"emoji":return!a(e,"\ud83e\u1fac8")}return!1}function f(e,t,n,a){let r;const o=(r="undefined"!=typeof WorkerGlobalScope&&self instanceof WorkerGlobalScope?new OffscreenCanvas(300,150):document.createElement("canvas")).getContext("2d",{willReadFrequently:!0}),s=(o.textBaseline="top",o.font="600 32px Arial",{});return e.forEach(e=>{s[e]=t(o,e,n,a)}),s}function r(e){var t=document.createElement("script");t.src=e,t.defer=!0,document.head.appendChild(t)}a.supports={everything:!0,everythingExceptFlag:!0},new Promise(t=>{let n=function(){try{var e=JSON.parse(sessionStorage.getItem(o));if("object"==typeof e&&"number"==typeof e.timestamp&&(new Date).valueOf()<e.timestamp+604800&&"object"==typeof e.supportTests)return e.supportTests}catch(e){}return null}();if(!n){if("undefined"!=typeof Worker&&"undefined"!=typeof OffscreenCanvas&&"undefined"!=typeof URL&&URL.createObjectURL&&"undefined"!=typeof Blob)try{var e="postMessage("+f.toString()+"("+[JSON.stringify(s),u.toString(),c.toString(),p.toString()].join(",")+"));",a=new Blob([e],{type:"text/javascript"});const r=new Worker(URL.createObjectURL(a),{name:"wpTestEmojiSupports"});return void(r.onmessage=e=>{i(n=e.data),r.terminate(),t(n)})}catch(e){}i(n=f(s,u,c,p))}t(n)}).then(e=>{for(const n in e)a.supports[n]=e[n],a.supports.everything=a.supports.everything&&a.supports[n],"flag"!==n&&(a.supports.everythingExceptFlag=a.supports.everythingExceptFlag&&a.supports[n]);var t;a.supports.everythingExceptFlag=a.supports.everythingExceptFlag&&!a.supports.flag,a.supports.everything||((t=a.source||{}).concatemoji?r(t.concatemoji):t.wpemoji&&t.twemoji&&(r(t.twemoji),r(t.wpemoji)))}); //# sourceURL=https://theengineeringofconsciousexperience.com/wp-includes/js/wp-emoji-loader.min.js /* ]]> */ </script> <script>!function(){window.advanced_ads_ready_queue=window.advanced_ads_ready_queue||[],advanced_ads_ready_queue.push=window.advanced_ads_ready;for(var d=0,a=advanced_ads_ready_queue.length;d<a;d++)advanced_ads_ready(advanced_ads_ready_queue[d])}();</script> </body> </html>