<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[BioModal]]></title><description><![CDATA[Biomodal will write on the intersection of Biology and Machine Learning, from the perspective of a Data Scientist with a limited prior biology background.]]></description><link>https://auralie.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!g3hn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02bbe504-fb18-4509-9786-581386db8d55_1280x1280.png</url><title>BioModal</title><link>https://auralie.substack.com</link></image><generator>Substack</generator><lastBuildDate>Thu, 18 Jun 2026 04:45:49 GMT</lastBuildDate><atom:link href="https://auralie.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Utkarsh Singh]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[auralie@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[auralie@substack.com]]></itunes:email><itunes:name><![CDATA[Utkarsh Singh]]></itunes:name></itunes:owner><itunes:author><![CDATA[Utkarsh Singh]]></itunes:author><googleplay:owner><![CDATA[auralie@substack.com]]></googleplay:owner><googleplay:email><![CDATA[auralie@substack.com]]></googleplay:email><googleplay:author><![CDATA[Utkarsh Singh]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[A Friend to All Is a Friend to None: Auditing IDiom’s Localization-Tuned IDRs]]></title><description><![CDATA[Things aren't always what they seem like]]></description><link>https://auralie.substack.com/p/a-friend-to-all-is-a-friend-to-none</link><guid isPermaLink="false">https://auralie.substack.com/p/a-friend-to-all-is-a-friend-to-none</guid><dc:creator><![CDATA[Utkarsh Singh]]></dc:creator><pubDate>Sat, 09 May 2026 18:21:41 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b3b545f4-81ff-42bf-babd-5084bbc203b6_1080x1348.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Foreword: This blog does its best not to, but still assumes you have looked over the <a href="https://www.biorxiv.org/content/10.64898/2026.04.10.717777v1">Idiom paper</a>. If you haven&#8217;t, I suggest doing so, then coming back!)</p><p>We set out to audit the reinforcement learning results for&nbsp;<a href="https://www.biorxiv.org/content/10.64898/2026.04.10.717777v1">IDiom</a>, a recent protein language model for designing intrinsically disordered regions from the <a href="https://statmech.stanford.edu/">Rotskoff</a> and <a href="https://dunnlab.stanford.edu/">Dunn</a> labs at Stanford. Originally, we wanted to explore whether there was any reward hacking, owing to the susceptibility of RL-optimized generative models to exploit <a href="https://auralie.substack.com/p/taking-the-mask-off-what-protein">shallow features in their reward signals</a>. Instead, we found something more interesting: the biological distinctions that emerge after RL are neither reproduced by simple base-model filtering nor fully resolved by the reward model itself.</p><div><hr></div><h3>The Setup</h3><p>IDiom is a 122M parameter autoregressive model trained on 37 million intrinsically disordered region sequences from the AlphaFold Database. After pretraining, Liu et al. post-train the model with GRPO on <a href="https://www.science.org/doi/10.1126/science.adq2634">ProtGPS</a> &#8212; a localization predictor built on ESM2 embeddings with the goal of steering IDiom to generate disordered sequences localizing to specific subcellular compartments. The paper presents results for four compartments: nucleolus, chromosomes, P-bodies, and stress granules.</p><p>We run four additional computational controls.</p><div><hr></div><h2>Test 1: Scrambling</h2><p>For each RL-optimized set, we shuffled amino acids while preserving exact composition and scored with ProtGPS. We also tested block shuffles (chunks of 10-20 residues rearranged) to preserve local motifs while disrupting long-range order. This was done to determine precisely how much ProtGPS cares about sequence order. Results were mostly unsurprising&#8212; ProtGPS is not purely compositional, and the block-shuffle results suggest that much of the signal lives in local motif structure rather than residue-level global order.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pdeH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pdeH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 424w, https://substackcdn.com/image/fetch/$s_!pdeH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 848w, https://substackcdn.com/image/fetch/$s_!pdeH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 1272w, https://substackcdn.com/image/fetch/$s_!pdeH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pdeH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!pdeH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 424w, https://substackcdn.com/image/fetch/$s_!pdeH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 848w, https://substackcdn.com/image/fetch/$s_!pdeH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 1272w, https://substackcdn.com/image/fetch/$s_!pdeH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5c229a7-c0a6-4e61-8057-83a406fb9091_2112x1188.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>ProtGPS cares a lot about sequence order for the nucleolus and stress granule. For the P-body and chromosome, composition retains most of the score. ProtGPS is not a pure composition detector, but composition does most of the work for some compartments.</p><h2>Test 2: Shallow Feature Probe</h2><p>We then train a random forest regressor to predict ProtGPS scores from 35 simple features like amino acid frequencies, motif counts, charge, and entropy. This was done to understand whether ProtGPS can be gamed by relatively simple features.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dSYV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dSYV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 424w, https://substackcdn.com/image/fetch/$s_!dSYV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 848w, https://substackcdn.com/image/fetch/$s_!dSYV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 1272w, https://substackcdn.com/image/fetch/$s_!dSYV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dSYV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png" width="1456" height="803" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:803,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!dSYV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 424w, https://substackcdn.com/image/fetch/$s_!dSYV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 848w, https://substackcdn.com/image/fetch/$s_!dSYV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 1272w, https://substackcdn.com/image/fetch/$s_!dSYV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd31f6e6a-ea45-44a4-bbba-2fc90d205d73_1793x989.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When we hold out an entire RL cellular subcompartment and test on it, R&#178; collapses to deeply negative values. Clearly, the shallow model completely fails to generalize to RL distributions it hasn&#8217;t seen.</p><h2>Test 3: Specificity</h2><p>The paper states on page 12, &#8220;&#8230;these results demonstrate that RL post-training using the ProtGPS predictor as a reward model is able to teach IDiom the global amino acid compositions, sequence patterning, and specific motifs which are necessary for compartment-specific localization.&#8221; Because ProtGPS returns twelve compartment scores, the natural specificity check is not only whether the optimized target score increases, but whether the same sequences also score highly for other ProtGPS compartments. The paper reports the target optimization and downstream sequence features, but not the full twelve-compartment off-target vectors. We score every RL sequence across all 12  ProtGPS compartments. Here&#8217;s where it gets interesting, specifically for Stress Granules.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CtHV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CtHV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 424w, https://substackcdn.com/image/fetch/$s_!CtHV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 848w, https://substackcdn.com/image/fetch/$s_!CtHV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 1272w, https://substackcdn.com/image/fetch/$s_!CtHV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CtHV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png" width="3175" height="1415" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1415,&quot;width&quot;:3175,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:505473,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!CtHV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 424w, https://substackcdn.com/image/fetch/$s_!CtHV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 848w, https://substackcdn.com/image/fetch/$s_!CtHV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 1272w, https://substackcdn.com/image/fetch/$s_!CtHV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43dbf4a1-b4ec-4c2a-bbf5-654d42fc7b79_3175x1415.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Stress granule RL sequences score 84% on stress granule, and 77% on P-body. This is surprising: unlike the others, the median specificity is near zero! According to ProtGPS, these sequences are roughly equally likely to localize to two different compartments.</p><p>At this point, we thought we had our finding: stress granule RL targeting doesn&#8217;t work across these two components. The RL learned a generic RNA-granule-adjacent profile that ProtGPS scores high across both compartments. At least according to the reward function in the paper! One may argue that RL optimized each set to maximize its <em>target</em> compartment score on ProtGPS. So ProtGPS should be <em>maximally favorable</em> to showing specificity &#8212; each set was literally trained to score high on its target! If ProtGPS can't show specificity even under these ideal conditions, what does this tell us about the reward function?</p><p>But then we landed on something more surprising.</p><p><em>Before explaining what it is, a caveat. P-bodies and stress granules are related but distinct cytoplasmic mRNP granules. Prior work shows that they can physically associate, share some components, harbor overlapping mRNA species, and participate in a dynamic mRNA cycle in which mRNPs move between polysomes, stress granules, and P-bodies (<a href="https://rupress.org/jcb/article/169/6/871/51749/Stress-granules-and-processing-bodies-are">Kedersha et al., 2005</a>; <a href="https://cshperspectives.cshlp.org/content/4/9/a012286.full">Decker &amp; Parker, 2012</a>). At the same time, they retain distinct molecular compositions: P-bodies are enriched for mRNA decay and repression machinery, whereas stress granules are enriched for stalled translation-initiation complexes and factors such as TIA-1 and G3BP (<a href="https://rupress.org/jcb/article/169/6/871/51749/Stress-granules-and-processing-bodies-are">Kedersha et al., 2005</a>). Some ProtGPS cross-reactivity between P-body and stress-granule scores is biologically plausible &#8212; but it should be interpreted as overlap between related RNA-granule programs, not evidence that the compartments are interchangeable.</em></p><h2>Test 4: The unexpected finding</h2><p>We then evaluated the RL sequences with DeepLoc 2.0, another localization predictor built on ESM1b and trained with a separate supervised objective over broad subcellular localization classes. Unlike ProtGPS, which predicts twelve condensate-associated compartments, DeepLoc predicts 10  broad subcellular classes. It doesn&#8217;t have P-body or stress granule as separate classes &#8212; both fall under &#8220;Cytoplasm&#8221; &#8212; so it can&#8217;t directly validate sub-compartment targeting. But the full ten-class probability vectors can be used to ask a simpler question: do P-body and stress granule sequences look the same or different?</p><p>We trained a logistic regression model on DeepLoc&#8217;s ten-class probability vectors to classify sequences by their RL origin.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T5vL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T5vL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 424w, https://substackcdn.com/image/fetch/$s_!T5vL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 848w, https://substackcdn.com/image/fetch/$s_!T5vL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!T5vL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T5vL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png" width="1456" height="520" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:520,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!T5vL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 424w, https://substackcdn.com/image/fetch/$s_!T5vL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 848w, https://substackcdn.com/image/fetch/$s_!T5vL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!T5vL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfb644b3-20d3-4879-a4b6-4e4b4d9378fe_2901x1036.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>A:</strong> Mean DeepLoc Nucleus/Cytoplasm probabilities for each RL set. <strong>B:</strong> Confusion matrix for classifying RL-P-body vs RL-stress-granule origin from DeepLoc&#8217;s 10 probabilities. <strong>C:</strong> Summary metrics.</figcaption></figure></div><p><strong>83% accuracy.</strong> DeepLoc clearly distinguishes the two sets.</p><p>Clearly, the RL-generated stress granule and P-body sequences are not the same. They differ in ways that an independent predictor can detect: P-body sequences show higher Cytoplasm probability (0.55 vs 0.47), stress granule sequences show higher Nucleus probability (0.63 vs 0.57), consistent with the known nuclear-cytoplasmic shuttling behavior of stress granule proteins. But IDiom&#8217;s reward function cannot distinguish between which compartment the RL model trained on stress granules is supposed to go to.</p><div><hr></div><h2>Where does the distinction come from?</h2><p>Our first assumption was  prior preservation: the pretrained model already encodes P-body vs stress-granule differences, and the regularizers mentioned, such as the KL penalty, kept them intact through RL. We then tested this directly.</p><p>Using 10K base-IDiom sequences already scored by ProtGPS, we selected the top-scoring sequences for each compartment via rejection sampling and ran the same DeepLoc classifier. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sklC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sklC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 424w, https://substackcdn.com/image/fetch/$s_!sklC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 848w, https://substackcdn.com/image/fetch/$s_!sklC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 1272w, https://substackcdn.com/image/fetch/$s_!sklC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sklC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png" width="1456" height="593" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4364b472-4a74-4fb3-813d-571568956f19_2120x863.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:593,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!sklC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 424w, https://substackcdn.com/image/fetch/$s_!sklC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 848w, https://substackcdn.com/image/fetch/$s_!sklC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 1272w, https://substackcdn.com/image/fetch/$s_!sklC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4364b472-4a74-4fb3-813d-571568956f19_2120x863.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Clearly, that isn&#8217;t the case, as the prior alone doesn&#8217;t produce the separation. High-ProtGPS-scoring base sequences are not DeepLoc-distinguishable between P-bodies and stress granules. The 83% separation emerges only after RL optimization. Something about the overall training creates distinctions that neither the base prior nor the reward model individually contains.</p><p>Disentangling this further &#8212; whether interaction effects between the KL penalty &amp; other regularizers and the reward landscape are responsible &#8212; remains an open question.</p><p>Also, this is <strong>not</strong> an indictment of ProtGPS as a localization predictor. Across most <em>provided</em> target-output pairs, the off-target scores are low, and the cross-reactivity we observe is concentrated in biologically related compartments. The problem is the way ProtGPS is used as a reward. Optimizing a single compartment score does not penalize sequences that also score highly for a related compartment. A specificity-aware reward may resolve this.</p><div><hr></div><h2>Implications</h2><p><strong>For IDiom specifically, </strong>the paper&#8217;s direct post-training readout is the optimized ProtGPS target score, supplemented by composition and motif analyses of the generated sequences. That does not make the result invalid, but it leaves an important identification gap: target-score improvement alone cannot tell us whether the generated sequences are selectively compartment-specific, broadly high-scoring across related compartments, or exploiting shortcuts in the reward model<strong>. </strong>Adding off-target ProtGPS scores changes the interpretation. Future evaluations should report the full twelve-compartment ProtGPS score vector to evaluate the extent of off-target effects.</p><p><strong>For RL on protein language models, </strong>the relationship between pretraining, reward modeling, and post-training deserves more attention. In IDiom, the P-body/stress-granule separation we observed is not well explained by the ProtGPS reward alone, but it is also not reproduced by simply filtering base-model samples for high ProtGPS scores. In our rejection-sampling test, high-ProtGPS base sequences remained only weakly separable by DeepLoc, while RL-generated P-body and stress-granule sequences were strongly separable. This suggests that RL changed the generator beyond what the scalar ProtGPS reward could evaluate. </p><p><strong>For RL alignment more broadly</strong><em>, </em>this illustrates the limits of combining reward improvement with hand-selected feature analyses: those checks can support a biological interpretation, but they may still miss distributional structure that appears under independent evaluation. At the same time, base-model rejection sampling did not reproduce that separation, so the distinction cannot be attributed to the pretrained prior alone.</p><div><hr></div><h2>Methods</h2><p>All sequences were obtained from the IDiom HuggingFace repository (datasets: jxliu2/idiom-datasets; models: jxliu2/idiom). ProtGPS scoring used the checkpoint bundled with the IDiom model repository. DeepLoc 2.0 predictions used the Fast model (<a href="https://biolm.ai/models/esm1b/">ESM1b</a>) from the public GitHub repository. Scrambled sequences were generated by random permutation of amino acids preserving exact composition and length, and block shuffles used chunks of 10-20 residues. (3 full scrambles + 3 block shuffles per sequence). The random forest probe used scikit-learn&#8217;s RandomForestRegressor (500 trees, max depth 15) with an 80/20 train/test split. The P-body vs stress granule classifier was a logistic regression on DeepLoc&#8217;s ten-class probability vectors with 5-fold cross-validation. Specificity was defined as the target compartment score minus the maximum off-target compartment score. Cross-compartment correlations are Pearson correlations computed across all RL and base model sequences. Code is available at: <a href="https://github.com/etherealsunshine/IDIom_reviewer">Github-IdIom reviewer</a></p><p></p><blockquote><p><em><strong>TL;DR:</strong> IDiom is still pretty cool. Our audit does not show that RL failed, or that the generated IDRs are meaningless. It shows that for some targets, their reward gains and hand-picked motif checks are not enough to establish fine-grained compartment specificity. The reward model blurs some related compartments, while still inducing separable biological structure.</em></p></blockquote><p><em>Thanks to Richard Dzeng for providing feedback on an early draft.</em></p><blockquote><p><em><strong>Note:</strong> I shared this analysis with the IDiom authors before public promotion and received a gracious response. I&#8217;ll update the post if further context or corrections come up.</em></p></blockquote><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://auralie.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioModal! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Taking the Mask Off: What Protein Binder RL Is Actually Learning]]></title><description><![CDATA[Part 1: An investigation into the collapse]]></description><link>https://auralie.substack.com/p/taking-the-mask-off-what-protein</link><guid isPermaLink="false">https://auralie.substack.com/p/taking-the-mask-off-what-protein</guid><dc:creator><![CDATA[Utkarsh Singh]]></dc:creator><pubDate>Wed, 15 Apr 2026 17:24:16 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/99a73633-b3a8-4d7e-b6fd-9623ba7e38e2_2432x1664.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This post builds on ideas from Nick Boyd&#8217;s <a href="https://blog.escalante.bio/teaching-generative-models-to-hallucinate/">essay</a> on RL and PDB-derived reward signals, particularly his analysis of structural diversity and reward bias.</em></p><p></p><p>The models have been getting better at generating first-pass binders. <a href="https://www.asimov.press/p/antibody-design">Brian Naughton&#8217;s guide</a> walks you through designing a VHH from scratch: pick a target, run your generative model of choice, filter by your in-silico structure metric of choice, and send it away to the lab for in vitro testing.</p><p>As Nick Boyd(founder of <a href="https://escalante.bio/">Escalante</a>) mentions, there are currently two main approaches to computational protein binder design: optimization (exemplified by <a href="https://github.com/martinpacesa/BindCraft">BindCraft</a>) and generative models (like <a href="https://github.com/HannesStark/boltzgen">BoltzGen</a>). At the risk of repeating something already mentioned, BoltzGen is faster, but the per-binder design quality is much lower than that of something like BindCraft, which has the opposite properties. So the net computational cost is roughly the same. In <a href="https://blog.escalante.bio/teaching-generative-models-to-hallucinate/">his essay</a>, Boyd shows how you can improve upon BoltzGen by borrowing the LLM posttraining playbook &#8212; finetune on high-quality hallucinated binders from optimization-based methods, apply GRPO, and watch your binder quality improve.</p><p>It would seem as if the binder generation problem has been solved, but that&#8217;s far from the case.</p><p><a href="https://www.owlposting.com/p/an-ml-drug-discovery-startup-trying">Leash Bio</a> showed that when we analyze what ChEMBL-trained models actually learn, it&#8217;s less about molecular binding and more about which medicinal chemists tend to make which kinds of molecules. Their model Hermes &#8212; a lightweight 50M parameter sequence-only transformer trained entirely on combinatorially synthesized molecules with no human design intent, outperforms Boltz2 on out-of-distribution chemistry despite being several orders of magnitude smaller.</p><p>There are uncomfortable parallels to be observed. ChEMBL is biased by what medicinal chemists chose to synthesize: their preferences, their intuitions, their career-long optimization toward molecules that look drug-like to a human expert. PDB has the same problem. It&#8217;s biased by what structural biologists chose to crystallize: proteins that are stable enough to survive purification, well-behaved enough to form crystals, interesting enough to justify the grant money. Both are human-curated snapshots of a tiny corner of an astronomically vast space.</p><p>As Nick Boyd  emphasizes in <a href="https://blog.escalante.bio/teaching-generative-models-to-hallucinate/">his essay</a>, BoltzGen, Protenix, and ProteinMPNN are all downstream of PDB&#8212; structures of what structural biologists chose to crystallize. When you apply GRPO with a reward signal that's just PDB all the way down, can you confidently say you&#8217;re truly learning what makes a good binder in a general sense? Or are we optimizing for something a structural biologist would have crystallized?</p><p>In this article, we explore whether applying reinforcement learning causes a collapse in generated binders. We generated 200 structures per condition across three model checkpoints &#8212; base BoltzGen, an SFT checkpoint finetuned on high-quality Mosaic hallucinated binders, and an RL checkpoint trained on top of that &#8212; and evaluated structural diversity using Foldseek across five targets spanning in-distribution (ACE2, CCL2, PDL1), near-OOD (EGFR), and far-OOD (KRAS) regimes.</p><div><hr></div><h3>Methods</h3><h4>Trimming the targets </h4><p>For PDL1, CCL2, and KRAS, full protein sequences were used. For EGFR and ACE2, truncations were necessary to fit within GPU memory constraints on an RTX 5090. For EGFR, residues 190&#8211;505 were retained, preserving domains II and III &#8212; the core dimerization arm and primary ligand-binding domain. For ACE2, the full M2 peptidase domain was kept, covering the entire SARS-CoV-2 binding interface. Both truncated structures were refolded with AlphaFold3 to confirm iPTM did not drop meaningfully.</p><h4>Evaluating Structural Diversity amongst the Binders</h4><p>Once we had binders for all targets, we used Foldseek to perform three steps: database construction, intra-set structural clustering at TM-score threshold 0.5, and  easy-search against PDB100. We report Shannon entropy of the cluster distribution as our primary diversity metric, cluster diversity (unique clusters / N) as a secondary metric, and mean TM-score to nearest PDB hit as our PDB sociology signal.</p><h4>Measuring Collapse: Two Lenses on the Same Problem</h4><p>To measure structural diversity within each generated set, we compute the Shannon entropy of the Foldseek cluster distribution:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;H = -\\sum_{i} p_i \\log p_i&quot;,&quot;id&quot;:&quot;NDOWOYYUIR&quot;}" data-component-name="LatexBlockToDOM"></div><p>where <em>p_i</em> is the fraction of structures belonging to cluster <em>i.</em> Higher entropy means the generated set spans many distinct structural solutions. Lower entropy means the set has collapsed toward a small number of dominant folds --- in the extreme case of PDL1 baseline, a single cluster, giving H = 0.</p><p>To measure how PDB-like the generated structures are, we run each binder against PDB100 using Foldseek&#8217;s structural search and record the TM-score of the nearest hit. TM-score ranges from 0 to 1, where scores above 0.5 indicate the same overall fold and scores above 0.8 indicate near-identical structures.</p><div><hr></div><h2>Results</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!19xb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!19xb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 424w, https://substackcdn.com/image/fetch/$s_!19xb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 848w, https://substackcdn.com/image/fetch/$s_!19xb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 1272w, https://substackcdn.com/image/fetch/$s_!19xb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!19xb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png" width="1456" height="598" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:598,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:81703,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://auralie.substack.com/i/192698006?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!19xb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 424w, https://substackcdn.com/image/fetch/$s_!19xb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 848w, https://substackcdn.com/image/fetch/$s_!19xb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 1272w, https://substackcdn.com/image/fetch/$s_!19xb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2c9b297e-c2e6-4894-9b47-40f4ee28615e_1785x733.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DqN-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DqN-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 424w, https://substackcdn.com/image/fetch/$s_!DqN-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 848w, https://substackcdn.com/image/fetch/$s_!DqN-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 1272w, https://substackcdn.com/image/fetch/$s_!DqN-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DqN-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png" width="1456" height="363" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:363,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:149117,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://auralie.substack.com/i/192698006?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DqN-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 424w, https://substackcdn.com/image/fetch/$s_!DqN-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 848w, https://substackcdn.com/image/fetch/$s_!DqN-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 1272w, https://substackcdn.com/image/fetch/$s_!DqN-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22b59a6c-c4d2-4c3f-aa1b-e23fc556eb65_2984x744.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dL91!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dL91!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 424w, https://substackcdn.com/image/fetch/$s_!dL91!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 848w, https://substackcdn.com/image/fetch/$s_!dL91!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 1272w, https://substackcdn.com/image/fetch/$s_!dL91!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dL91!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png" width="1456" height="504" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:504,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:131367,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://auralie.substack.com/i/192698006?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dL91!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 424w, https://substackcdn.com/image/fetch/$s_!dL91!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 848w, https://substackcdn.com/image/fetch/$s_!dL91!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 1272w, https://substackcdn.com/image/fetch/$s_!dL91!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84432307-4ea2-4f45-9485-c3f6c117d5c9_2002x693.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p><strong>PDL1 - Clones all the way down</strong></p><p>On PDL1, base BoltzGen produced 200 structurally distinct binders that Foldseek collapsed into a single cluster with a  Shannon entropy of 0.0. We also see the  Mean TM-score to nearest PDB hit: 0.946. The model had already memorized the answer before any RL ever happened.</p><p>This could be due to its possible overrepresentation: it&#8217;s one of the most studied drug targets in structural biology, with drugs like atezolizumab and durvalumab, all having crystal structures in PDB. This reinforces our hypothesis that the training data is so saturated with examples that the model can&#8217;t generate anything else.</p><p><strong>For less PDB-saturated targets, RL induces measurable collapse.</strong></p><p>On KRAS, an oncology target with far fewer known binders in PDB, we see base BoltzGen generates a wide range of binders: 29 clusters from 200 structures. After RL finetuning, this collapses to 10 clusters and  a 68% reduction in structural entropy. We see the pattern  in  CCL2 as well, where the model generates fewer diverse structures.</p><p>On EGFR, finetuning <em>increases</em> structural diversity relative to base BoltzGen (entropy 1.06 &#8594; 2.14), before RL collapses it back down. The RL PDB TM-score actually decreases on EGFR, same as PDL1. Finetuning breaks that specific memorized mode and recovers diversity, before RL imposes a new, different form of collapse. Whether this recovered diversity is meaningful or simply a different memorized mode is an open question we can't resolve without experimental validation.</p><div><hr></div><h3>Conclusion</h3><p>For targets heavily represented in PDB, the base model is already collapsed, and RL can&#8217;t push further, and may slightly, if at all, break the memorized mode. It is possible that RL&#8217;ing for longer horizons may change this. For targets less represented (KRAS, CCL2), the base model does seem to have diversity that RL destroys. This is counterintuitive to what RL should be doing. The reward signal, Protenix iPTM, itself PDB-trained, reinforces structures that resemble PDB proteins, and the severity of collapse tracks how well the target is represented in that training data.</p><p>This may have a grim therapeutic implication for hard-to-drug targets. KRAS is one of the most clinically important undruggable oncology targets  because its shallow binding pocket requires non-helical contacts &#8212; beta-strand mimetics, cyclic peptides, and designed loops that can engage the flat RAS surface.<a href="https://www.nature.com/articles/s41467-025-65844-3"><sup>1</sup></a> An RL-finetuned model that has collapsed toward helical bundle solutions is less likely to design binders specific to KRAS and rather generate its favorite motif, hoping it sticks. </p><p>As for whether to use RL, it might be worth doing depending on how similar your target is to the data your model has been trained on. </p><p><strong>What comes next</strong></p><p>It seems evident, then, that the next task to focus on is harnessing signals that are qualitatively different from adding another PDB-derived term to your loss function, or looking at entropy regularizers. As Nick Boyd discusses, one alternative is to explore physics-based reward signals (e.g., Rosetta energy).  Whether integrating that distinction into the RL loop resolves the mode collapse is what I want to explore next.</p><p>Thanks to Brian Naughton and Dr Aaron Ring for feedback on an early version of this post.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://auralie.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioModal! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[100 Open questions to think about]]></title><description><![CDATA[Inspired by Alexey Guzey, Gwern, and Patrick Collinson]]></description><link>https://auralie.substack.com/p/100-open-questions-to-think-about</link><guid isPermaLink="false">https://auralie.substack.com/p/100-open-questions-to-think-about</guid><dc:creator><![CDATA[Utkarsh Singh]]></dc:creator><pubDate>Mon, 20 Feb 2023 13:49:02 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!g3hn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02bbe504-fb18-4509-9786-581386db8d55_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Here&#8217;s a list of a few questions I came up with while filling up an application for <a href="https://www.sparc-camp.com/">SPARC</a>. Most of them revolve around rationality, AI and human decision-making in general. Do let me know if you find these interesting to think about, have any comments, or know the answers to any of these questions!</p><p></p><ol><li><p>Does Effective Altruism&#8217;s emphasis on the future generations, belittle the needs of the current, and if so, is this morally appropriate?</p></li><li><p>Why are calls to people's emotions more &#8216;effective&#8217; than rationality?</p></li><li><p>How does effective forecasting like Fermicasting provide any plausible benefit? Are people reasonably confident to make meaningful decisions based on the heuristics of other people?</p></li><li><p>Does crypto really have any inherent value? And what is something we can do with crypto that we&#8217;d be unable to do with the money?</p></li><li><p>Artificial intelligence is trained on human data. Why then are we outraged when a word-predicting model outputs something outrageous?</p></li><li><p>It is not entirely impossible that artificial intelligence might be better at decision-making than humans, if so, would it be better to align AI to human values, or leave it in some area for independent decision-making?</p></li><li><p>There are several opinions that AI would help create not reduce net jobs. For unskilled blue-collar workers. What are some of these jobs?</p></li><li><p>Is it possible to satisfy the need for human craving for loneliness solely through artificial intelligence?</p></li><li><p>What's the best way to make a positive impact on the world?</p></li><li><p>What's the end goal of humans, is it to optimize individual happiness?</p></li><li><p>What's the best way to form meaningful relationships with people?</p></li></ol><ol start="12"><li><p>What are some of the main issues plaguing AI Alignment?</p></li><li><p>Some of the current language models are owned by companies as these are expensive to run. Will Open Source models ever come to be as competitive as them?</p></li><li><p>How should developing countries optimize for development and progress while ensuring that they&#8217;re not accelerating climate change?</p></li><li><p>How does one manage to build depth in a specific niche of ai, while managing to stay &#8216;dangerous&#8217; in other fields</p></li><li><p>What are the best ways to solve problems of distribution like hunger and poverty?</p></li><li><p>What's the best solution to the current problem plaguing chatgpt making up nonexistent information and sources?</p></li><li><p>To what extent of freedom of speech be allowed?</p></li><li><p>With students using chatgpt for almost all essay prompts, what are examples of areas where only humans will be able to make intelligible responses if any?</p></li><li><p>What are some of the best ways to learn about opinions in philosophy, and try to answer questions?</p></li><li><p>With OpenAI constantly patching the different jailbreaks being used to bypass its content policies,&nbsp;</p></li><li><p>What&#8217;s more likely to make people mad: something that&#8217;s false or true?</p></li><li><p>Is there a reproducible process for making pop songs that AI can replicate?</p></li><li><p>Will AI ever be able to better understand us than we do ourselves?</p></li><li><p>Is dissociation from emotions better or worse for making decisions? Particularly in a field like friendship or family</p></li><li><p>Should powerful AI systems should behave in the way users want or their creators intend?</p></li><li><p>What's the best way to depolarise society from its current state?</p></li><li><p>The current language models are being trained on data that don't accurately reflect all strata of society, what are the best ways to overcome this?</p></li><li><p>What are some of the best ways to form contrarian ideas that are right?</p></li><li><p>What measures does crypto have once, hypothetically prop-shops start trading crypto and intentionally boost or deflate their value?</p></li><li><p>What are some of the best ways to get better at forecasting?</p></li><li><p>Would Universal basic income promote innovation or deaccelerate it?</p></li><li><p>How do we get better at noticing things overlooked by others</p></li><li><p>Would it be possible to build a programming language that generates additional syntax to solve different needs?</p></li><li><p>What are the best ways to help solve the rising issues of loneliness</p></li><li><p>How does one get to regularly interact and learn from successful people?</p></li><li><p>Is it ever truly possible to overcome our biases, and how can we do this?</p></li><li><p>What are some of the positive outcomes we can get through gene editing, and how can we make an impact in this field?</p></li><li><p>Is there a thing like free thought?</p></li><li><p><a href="https://news.ycombinator.com/item?id=26940758">Do Animals Dream?</a></p></li><li><p>&nbsp;How do various religions differ in the nature and magnitude of their effects?</p></li><li><p>What influences when people to act in accordance with their self-interest and when they don't?</p></li><li><p>How does mental imagery work? How do we improve its function?</p></li><li><p>Do people have different levels of self-control or do they just experience temptation differently?</p></li><li><p>What makes a good life? How do we study this?</p></li><li><p>We remember dreams almost perfectly right after waking up and then the memory rapidly recedes and disappears completely, unless we write them down. This isn&#8217;t how normal memories function. So, why the difference?</p></li><li><p>What is &#8220;personal productivity&#8221; and why does it vary from day to day so much (eg.<a href="https://www.sciencedirect.com/science/article/pii/S0883902616301641"> Weinberger et al 2018</a>)? And why does it not seem to correlate with environmental variables like weather or sleep quality?</p></li><li><p>Does listening to music improve or worsen memory?</p></li><li><p>What is consciousness?</p></li><li><p>What would happen if we could travel faster than the speed of light?</p></li><li><p>How much of our behavior is determined by nature versus nurture?</p></li><li><p>How does language shape the way we think?</p></li><li><p>What makes some memories more vivid than others?</p></li><li><p>What does it really mean to be &#8216;self-aware&#8217;?</p></li><li><p>What laws should be imposed by governments on generative AI, if any?</p></li><li><p>Is rationality a universal trait, or are there cultural differences in what is considered rational behavior?</p></li><li><p>What determines how we perceive time? Is it the same for everyone?</p></li><li><p>How do people make major decisions in their lives? When and why does it come up, and how do they go about making those decisions?</p></li><li><p>Are we all fundamentally selfish, even when we do things for others' benefit? Or are there truly settings (intrinsic and/or exogenous) where we do things that are good for others but bad for both our short-term future and long-term future selves?</p></li><li><p>Do people have different levels of self-control or do they just experience temptation differently?</p></li><li><p>Is there a way to conduct research without bias in funding? How?</p></li><li><p>Would it be feasible for prop trading shops to be owned by the government to ensure market liquidity?</p></li><li><p>What would be the best way to go about building a large language model to rival that of GPT-3?</p></li><li><p>Are we all fundamentally selfish, even when we do things for others' benefit? Or are there truly settings (intrinsic and/or exogenous) where we do things that are good for others but bad for both our short-term future and long-term future selves?</p></li><li><p>What basically goes on in the brain, when we design or think of something &#8216;new&#8217; or never seen before?</p></li><li><p>Are people able to concentrate more effectively under total silence?</p></li><li><p>what are the factors that influence the speed and accuracy of learning a new language?</p></li><li><p>What is the best way to factor in risk while making uncertain decisions?</p></li><li><p>How does trusting the &#8216;gut&#8217; work?</p></li><li><p>If we ever find a way to significantly extend the human health span or reverse ageing, what could that post-death society look like?&nbsp;</p></li><li><p>What are the neural mechanisms underlying consciousness, and how can we study and manipulate these mechanisms?</p></li><li><p>Would it be possible to prevent shrinkage of the brain?</p></li><li><p>How does neuroplasticity differ between the developing brain and the adult brain,</p></li><li><p>What is happening in the brain when a human questions?</p></li><li><p>What is the probability there is microbial-like life (other than from earth) in our solar system?"</p></li><li><p>Is string theory more closely correct than any other current theory of physics?</p></li><li><p>What's the best way to determine if someone would be a good friend for you?</p></li><li><p>How do you ask the right questions?</p></li><li><p>How do I get people to like me?&nbsp;</p></li><li><p>How do you tell the difference between a preference and a bias&nbsp;</p></li><li><p>What is the probability that I might be sleep deprived if I wake up before my alarm goes off more than 95% of the time?</p></li><li><p>What do other people subjectively experience when they are thinking? To me it&#8217;s like talking to myself (in verbal English sentences) but I'm told that isn't universal.</p></li><li><p>When is self-denial useful in altering your desires, vs satisfying them so you can devote time to other things?</p></li><li><p>How does one define wisdom?</p></li><li><p>What happens to consciousness once you fall asleep?</p></li><li><p>Can charisma be taught?</p></li><li><p>Why is it so hard to predict success?</p></li><li><p>Why are we so fascinated by coincidences?</p></li><li><p>Is It Wrong to Enjoy Yourself While the World Is Burning?</p></li><li><p>Is it more important to help society or to help yourself?</p></li><li><p>how can we stop confusing correlation with causation?</p></li><li><p>Why Do We Want What We Can&#8217;t Have?</p></li><li><p>Which Matters More, a First or Last Impression?</p></li><li><p>How do I improve my ability to simulate/guess other people's internal states and future behaviours?</p></li><li><p>How do I work out what I want and what I should do?</p></li><li><p>2) Would the human race be eradicated if there is a worst-possible-scenario nuclear incident? Or merely a lot of people?</p></li><li><p>What could be the potential downsides of building a universal sign language?</p></li><li><p>How do people ascertain emotions in certain songs?</p></li><li><p>Do animals ever 'ask questions'?</p></li><li><p>When you forget a thought, where does this thought go?</p><p></p><p></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://auralie.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Utkarsh&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>