<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[ChatJPT Chatter]]></title><description><![CDATA[A field guide to using cloud, data, and AI in higher education plus leadership, governance, and career insights from 20+ years in higher ed IT.

Views expressed are mine alone and are NOT affiliated with my current employer.]]></description><link>https://substack.chatjpt.io</link><image><url>https://substackcdn.com/image/fetch/$s_!XIft!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5993b70-ec87-400e-86ce-4e93fca43511_400x400.jpeg</url><title>ChatJPT Chatter</title><link>https://substack.chatjpt.io</link></image><generator>Substack</generator><lastBuildDate>Sun, 05 Apr 2026 08:54:11 GMT</lastBuildDate><atom:link href="https://substack.chatjpt.io/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[ChatJPT]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[chatjpt@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[chatjpt@substack.com]]></itunes:email><itunes:name><![CDATA[ChatJPT]]></itunes:name></itunes:owner><itunes:author><![CDATA[ChatJPT]]></itunes:author><googleplay:owner><![CDATA[chatjpt@substack.com]]></googleplay:owner><googleplay:email><![CDATA[chatjpt@substack.com]]></googleplay:email><googleplay:author><![CDATA[ChatJPT]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The ‘Low Friction No Cost’ AI Inference Service for Research That You May Not Know About]]></title><description><![CDATA[Inside the Argonne ALCF platform that's already processed 11 billion tokens for researchers.]]></description><link>https://substack.chatjpt.io/p/low-friction-and-no-cost-the-federated</link><guid isPermaLink="false">https://substack.chatjpt.io/p/low-friction-and-no-cost-the-federated</guid><dc:creator><![CDATA[ChatJPT]]></dc:creator><pubDate>Wed, 31 Dec 2025 17:42:59 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wq1p!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Would you believe that a no-to-low cost inference service already exists for the scientific research community?  I recently ran across a webinar from December from Argonne National Laboratory (<a href="https://www.alcf.anl.gov/sites/default/files/2025-12/2025-12-03-webinar.pdf">webinar slides</a> | <a href="https://docs.alcf.anl.gov/services/inference-endpoints/">website</a>) about their no-to-low cost federated AI inference service. It has been running since 2024, has served ~220 users,  processed ~10 million requests, and generated over 11 billion tokens<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>! </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://substack.chatjpt.io/subscribe?"><span>Subscribe now</span></a></p><div id="youtube2-22caQwCemug" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;22caQwCemug&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/22caQwCemug?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Not only is it is open to the scientific community*, but it costs <strong>zero</strong> dollars per token. AND it hosts LLM, VLM, and embedding inference endpoints that live on dedicated nodes that avoid HPC queues and auto-scale.  It also provides both API and <a href="https://inference.alcf.anl.gov/">OpenWebUI (open source ChatGPT</a>) experiences. Since it relies on Globus Auth, many faculty are probably already technically authenticated to use it <strong>with an approved allocation</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wq1p!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wq1p!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 424w, https://substackcdn.com/image/fetch/$s_!wq1p!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 848w, https://substackcdn.com/image/fetch/$s_!wq1p!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 1272w, https://substackcdn.com/image/fetch/$s_!wq1p!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wq1p!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic" width="1456" height="690" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:690,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:106140,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.chatjpt.io/i/183066488?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wq1p!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 424w, https://substackcdn.com/image/fetch/$s_!wq1p!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 848w, https://substackcdn.com/image/fetch/$s_!wq1p!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 1272w, https://substackcdn.com/image/fetch/$s_!wq1p!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b399e-4053-4c5a-a0d4-1d3e217e0dd6_2886x1368.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3><strong>*The Catch </strong></h3><p>Ok so there is a little bit of friction and cannot just log in and start blasting through tokens. You need an allocation first. However, they do have a <strong><a href="https://www.alcf.anl.gov/science/directors-discretionary-allocation-program">Director&#8217;s Discretionary (DD) Allocation</a></strong> designed for &#8220;startups&#8221; and getting code running. It doesn&#8217;t require DOE sponsorship. It&#8217;s the &#8220;let me just try this out&#8221; path of least resistance before applying for major awards. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/p/low-friction-and-no-cost-the-federated?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://substack.chatjpt.io/p/low-friction-and-no-cost-the-federated?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p> Once your DD proposal is approved, you will receive the <strong>Project Short Name</strong> and <strong>Principal Investigator (PI)</strong>details necessary to finally complete the <a href="https://www.alcf.anl.gov/get-started">account registration</a> process.</p><p>The <a href="https://docs.alcf.anl.gov/account-project-management/accounts-and-access/accounts-and-access-faqs/">account activation and requirements FAQs</a> are helpful, but if you want to skip to the Getting Started portal, the documentation is fairly robust and straightforward: <a href="https://www.alcf.anl.gov/get-started">https://www.alcf.anl.gov/get-started</a>.</p><p>Here is what you will need for the form: </p><ul><li><p><strong>Principal Investigator (PI) Name:</strong> The name of the project lead who will act as your &#8220;Sponsor&#8221; and approve your request.</p></li><li><p><strong>Project / Allocation:</strong> You have to apply and be approved for this before the account is created.</p></li></ul><ul><li><p><strong>Institutional Email:</strong> Use your .edu or .gov address (not Gmail/Yahoo) to avoid delays.</p></li><li><p><strong>ORCID iD:</strong> You are required to link your ORCID account, so ensure you have your login credentials ready.</p></li><li><p><strong>Legal Name &amp; Citizenship info:</strong> You must accurately declare your citizenship status (passport)</p></li><li><p><strong>Curriculum Vitae (CV):</strong> A PDF copy of your CV is often required to be uploaded directly during this step. (for non-US Citizens)</p></li></ul><ul><li><p><strong>Token Preference:</strong> You will need to choose between a <strong>Mobile Token</strong> (app-based) or a <strong>Physical Cryptocard</strong>. - kind of like Duo.</p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading ChatJPT Chatter! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><h3><strong>More about the service</strong></h3><p>They are running this service on their <strong><a href="https://www.alcf.anl.gov/sophia">Sophia cluster</a></strong> (24 nodes of NVIDIA DGX A100s) and the <strong>Metis cluster</strong> (SambaNova SN40L systems).  </p><ul><li><p><strong>Sophia</strong> uses <a href="https://docs.vllm.ai/">vLLM</a> and supports the full range of OpenAI-compatible endpoints including chat, completions, embeddings, and batch processing. <a href="https://docs.alcf.anl.gov/services/inference-endpoints/#sophia-cluster-vllm">(available models</a>)</p></li><li><p><strong>Metis</strong> uses SambaNova&#8217;s inference API and currently supports only chat completions. (<a href="https://docs.alcf.anl.gov/services/inference-endpoints/#metis-cluster-sambanova">available models</a>)</p></li><li><p>There are two more clusters coming soon...possibly Solstice and Equinox&#8230;.which feature tens do thousands of NVIDIA Blackwell GPUs.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5us3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5us3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5us3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5us3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5us3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5us3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg" width="965" height="543" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:543,&quot;width&quot;:965,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:520419,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.chatjpt.io/i/183066488?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5us3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5us3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5us3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5us3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd45d0fde-322a-49dd-bf45-90dac8cfca37_965x543.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As stated earlier, there is a web interface (based on OpenWebUI) that looks like ChatGPT at <a href="https://inference.alcf.anl.gov">https://inference.alcf.anl.gov</a>. It provides multi-model chat access, RAG. agent capability, and all that you would expect from an AI chat interface these days.  But the real fun and power is the programmatic access. The API is OpenAI-compliant. That means your researchers don&#8217;t need to rewrite code&#8230;.just change the <code>base_url</code> in their Python script. So you could easily use it alongside the code you might have written with your on campus Portkey, LiteLLM, OpenRouter, or campus approved AI cloud APIs.</p><pre><code>from openai import OpenAI
from inference_auth_token import get_access_token

# Get your access token
access_token = get_access_token()

client = OpenAI(
    api_key=access_token,
    base_url="https://inference-api.alcf.anl.gov/resource_server/metis/api/v1"
)

response = client.chat.completions.create(
    model="gpt-oss-120b-131072",
    messages=[{"role": "user", "content": "Explain quantum computing in simple terms."}]
)

print(response.choices[0].message.content)</code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/p/low-friction-and-no-cost-the-federated?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://substack.chatjpt.io/p/low-friction-and-no-cost-the-federated?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p>And this is just the tip of the iceberg, these inference services join all the other services and GPU power on the clusters. Some interesting examples they highlighted from the webinar:</p><ol><li><p><strong>Agentic Workflows:</strong> They showed off some real work with <strong><a href="https://github.com/argonne-lcf/ChemGraph">ChemGraph</a></strong>. This is an agentic workflow where the LLM acts as a router, calling out to Python libraries like RDKit to actually simulate molecular properties<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. </p></li><li><p><strong>AuroraGPT (<a href="https://auroragpt.anl.gov">site</a>):</strong> This is a model fine-tuned on scientific literature. </p></li><li><p><strong>Batch vs Real-Time inference:</strong> The inference endpoints are not just for chat and real time completions.  Argonne has built a <strong><a href="https://docs.alcf.anl.gov/services/inference-endpoints/">Batch API</a></strong> specifically for use cases when you want to dump a file with 150,000 requests and let their A100s chew through it.</p></li><li><p><strong>All the other services</strong>: LLMs are only a small fraction of what these clusters offer for science.  <a href="https://www.anl.gov/article/argonne-expands-nations-ai-infrastructure-with-powerful-new-supercomputers">And some exciting upgrades are planned</a> over the next few years due to public-private partnerships.</p></li></ol><p>As with any external compute service, please check with your IT, research computing, and research offices first (they may want to check out the <strong><a href="https://docs.alcf.anl.gov/policies/data-and-software-policies/data-policy/">data policies</a> and <a href="https://docs.alcf.anl.gov/policies/data-and-software-policies/software-policy/">software policies</a> </strong>to determine fit).  But for researchers looking to balance the high demand for AI compute with flat budgets and have a project to lean on, the ALCF service could represent a significant opportunity.</p><p>For questions or support, please contact <a href="mailto:support@alcf.anl.gov?subject=Inference%20Endpoint">ALCF Support</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://substack.chatjpt.io/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>https://www.alcf.anl.gov/sites/default/files/2025-12/2025-12-03-webinar.pdf</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>https://www.cilogon.org/news/globus-auth</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>https://www.anl.gov/article/argonne-expands-nations-ai-infrastructure-with-powerful-new-supercomputers</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>https://www.arxiv.org/pdf/2506.06363</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Yeah the entry-level job market is cooked, but is AI really the villain?]]></title><description><![CDATA[Are we trying to treat a patient without looking at the chart?]]></description><link>https://substack.chatjpt.io/p/yeah-the-entry-level-job-market-stinks</link><guid isPermaLink="false">https://substack.chatjpt.io/p/yeah-the-entry-level-job-market-stinks</guid><dc:creator><![CDATA[ChatJPT]]></dc:creator><pubDate>Sat, 27 Dec 2025 21:43:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!gd-I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Articles blaming AI for bleak entry-level job prospects for are becoming as abundant as the &#8220;AI Bro&#8221; greatest hits.  You know the ones: &#8220;adapt or die&#8221;, &#8220;__ will never be the same&#8221;, and &#8220;it&#8217;s so over&#8221;.  Sigh.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gd-I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gd-I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!gd-I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!gd-I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!gd-I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gd-I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png" width="382" height="226.8125" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ea731943-988c-42df-906f-aaa926f9f81c_1024x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:608,&quot;width&quot;:1024,&quot;resizeWidth&quot;:382,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gd-I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 424w, https://substackcdn.com/image/fetch/$s_!gd-I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 848w, https://substackcdn.com/image/fetch/$s_!gd-I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 1272w, https://substackcdn.com/image/fetch/$s_!gd-I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea731943-988c-42df-906f-aaa926f9f81c_1024x608.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">An AI bot with a graduation cap hanging</figcaption></figure></div><p>And yeah, the numbers are bad. Unemployment for new bachelor&#8217;s degree holders is up over 6%&#8230;.ugly compared to where we were a few years ago. The story goes that ChatGPT, Claude Code, Codex, and Cursor (what&#8217;s with all the C&#8217;s?) and the like are eating that entry-level work. Companies don&#8217;t need juniors anymore, so they say.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://substack.chatjpt.io/subscribe?"><span>Subscribe now</span></a></p><p>I get why that narrative sticks. It&#8217;s scary, sensational, dramatic, and feels right to those of us who have experienced what these tools do. But in <a href="https://onedtech.philhillaa.com/p/many-explanations-are-making-premature-analysis-on-recent-grad-unemployment">Glenda Morgan&#8217;s recent breakdown of this</a>, she points out something inconvenient: the hiring slump actually started before the generative AI boom really took hold. </p><blockquote><p>&#8220;But the argument is also, if not wrong, at least premature. Among other issues, the falloff in new graduate hiring predates the widespread adoption of generative AI. Despite these problems, it is an account that has gained a lot of traction. Because of this, there is a real danger that efforts to address the issue will be focused on the wrong thing.&#8221; - Glenda Morgan</p></blockquote><p>EDIT:  Oh and then there is this <a href="https://fortune.com/2025/12/27/occupations-most-exposed-to-ai-automation-outperform-vanguard/?utm_source=perplexity">timely published study</a> from 12/28 &#8594; <em>In an end-of-year note to investors, <strong><a href="https://fortune.com/company/vanguard/">Vanguard</a></strong> set the record straight: careers that research claims are most susceptible to AI are actually thriving, not dying out.</em></p><blockquote><p><em>&#8220;The approximately 100 occupations most exposed to AI automation are actually outperforming the rest of the labor market in terms of job growth and real wage increases,&#8221; the Vanguard <strong><a href="https://corporate.vanguard.com/content/dam/corp/research/pdf/isg_vemo_2026.pdf?utm_source=newsletter&amp;utm_medium=email&amp;utm_campaign=newsletter_axiosam&amp;stream=top">report revealed</a></strong>. &#8220;This suggests that current AI systems are generally enhancing worker productivity and shifting workers&#8217; tasks toward higher-value activities.&#8221; - Emma Burleigh</em></p></blockquote><p>This matters in higher ed because there already exists a climate of panic around the demographic cliff and worsening sentiment around higher ed ROI. If an AI bogeyman strikes fear into boards and administration, we may prematurely spend a fortune trying to solve AI. We&#8217;ll rewrite curriculums, buy a bunch of shiny new "AI readiness" platforms, engage in expensive Deloitte AI strategy and governance, and blindly give Microsoft even more money for more <s>lame</s> copilots. But if the real issue is just a crappy economy or high interest rates, none of that expensive work is going to change the hiring numbers. </p><p>It highlights something we&#8217;re terrible at in higher ed: we don&#8217;t actually know what&#8217;s happening with our own data. </p><p>University data is all over the place. Career Services has some of it, Alumni Relations has some, the Registrar has some, definitions and access rules are opaque and inconsistent, and quality is often an afterthought.  It&#8217;s a freaking mess. </p><p>(Let&#8217;s not talk about data loss, privacy, and cybersecurity implications&#8230;)</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading ChatJPT Chatter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>So while everyone is wringing their hands about the "future of work," I&#8217;m over here still thinking about something much more boring: integration and data enablement. Are we going to make huge strategic bets based on anecdotes without a clear view of our pipeline? Are employers rejecting our students because they can&#8217;t use AI or because the market competition is flooded?</p><blockquote><p><em>&#8220;There is a real danger that if we focus on the second phase (the expertise upheaval) rather than the first phase and the more immediate causes of the graduate hiring slump, we will fail to address the issue or address it only partially.&#8221; - Glenda Martin</em></p></blockquote><p>Maybe AI really will wipe out entry-level jobs. But are we trying to treat a patient without looking at the chart first? Before we turn the whole university upside down chasing the AI trend, we should probably figure out what our data is actually trying to tell us.  Yes&#8230;I am speaking of that same boring need to build stronger data cultures and capabilities. </p><p>It&#8217;s not as exciting as an "AI Revolution," but it&#8217;s probably a lot more useful. Just think of where you would be now had you started 10 years ago.</p>]]></content:encoded></item><item><title><![CDATA[Cool that Google embedded the entire PubMed dataset in BigQuery for semantic search, but....]]></title><description><![CDATA[Watch out for the sample queries on the site. It could cost you.]]></description><link>https://substack.chatjpt.io/p/cool-that-google-embedded-the-entire</link><guid isPermaLink="false">https://substack.chatjpt.io/p/cool-that-google-embedded-the-entire</guid><dc:creator><![CDATA[ChatJPT]]></dc:creator><pubDate>Tue, 23 Dec 2025 18:21:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!4i3h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>DISCLAIMER: I am relatively new to GCP and Bigquery, so please double check and offer clarifications. That helps us learn.<br></strong></em><br>Google&#8217;s recent announcement about putting the entire PubMed database into BigQuery pre-embedded COMBINED with their recent announcement of MCP-ifying their services sounded too good to be true. Thirty-five million biomedical articles. Semantic search. The ability to create powerful Pubmed researcher agents on GCP.  WITHOUT having to worry about the EXPENSIVE data engineering around maintaining my own embedding pipeline.</p><p>The second paragraph in Google&#8217;s announcement on <a href="https://cloud.google.com/blog/topics/public-sector/accelerate-medical-research-with-pubmed-data-now-available-in-bigquery">Accelerate Medical Research in Bigquery</a> describes this new service in BigQuery: </p><div class="pullquote"><p>At Google Cloud, we&#8217;re addressing this challenge by making PubMed data available as a <a href="https://cloud.google.com/datasets">BigQuery public dataset</a> with vector search capabilities from <a href="https://cloud.google.com/vertex-ai">Vertex AI</a> (both BigQuery and Vertex AI Vector Search are FedRAMP High authorized), enabling semantic search of medical concepts beyond simple keyword matching.</p></div><p>It likely costs thousands of dollars in compute time and significant overhead to keep a vector column up to date on 35 &#8216;million-ish&#8217; articles to facilitate semantic search.<br><br>But Google has already done it. You don&#8217;t have to front the cost to vectorize the library (and maintain it).  </p><p>So I asked myself&#8230;.&#8221;well if BigQuery can now be an MCP server, then can I plug it into Claude Code, OpenWebUI, TypingMind, an OpenAI GPT, or wherever&#8221;?  <br><br>So off I went on this adventure  <s>fools errand</s>. In my GCP account, I followed their "Getting Started" guide, copied their sample query, modified it to include research topics from one of my researchers and watched the magic happen.<br><br>Cool.  It worked.  <em><strong>BUT 115 GB SCANNED FOR ONE QUERY</strong></em>.  That sounds expensive.  The excitement waned.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4i3h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4i3h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 424w, https://substackcdn.com/image/fetch/$s_!4i3h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 848w, https://substackcdn.com/image/fetch/$s_!4i3h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 1272w, https://substackcdn.com/image/fetch/$s_!4i3h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4i3h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic" width="1066" height="1230" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1230,&quot;width&quot;:1066,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:80810,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.chatjpt.io/i/182418163?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4i3h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 424w, https://substackcdn.com/image/fetch/$s_!4i3h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 848w, https://substackcdn.com/image/fetch/$s_!4i3h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 1272w, https://substackcdn.com/image/fetch/$s_!4i3h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a7c6e87-c54d-4ceb-9694-6b680760dfee_1066x1230.heic 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><br>For a single question. At BigQuery&#8217;s on-demand rate, that&#8217;s around 70c per query after free-tier. Run that a thousand times for a serious research project, and it wouldn&#8217;t take much to watch those precious research credits go up in smoke. Stacking that on top of agent costs likely won&#8217;t make leadership happy and optimistic about the value of AI. <br><br>What I found after research is that the difference between a pocket-emptying query and a practically free one came down to deleting a single line of code.</p><h3>The Trap of the "Easy" Query</h3><p>The problem lies in how BigQuery bills you. As it is mostly concerned about how data hungry you are, and the PubMed dataset is massive, the real weight is in the article_text column.<br><br>In the example Google provides, the SQL asks the database to perform a vector search to identify the most semantically similar articles to your query. But it also asks the database to return the full text of those articles in the same swoop.<br><br>This sounds efficient, right? It is not.<br><br>When I ran that standard query, BigQuery had to scan the embedding column to do the math, but because I requested SELECT base.article_text, it also decided to haul back that massive text column along for the ride. That&#8217;s what led to the 100+ GB of data.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading ChatJPT Chatter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><h3>The ~90% Discount</h3><p>But if the vector search only needs the embeddings (the mathematical representation of the text) to figure out which articles match, does it really need the human-readable text until a human wants to dive deeper?  Could that be the job of a second agent and some human in the loop process? Or a deterministic workflow?<br><br>I started by the exact same query and deleted one line: <strong>base.article_text</strong> and hit run. The results came back just as fast, giving me the IDs, titles, and authors of the most relevant papers. But the data processed dropped from 115 GB down to 13ish GB. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YtWr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YtWr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 424w, https://substackcdn.com/image/fetch/$s_!YtWr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 848w, https://substackcdn.com/image/fetch/$s_!YtWr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 1272w, https://substackcdn.com/image/fetch/$s_!YtWr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YtWr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png" width="495" height="334" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:334,&quot;width&quot;:495,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:38438,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.chatjpt.io/i/182418163?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YtWr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 424w, https://substackcdn.com/image/fetch/$s_!YtWr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 848w, https://substackcdn.com/image/fetch/$s_!YtWr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 1272w, https://substackcdn.com/image/fetch/$s_!YtWr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1bdfad6-8b2f-48bc-9b68-f25555a31ffd_495x334.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>By simply not pulling the full text through the computational pipeline, the price of the query dropped from a dollar to a dime. Those savings will add up.</p><h3>Another Way to Build This</h3><p>If you actually need the text (and spoiler: you probably do), maybe don&#8217;t ask for it during the search. <br><br>First, use the lean, 13GB query to find the top X Article IDs. That costs a few cents. Then, run a second, tiny query to fetch the full text for analysis 10 IDs. Cheaper.<br><br>It could probably even get better if you copy the table or just the rows you need to your own project (a one-time or maybe scheduled pull cost of a few dollars) and build your own vector index.  Then you stop scanning GB and start scanning MB.   </p><h3>The Lesson</h3><p>Cloud providers love to show you the "happy path" code that gets you a result in five seconds. FinOPS batteries are often not included.  Watch out.</p><p><em>(Again: Please correct my misunderstandings of BigQuery and help me learn).</em></p><p>Ok now on to the MCP/A2A part of my journey&#8230;.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.chatjpt.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading ChatJPT Chatter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item></channel></rss>