<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
    <channel>
        <title>Cloudless AI Blog</title>
        <link>https://cloudless-ai.app/docs/blog</link>
        <description>Cloudless AI Blog</description>
        <lastBuildDate>Wed, 15 Jan 2025 00:00:00 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>en</language>
        <item>
            <title><![CDATA[Introducing Cloudless AI]]></title>
            <link>https://cloudless-ai.app/docs/blog/introducing-cloudless-ai</link>
            <guid>https://cloudless-ai.app/docs/blog/introducing-cloudless-ai</guid>
            <pubDate>Wed, 15 Jan 2025 00:00:00 GMT</pubDate>
            <description><![CDATA[We're excited to introduce Cloudless AI — a compute layer that routes LLM inference to idle NPU and GPU nodes across your corporate network, falling back to the cloud only when necessary.]]></description>
            <content:encoded><![CDATA[<p>We're excited to introduce <strong>Cloudless AI</strong> — a compute layer that routes LLM inference to idle NPU and GPU nodes across your corporate network, falling back to the cloud only when necessary.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="the-problem">The problem<a href="https://cloudless-ai.app/docs/blog/introducing-cloudless-ai#the-problem" class="hash-link" aria-label="Direct link to The problem" title="Direct link to The problem" translate="no">​</a></h2>
<p>Every corporate network has machines with powerful NPUs and GPUs sitting mostly idle — developer workstations, render farms, on-prem servers. Meanwhile teams are paying cloud bills to run the same inference workloads that those machines could handle.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="our-solution">Our solution<a href="https://cloudless-ai.app/docs/blog/introducing-cloudless-ai#our-solution" class="hash-link" aria-label="Direct link to Our solution" title="Direct link to Our solution" translate="no">​</a></h2>
<p>Cloudless AI sits between your applications and your compute. Install the router, drop the node agent on machines with spare capacity, and point your existing OpenAI or Anthropic SDK at the router's endpoint. That's it — your requests are now routed to the best available on-prem node, with transparent cloud fallback when capacity is full.</p>
<h2 class="anchor anchorTargetStickyNavbar_Vzrq" id="get-started">Get started<a href="https://cloudless-ai.app/docs/blog/introducing-cloudless-ai#get-started" class="hash-link" aria-label="Direct link to Get started" title="Direct link to Get started" translate="no">​</a></h2>
<p>Follow our <a class="" href="https://cloudless-ai.app/docs/docs/guides/quickstart">quickstart guide</a> to have the full stack running locally in under 5 minutes.</p>]]></content:encoded>
            <category>announcement</category>
            <category>product</category>
        </item>
    </channel>
</rss>