When Software Margin Drops to 17%: Four Strategic Escapes for the AI-Native P&L

TV
Thiago Victorino
7 min read
When Software Margin Drops to 17%: Four Strategic Escapes for the AI-Native P&L

For two decades, a SaaS pitch deck carried one constant. Gross margin near 80%. Net margin in the high teens at scale. The whole financial logic of subscription software rested on the assumption that, after the platform was built, serving the next customer cost almost nothing.

AI changed the cost structure underneath that assumption. Inference is not free. It is a per-task expense that scales with usage. When a SaaS company embeds AI features and pays the model bill on behalf of customers, the gross margin floor drops. Not by a few points. By an order of magnitude in the wrong direction.

Domen Jemec published a structured illustration of this on GPTomics in May 2026. He built two models. A traditional SaaS company called GainZ. An AI-native one called AIBoost. Same revenue per user, same growth curve, same operating headcount. The only variable that moves is whether AI inference costs sit inside cost of revenue.

His result, treated as scenario rather than survey: GainZ lands at 80% gross margin and 19% net margin at one million users. AIBoost lands at 17% gross margin and 11% net margin at the same scale. Same product category. Same customer count. Wildly different financial physics.

The illustrative numbers are not the point. The point is the question every operator running an AI feature now has to answer. If your business has been priced on SaaS comparables and your actual cost structure resembles AIBoost, what is your strategic escape?

Jemec sketches four. Each carries a different governance posture, because at 17% gross margin every avoidable agent error is no longer a process problem. It is an existential cost.

The Setup: What the Numbers Actually Show

Jemec’s scenarios use deliberately simple inputs. $120 per user in annual revenue. Fixed costs that grow 10% with every doubling of users. A constant unit cost of inference that scales linearly with usage. Treat the absolute numbers as illustrative. The shape of the spread is what matters.

In traditional SaaS, gross margin is dominated by hosting and support. Both behave like fixed costs that get amortized as you scale. The 80% number falls out of the model almost automatically.

In AI-native software, gross margin is dominated by inference, which behaves like a variable cost that does not amortize. Add more users and you pay more for tokens. The 17% number falls out of the model just as automatically, with no operating mistake required to get there.

Two consequences for any operator. First, the same growth that used to compound profit now compounds the inference bill in lockstep. Second, the gap between your revenue and your model spend is where every governance failure shows up. A 5% retry rate that nobody catches in a SaaS P&L is invisible. In an AI-native P&L at 17% gross margin, that same 5% can move the line between profitable and unprofitable users.

We have covered the supply-side version of this argument in AI Economics Fracture and the consumer-product version in Cursor’s Negative Margin SaaS Inversion. Jemec’s contribution is the strategic-options frame. Once you accept that the margin floor moved, the next question is what you do about it.

Escape One: Cheap Clones

AI tools cut the cost of building software by roughly 90% in Jemec’s framing. That is a tailwind for incumbents, but a much bigger tailwind for new entrants. If a clone takes a year and a small team to ship, the moat from product alone is thin.

The strategic posture here is honest. You compete on price, distribution, or speed of iteration. You do not pretend to have a moat that the build cost no longer supports. You run a tight P&L, you keep token spend disciplined, and you accept that your category will fragment.

Governance question: how do you keep inference costs aligned with revenue when your category is racing to the bottom on price? Every wasted token is a percentage of margin you cannot afford to lose.

Escape Two: Niche and Ultra-Low-Cost

The flip side of cheaper build cost is that previously unprofitable segments become viable. Markets that were too small or too cost-sensitive for the old SaaS economics now justify a focused product.

A niche tool for a specific role in a specific industry, priced at $20 a month instead of $200, would have been a non-starter in 2020. With AI lowering build cost and operations cost together, it can work. The trade is volume for fit. You serve fewer customers, but the ones you serve cannot find a better option.

Governance question: how do you keep operating cost low enough that ultra-low-cost pricing still leaves room for margin? At $20 a month, an extra dollar in monthly inference spend is 5% of revenue. The discipline cannot be optional.

Escape Three: Luxury Software

Jemec’s third escape is the one most operators have not seriously considered. Software priced as a luxury good. Not premium, luxury. Where the price itself is part of the value proposition.

He cites Raya, Dorsia, and Bloomberg Terminal at $32,000 a year. Each one rests on something the buyer cannot get cheaper. Curation, access, network effect, or in Bloomberg’s case, a workflow so deeply embedded in a profession that switching is unthinkable.

The risk is the I Am Rich App. A $999 iPhone app whose only feature was displaying a red gem to signal wealth. It failed because there was no functional differentiation underneath the price. Luxury software without operational substance is performance art, and the market notices.

The honest version is harder. Real luxury software earns the price through something the cheap clones cannot replicate. Proprietary data. A curated network. A workflow lock-in that no model can synthesize on its own.

Governance question: how do you maintain the operational standard the price implies? At luxury pricing, a visible agent failure is not a refund event. It is a brand event.

Escape Four: More Than Software

The fourth escape is the most defensible and the most operationally heavy. You stop being a pure software company. You vertically integrate into atoms, into data, or into a workflow that pure software cannot occupy.

Atoms means physical operations. A surgical robotics company. A drone-as-a-service platform. A fulfillment network with software as the connective tissue. The hardware or operations layer creates a cost structure that pure-AI competitors cannot replicate by writing more code.

Data means proprietary information that competitors cannot scrape. A clinical dataset built over a decade. A pricing engine fed by transactions that only your network sees. The data is the moat, and the model is the access layer to it.

Governance question: how do you keep the integrated layer trustworthy when it touches the physical world or a regulated dataset? Every escape on this list relies on some form of trust, but this one cannot afford a single recoverable failure. A hallucination in a drone routing system is not a margin problem. It is a recall.

The Pattern Across the Four

Each escape exists because the 17% gross margin scenario is real enough that operators have to choose between escapes. Doing nothing is the fifth option, and it is the one where the math runs out first.

Three observations across all four.

The first is that governance discipline becomes financial, not procedural. At 80% gross margin, you can afford to invest in process maturity even if you cannot measure the return. At 17% gross margin, every dollar of preventable model spend lands directly on the bottom line. The teams that survive will be the ones that treat measurement of token spend, agent errors, and retry rates as a unit-economics problem rather than an ops nicety.

The second is that the escapes are not mutually exclusive in the same firm, but they are mutually exclusive in the same product line. A luxury offering and a cheap clone cannot share a roadmap. The decision belongs in strategy, not in product management.

The third is that the comparable benchmark for an AI-native company is no longer the traditional SaaS company in the same vertical. It is whichever escape best describes the strategy. A luxury AI workflow should be compared to luxury services. A niche AI tool should be compared to other niche specialists. Holding AI-native P&Ls against legacy SaaS gross margin will make every legitimate escape look like underperformance.

Do This Now

If you operate an AI-native or AI-adjacent business, take three steps in the next two weeks.

Build the AIBoost-style model for your own product, with your own pricing and your own token spend. Do not borrow Jemec’s numbers. Use yours. Find out where your gross margin actually lands.

Pick the escape that matches your strategy. If you are running an AI feature inside a traditional SaaS and you are not vertically integrated, you are likely already on a cheap-clone or niche trajectory whether you have admitted it or not. Name the trajectory and align pricing, governance investment, and roadmap to it.

Treat your inference governance as a unit-economics line item. Every avoidable retry, every loop, every misrouted call is no longer a process annoyance. At your real gross margin, it is the difference between an escape that funds itself and one that does not.

The 17% scenario is illustrative. The decision it forces is real.


This analysis synthesizes How AI Changes Software P&L (GPTomics / Domen Jemec, May 2026).

Victorino Group helps AI-native and AI-adjacent companies align governance investment with the actual economics of their P&L. Let’s talk.

All articles on The Thinking Wire are written with the assistance of Anthropic's Opus LLM. Each piece goes through multi-agent research to verify facts and surface contradictions, followed by human review and approval before publication. If you find any inaccurate information or wish to contact our editorial team, please reach out at editorial@victorinollc.com . About The Thinking Wire →

If this resonates, let's talk

We help companies implement AI without losing control.

Schedule a Conversation