GenAI companies are transitioning to smaller, task-specific open-source language models
The era of massive, one-size-fits-all language models is fading. What’s emerging is smarter, leaner, and significantly more adaptable: smaller open-source models tuned precisely to specific functions. Microsoft, HubSpot, and ServiceNow are not just experimenting with this approach; they’re committing to it. They’re doing what smart companies do: aligning technology to real business needs.
What makes these smaller models valuable? They’re targeted. They’re lighter. And they’re faster to deploy. This isn’t about cutting corners; it’s about picking the right tool for the job. When you need an AI agent to handle customer service, summarize documents, generate content, or even onboard a new employee, there’s now a reliable way to deploy a model trained to excel at exactly that role.
Craig LeClair from Forrester Research nailed it: companies aren’t going to rely on just one or two general models. They’re going to deploy hundreds, each one trained on a specific domain. This means task-specific intelligence at scale, and that changes the economics of automation. You spend less energy coaxing a large model into behaving, and more on deploying the exact capability needed, when and where it matters.
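To make “hundreds of task-specific models” concrete, here is a minimal sketch of the routing layer such a setup implies: a registry mapping each business task to the small model tuned for it. Everything here, model names, endpoints, token budgets, is an illustrative assumption, not any vendor’s actual catalog.

```python
# Illustrative sketch: route each business task to a small, task-tuned model
# rather than sending everything to one general-purpose LLM.
# Model names and endpoints below are hypothetical placeholders.

from dataclasses import dataclass

@dataclass
class ModelSpec:
    name: str          # identifier of the fine-tuned small model
    max_tokens: int    # generation budget appropriate to this task
    endpoint: str      # where the model is served internally

# One small specialist per domain, instead of one giant model for everything.
TASK_REGISTRY: dict[str, ModelSpec] = {
    "customer_service": ModelSpec("cs-assistant-7b", 512, "http://models.internal/cs"),
    "summarization":    ModelSpec("doc-summarizer-3b", 256, "http://models.internal/sum"),
    "content_gen":      ModelSpec("marketing-writer-7b", 1024, "http://models.internal/gen"),
    "onboarding":       ModelSpec("hr-onboarding-3b", 512, "http://models.internal/hr"),
}

def route(task: str) -> ModelSpec:
    """Pick the specialist model for a task; fail loudly on unknown tasks."""
    try:
        return TASK_REGISTRY[task]
    except KeyError:
        raise ValueError(f"No specialist model registered for task: {task}")

print(route("summarization"))  # -> the cheap, fast 3B summarizer, not a frontier model
```

The economics follow directly from the table: each row is a small, inexpensive model, and adding a new capability is a new row, not a new platform.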
For CxOs, this trend marks a meaningful operational shift. With leaner models, deployments can scale faster business-wide. You don’t have to wait for an LLM to catch up with your use case; you direct the implementation based on current processes and data, and keep moving forward. That’s agility. That’s control.
Open-source models offer flexibility
Open-source language models have hit their stride, and the benefits are clear. They reduce compute loads, give teams the freedom to fine-tune for specific tasks, and offer far more flexibility than their proprietary counterparts. In practice, this cuts costs and accelerates time-to-value, especially in enterprise environments that demand accuracy, adaptability, and speed.
Microsoft is already embedding smaller open-source models into its core product platforms. Phi Silica, its compact small language model built on the openly published Phi family, is integrated into Microsoft 365 apps that work offline on Windows devices equipped with neural processing units. Aparna Chennapragada, Microsoft’s Chief Product Officer for Experiences and Devices, explained how their teams are tuning these models for specific tasks: writing, analysis, even image generation. That’s smart allocation of resources and targeted performance optimization.
HubSpot’s approach is similar in philosophy. The company is using models from Mistral AI for text and Stable Diffusion 3 for image generation. Nicholas Holland, Head of AI and SVP of Product at HubSpot, made it clear: they’re not out to reinvent the wheel. They’re applying the right existing models to the right needs. That doesn’t just reduce overhead; it ensures fast, specific outcomes for customers without the drag of massive computation.
What makes this relevant for executives? Flexibility and governance. Open-source tools can be customized internally, governed under your own data policies, and deployed on your terms. This gives you more control, lower costs, and better privacy, all without depending on the roadmap of a single vendor.
The larger takeaway? Efficiency wins. Whether your priority is data sovereignty, enterprise agility, or sharper margins, open-source LLMs now deliver an effective balance of performance and control.
Microsoft is driving innovation by integrating smaller open-source LLMs
Microsoft is building AI tools that don’t rely on a constant connection to the cloud. That’s a fundamental shift. With the introduction of Phi Silica, a compact language model derived from its open Phi family, Microsoft is equipping Windows AI PCs with the ability to run tasks locally, without needing to call external servers. This means users can stay productive regardless of connectivity while benefiting from AI capabilities that are fine-tuned for the work they’re doing.
Chennapragada made the point clearly: Microsoft isn’t just using one small model; it’s assembling a toolkit of lightweight, post-trained models that serve distinct operational tasks. Writing assistance, complex document analysis, image creation: each function is matched with a model optimized to deliver results efficiently and quickly on-device.
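Phi Silica itself is invoked through Windows platform APIs rather than downloaded directly, so as a stand-in for the same local pattern, here is a minimal sketch running the openly published Phi-3-mini checkpoint fully on-device with Hugging Face transformers. It assumes transformers, torch, and accelerate are installed and the weights have been cached locally.

```python
# Minimal sketch of on-device inference with a small open Phi-family model.
# Uses the openly published Phi-3-mini checkpoint as a stand-in for the
# Phi Silica pattern: once weights are cached, no network call is needed.
# Assumes: pip install transformers torch accelerate

from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="microsoft/Phi-3-mini-4k-instruct",  # ~3.8B params, runs locally
    device_map="auto",  # CPU or local GPU; accelerate picks the placement
)

messages = [
    {"role": "user", "content": "Summarize this meeting note in two sentences: ..."}
]

# The model runs entirely on-device after the initial download, which is
# what enables offline productivity scenarios on AI PCs.
out = generator(messages, max_new_tokens=120)
print(out[0]["generated_text"])
```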
This isn’t about cutting down on features. It’s about precision and performance. For executives, that translates to real-world benefits. Teams can continue creating, designing, writing, or solving, even in environments where cloud access isn’t stable or allowed due to policy or compliance. It also brings compute under tighter organizational control, especially when paired with hardware like neural processing units.
From a broader perspective, this approach reduces internal reliance on external compute resources. It lowers latency, enables critical workflows in offline environments, and helps scale AI adoption in industries with strict data sensitivity requirements, such as legal, finance, or healthcare, with fewer compromises.
HubSpot’s AI strategy
HubSpot isn’t pouring capital into building the next breakthrough AI model. Instead, the company is focused on execution, on using what’s already best-in-class to get customers faster, more accurate results. Its Breeze AI platform uses a stack of open-source models that can be deployed where they perform best, including models from Mistral AI for natural language tasks and Stable Diffusion 3 for visual content generation.
Holland put it plainly: HubSpot isn’t concerned with owning foundational models. It’s concerned with solving business problems at speed and scale. The model used depends entirely on the task: if reasoning is required, a model optimized for logic gets the job; if it’s text or image generation, a faster, smaller model can step in and deliver without unnecessary overhead.
This modular view puts customer outcomes front and center. It also allows HubSpot to adapt rapidly. Models can be swapped in and out, or fine-tuned for different capabilities as needed, without overhauling the infrastructure. That’s operational focus, and executives should take note.
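One reading of “swap models in and out without overhauling the infrastructure” is a thin, shared interface in front of every model. The sketch below is an illustrative pattern, not HubSpot’s actual code; class names and model identifiers are hypothetical.

```python
# Illustrative adapter pattern: every model, open-source or hosted, sits
# behind the same interface, so re-pointing a capability is a one-line change.
# Class and model names are hypothetical, not HubSpot's implementation.

from abc import ABC, abstractmethod

class TextModel(ABC):
    @abstractmethod
    def generate(self, prompt: str) -> str: ...

class MistralAdapter(TextModel):
    def generate(self, prompt: str) -> str:
        # Would call a self-hosted Mistral endpoint here; stubbed for the sketch.
        return f"[mistral] {prompt[:40]}..."

class FineTunedAdapter(TextModel):
    def __init__(self, checkpoint: str):
        self.checkpoint = checkpoint  # e.g. an internally fine-tuned variant
    def generate(self, prompt: str) -> str:
        return f"[{self.checkpoint}] {prompt[:40]}..."

def draft_followup_email(model: TextModel, context: str) -> str:
    # Product code depends only on the interface, never the vendor,
    # so swapping the underlying model never touches this function.
    return model.generate(f"Draft a follow-up email about: {context}")

print(draft_followup_email(MistralAdapter(), "renewal pricing"))
```

The design choice is the point: because feature code only sees the interface, the stack underneath can evolve as the open-source ecosystem does.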
For business leaders, the value proposition is clear. There’s greater flexibility, lower R&D overhead, and faster iteration cycles on features that directly affect customer engagement. It also means avoiding lock-in and maintaining competitive adaptability. HubSpot’s strategy shows the advantage of working with the ecosystem instead of trying to own it.
ServiceNow is advancing enterprise AI
ServiceNow is taking a disciplined approach to generative AI by zeroing in on reasoning over raw size. In collaboration with Nvidia, they developed Apriel, a purpose-built, open-source language model with just 15 billion parameters. That’s considerably smaller than most foundation models, but it’s intentionally designed to perform better at enterprise-grade reasoning tasks like IT support, HR decision assistance, and customer service workflows.
Dorit Zilbershot, ServiceNow’s Group Vice President of AI Experiences and Innovation, emphasized that more parameters don’t automatically mean better output, especially in real-world enterprise scenarios where the model’s ability to reason accurately within a structured system matters more than broad generalization. Apriel achieves faster inference speeds and lower compute demands, which are both critical for performance and cost control.
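Because Apriel ships as open weights, standing it up looks like loading any other Hugging Face checkpoint. The sketch below is illustrative: the repo id is our assumption about where the weights are published (check the ServiceNow-AI organization for the actual name), and it assumes a GPU with enough memory for 15B-parameter inference in bf16.

```python
# Hedged sketch: running an open ~15B reasoning model such as Apriel locally.
# The repo id below is an assumption about where the weights live, not a
# confirmed identifier; verify against the ServiceNow-AI Hugging Face org.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "ServiceNow-AI/Apriel-Nemotron-15b-Thinker"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # 15B fits on a single large GPU in bf16
    device_map="auto",
)

# An enterprise-style reasoning task: structured triage, not open-ended chat.
prompt = (
    "An employee reports VPN failures after a password reset. "
    "List the three most likely causes, most probable first."
)

inputs = tokenizer.apply_chat_template(
    [{"role": "user", "content": prompt}],
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

output = model.generate(inputs, max_new_tokens=300)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```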
This approach aligns well with enterprise requirements. Large models consume considerable infrastructure, which isn’t always necessary when the task demands clarity, logic, and contextual consistency. Apriel’s lean design makes it deployable, governable, and scalable within production environments without weakening performance.
For C-suite leaders, this is an efficient use of AI capital. You invest in intelligence where it drives outcomes, not complexity. Apriel shows that stronger reasoning can come from tailored, compact models, not always from sheer scale. That gives enterprises more reliable tools, faster implementations, and reduced operational friction.
Data governance and control concerns
Data governance isn’t a side consideration anymore; it’s a core factor in how AI is deployed across industries. Enterprises are increasingly looking for ways to retain tighter control over training data, model behavior, and access rules. Running open-source models on-premise checks all of those boxes. It provides transparency, ensures alignment with internal compliance protocols, and reduces the risk of data exposure.
Craig LeClair, Vice President and Principal Analyst at Forrester Research, explained this shift clearly. He pointed out that many organizations are drifting back to on-premise infrastructure because they want AI models that are trained on proprietary data under controlled conditions. The primary concern is the possibility of intellectual property leakage or breakdowns in governance when models are managed externally through third-party APIs.
For decision-makers, this trend represents a straightforward calculation: when your data shapes your value proposition, you keep control of the inputs and outputs. Open-source models running internally give enterprises full visibility into what the model is learning, how it operates, and how its responses evolve after post-training.
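In practice, “running open-source models internally” usually means a self-hosted inference server inside your own network boundary. The sketch below assumes such a server (for example, vLLM or Ollama, both of which expose OpenAI-compatible endpoints) running on internal infrastructure; the hostname and model name are illustrative.

```python
# Sketch of a governed, on-premise inference call: the model is served
# inside the corporate network, so prompts and proprietary data never
# leave organizational control. Hostname and model name are illustrative;
# assumes a self-hosted server (e.g. vLLM or Ollama) with an
# OpenAI-compatible API. Requires: pip install openai

from openai import OpenAI

client = OpenAI(
    base_url="http://llm.internal.corp:8000/v1",  # on-prem endpoint, not a SaaS API
    api_key="not-needed-internally",              # auth handled by an internal gateway
)

response = client.chat.completions.create(
    model="internal-finetune-8b",  # a model post-trained on proprietary data
    messages=[
        {"role": "user", "content": "Classify this contract clause: ..."}
    ],
    max_tokens=200,
)

# Request, weights, and logs all stay under the organization's own
# governance policies, end to end.
print(response.choices[0].message.content)
```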
This direction also improves long-term AI sustainability. You’re not reliant on vendor updates or changes to outside models; you’re building with assets that remain under full organizational governance. That strategic autonomy becomes more important as AI matures into mission-critical infrastructure. Enterprises that prepare for this now will scale more confidently later.
Key takeaways for leaders
- GenAI companies are transitioning to smaller, task-specific open-source language models: Leaders should adopt smaller, domain-specific AI models to accelerate deployment and improve task accuracy across business functions while maintaining cost efficiency.
- Open-source models offer flexibility, customization, and efficient cost management: Favor open-source LLMs to gain architectural control, reduce infrastructure costs, and customize AI tools quickly for evolving enterprise needs.
- Microsoft is driving innovation by integrating smaller open-source LLMs to enhance offline capabilities in Microsoft 365: Enterprises should consider localizing AI operations with lightweight models to improve data security, ensure offline usability, and lower operational latency.
- HubSpot’s AI strategy is centered on deploying the best-fitting open-source tools rather than developing proprietary models: Avoid overinvesting in proprietary AI; instead, focus on leveraging proven open-source models to quickly meet customer demands and scale smarter.
- ServiceNow is advancing enterprise AI by co-developing a lean, reasoning-optimized model named Apriel with Nvidia: Executives should explore lean models optimized for reasoning to achieve faster decision automation without the heavy compute cost of large-scale LLMs.
- Data governance and control concerns are driving the trend toward on-premise deployment of open-source AI models: Prioritize on-premise deployment of open-source AI to safeguard sensitive data, maintain compliance, and retain strategic control over training sources.