Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More
The enterprise world is rapidly growing its usage of open source large language models (LLMs), driven by companies gaining more sophistication around AI โ seeking greater control, customization, and cost efficiency.ย
While closed models like OpenAIโs GPT-4 dominated early adoption, open source models have since closed the gap in quality, and are growing at least as quickly in the enterprise, according to multiple VentureBeat interviews with enterprise leaders.
This is a change from earlier this year, when I reported that while the promise of open source was undeniable, it was seeing relatively slow adoption. But Metaโs openly available models have now been downloaded more than 400 million times, the company told VentureBeat, at a rate 10 times higher than last year, with usage doubling from May through July 2024. This surge in adoption reflects a convergence of factors โ from technical parity to trust considerations โ that are pushing advanced enterprises toward open alternatives.
โOpen always wins,โ declares Jonathan Ross, CEO of Groq, a provider of specialized AI processing infrastructure that has seen massive uptake of customers using open models. โAnd most people are really worried about vendor lock-in.โ
Even AWS, which made a $4 billion investment in closed-source provider Anthropic โ its largest investment ever โ acknowledges the momentum. โWe are definitely seeing increased traction over the last number of months on publicly available models,โ says Baskar Sridharan, AWSโ VP of AI & Infrastructure, which offers access to as many models as possible, both open and closed source, via its Bedrock service.ย
The platform shift by big app companies accelerates adoption
Itโs true that among startups or individual developers, closed-source models like OpenAI still lead. But in the enterprise, things are looking very different. Unfortunately, there is no third-party source that tracks the open versus closed LLM race for the enterprise, in part because itโs near impossible to do: The enterprise world is too distributed, and companies are too private for this information to be public. An API company, Kong, surveyed more than 700 users in July. But the respondents included smaller companies as well as enterprises, and so was biased toward OpenAI, which without question still leads among startups looking for simple options. (The report also included other AI services like Bedrock, which is not an LLM, but a service that offers multiple LLMs, including open source ones โ so it mixes apples and oranges.)
But anecdotally, the evidence is piling up. For one, each of the major business application providers has moved aggressively recently to integrate open source LLMs, fundamentally changing how enterprises can deploy these models. Salesforce led the latest wave by introducing Agentforce last month, recognizing that its customer relationship management customers needed more flexible AI options. The platform enables companies to plug in any LLM within Salesforce applications, effectively making open source models as easy to use as closed ones. Salesforce-owned Slack quickly followed suit.
Oracle also last month expanded support for the latest Llama models across its enterprise suite, which includes the big enterprise apps of ERP, human resources, and supply chain. SAP, another business app giant, announced comprehensive open source LLM support through its Joule AI copilot, while ServiceNow enabled both open and closed LLM integration for workflow automation in areas like customer service and IT support.
โI think open models will ultimately win out,โ says Oracleโs EVP of AI and Data Management Services, Greg Pavlik. The ability to modify models and experiment, especially in vertical domains, combined with favorable cost, is proving compelling for enterprise customers, he said.
A complex landscape of โopenโ models
While Metaโs Llama has emerged as a frontrunner, the open LLM ecosystem has evolved into a nuanced marketplace with different approaches to openness. For one, Metaโs Llama has more than 65,000 model derivatives in the market. Enterprise IT leaders must navigate these, and other options ranging from fully open weights and training data to hybrid models with commercial licensing.
Mistral AI, for example, has gained significant traction by offering high-performing models with flexible licensing terms that appeal to enterprises needing different levels of support and customization. Cohere has taken another approach, providing open model weights but requiring a license fee โ a model that some enterprises prefer for its balance of transparency and commercial support.
This complexity in the open model landscape has become an advantage for sophisticated enterprises. Companies can choose models that match their specific requirements โ whether thatโs full control over model weights for heavy customization, or a supported open-weight model for faster deployment. The ability to inspect and modify these models provides a level of control impossible with fully closed alternatives, leaders say. Using open source models also often requires a more technically proficient team to fine-tune and manage the models effectively, another reason enterprise companies with more resources have an upper hand when using open source.
Metaโs rapid development of Llama exemplifies why enterprises are embracing the flexibility of open models. AT&T uses Llama-based models for customer service automation, DoorDash for helping answer questions from its software engineers, and Spotify for content recommendations. Goldman Sachs has deployed these models in heavily regulated financial services applications. Other Llama users include Niantic, Nomura, Shopify, Zoom, Accenture, Infosys, KPMG, Wells Fargo, IBM, and The Grammy Awards.ย
Meta has aggressively nurtured channel partners. All major cloud providers embrace Llama models now. โThe amount of interest and deployments theyโre starting to see for Llama with their enterprise customers has been skyrocketing,โ reports Ragavan Srinivasan, VP of Product at Meta, โespecially after Llama 3.1 and 3.2 have come out. The large 405B model in particular is seeing a lot of really strong traction because very sophisticated, mature enterprise customers see the value of being able to switch between multiple models.โ He said customers can use a distillation service to create derivative models from Llama 405B, to be able to fine tune it based on their data. Distillation is the process of creating smaller, faster models while retaining core capabilities.ย
Indeed, Meta covers the landscape well with its other portfolio of models, including the Llama 90B model, which can be used as a workhorse for a majority of prompts, and 1B and 3B, which are small enough to be used on device. Today, Meta released โquantizedโ versions of those smaller models. Quantization is another process that makes a modelย smaller, allowing less power consumption and faster processing. What makes these latest special is that they were quantized during training, making them more efficient than other industry quantized knock-offs โ four times faster at token generation than their originals, using a fourth of the power.
Technical capabilities drive sophisticated deployments
The technical gap between open and closed models has essentially disappeared, but each shows distinct strengths that sophisticated enterprises are learning to leverage strategically. This has led to a more nuanced deployment approach, where companies combine different models based on specific task requirements.
โThe large, proprietary models are phenomenal at advanced reasoning and breaking down ambiguous tasks,โ explains Salesforce EVP of AI, Jayesh Govindarajan. But for tasks that are light on reasoning and heavy on crafting language, for example drafting emails, creating campaign content, researching companies, โopen source models are at par and some are better,โ he said. Moreover, even the high reasoning tasks can be broken into sub-tasks, many of which end up becoming language tasks where open source excels, he said.ย
Intuit, the owner of accounting software Quickbooks, and tax software Turbotax, got started on its LLM journey a few years ago, making it a very early mover among Fortune 500 companies. Its implementation demonstrates a sophisticated approach. For customer-facing applications like transaction categorization in QuickBooks, the company found that its fine-tuned LLM built on Llama 3 demonstrated higher accuracy than closed alternatives. โWhat we find is that we can take some of these open source models and then actually trim them down and use them for domain-specific needs,โ explains Ashok Srivastava, Intuitโs chief data officer. They โcan be much smaller in size, much lower and latency and equal, if not greater, in accuracy.โ
The banking sector illustrates the migration from closed to open LLMs. ANZ Bank, a bank that serves Australia and New Zealand, started out using OpenAI for rapid experimentation. But when it moved to deploy real applications, it dropped OpenAI in favor of fine-tuning its own Llama-based models, to accommodate its specific financial use cases, driven by needs for stability and data sovereignty. The bank published a blog about the experience, citing the flexibility provided by Llamaโs multiple versions, flexible hosting, version control, and easier rollbacks. We know of another top-three U.S. bank that also recently moved away from OpenAI.
Itโs examples like this, where companies want to leave OpenAI for open source, that have given rise to things like โswitch kitsโ from companies like PostgresML that make it easy to exit OpenAI and embrace open source โin minutes.โ
Infrastructure evolution removes deployment barriers
The path to deploying open source LLMs has been dramatically simplified. Metaโs Srinivasan outlines three key pathways that have emerged for enterprise adoption:
- Cloud Partner Integration: Major cloud providers now offer streamlined deployment of open source models, with built-in security and scaling features.
- Custom Stack Development: Companies with technical expertise can build their own infrastructure, either on-premises or in the cloud, maintaining complete control over their AI stack โ and Meta is helping with its so-called Llama Stack.
- API Access: For companies seeking simplicity, multiple providers now offer API access to open source models, making them as easy to use as closed alternatives. Groq, Fireworks, and Hugging Face are examples. All of them are able to provide you an inference API, a fine-tuning API, and basically anything that you would need or you would get from a proprietary provider.
Safety and control advantages emerge
The open source approach has also โ unexpectedly โ emerged as a leader in model safety and control, particularly for enterprises requiring strict oversight of their AI systems. โMeta has been incredibly careful on the safety part, because theyโre making it public,โ notes Groqโs Ross. โThey actually are being much more careful about it. Whereas with the others, you donโt really see whatโs going on and youโre not able to test it as easily.โ
This emphasis on safety is reflected in Metaโs organizational structure. Its team focused on Llamaโs safety and compliance is large relative to its engineering team, Ross said, citing conversations with the Meta a few months ago. (A Meta spokeswoman said the company does not comment on personnel information). The September release of Llama 3.2 introduced Llama Guard Vision, adding to safety tools released in July. These tools can:
- Detect potentially problematic text and image inputs before they reach the model
- Monitor and filter output responses for safety and compliance
Enterprise AI providers have built upon these foundational safety features. AWSโs Bedrock service, for example, allows companies to establish consistent safety guardrails across different models. โOnce customers set those policies, they can choose to move from one publicly available model to another without actually having to rewrite the application,โ explains AWSโ Sridharan. This standardization is crucial for enterprises managing multiple AI applications.
Databricks and Snowflake, the leading cloud data providers for enterprise, also vouch for Llamaโs safety: โLlama models maintain the โhighest standards of security and reliability,โ said Hanlin Tang, CTO for Neural Networks
Intuitโs implementation shows how enterprises can layer additional safety measures. The companyโs GenSRF (security, risk and fraud assessment) system, part of its โGenOSโ operating system, monitors about 100 dimensions of trust and safety. โWe have a committee that reviews LLMs and makes sure its standards are consistent with the companyโs principles,โ Intuitโs Srivastava explains. However, he said these reviews of open models are no different than the ones the company makes for closed-sourced models.
Data provenance solved through synthetic training
A key concern around LLMs is about the data theyโve been trained on. Lawsuits abound from publishers and other creators, charging LLM companies with copyright violation. Most LLM companies, open and closed, havenโt been fully transparent about where they get their data. Since much of it is from the open web, it can be highly biased, and contain personal information.ย
Many closed sourced companies have offered users โindemnification,โ or protection against legal risks or claims lawsuits as a result of using their LLMs. Open source providers usually do not provide such indemnification. But lately this concern around data provenance seems to have declined somewhat. Models can be grounded and filtered with fine-tuning, and Meta and others have created more alignment and other safety measures to counteract the concern. Data provenance is still an issue for some enterprise companies, especially those in highly regulated industries, such as banking or healthcare. But some experts suggest these data provenance concerns may be resolved soon through synthetic training data.ย
โImagine I could take public, proprietary data and modify them in some algorithmic ways to create synthetic data that represents the real world,โ explains Salesforceโs Govindarajan. โThen I donโt really need access to all that sort of internet dataโฆ The data provenance issue just sort of disappears.โ
Meta has embraced this trend, incorporating synthetic data training in Llama 3.2โs 1B and 3B models.ย
Regional patterns may reveal cost-driven adoption
The adoption of open source LLMs shows distinct regional and industry-specific patterns. โIn North America, the closed source models are certainly getting more production use than the open source models,โ observes Oracleโs Pavlik. โOn the other hand, in Latin America, weโre seeing a big uptick in the Llama models for production scenarios. Itโs almost inverted.โ
What is driving these regional variations isnโt clear, but they may reflect different priorities around cost and infrastructure. Pavlik describes a scenario playing out globally: โSome enterprise user goes out, they start doing some prototypesโฆusing GPT-4. They get their first bill, and theyโre like, โOh my god.โ Itโs a lot more expensive than they expected. And then they start looking for alternatives.โ
Market dynamics point toward commoditization
The economics of LLM deployment are shifting dramatically in favor of open models. โThe price per token of generated LLM output has dropped 100x in the last year,โ notes venture capitalist Marc Andreessen, who questioned whether profits might be elusive for closed-source model providers. This potential โrace to the bottomโ creates particular pressure on companies that have raised billions for closed-model development, while favoring organizations that can sustain open source development through their core businesses.
โWe know that the cost of these models is going to go to zero,โ says Intuitโs Srivastava, warning that companies โover-capitalizing in these models could soon suffer the consequences.โ This dynamic particularly benefits Meta, which can offer free models while gaining value from their application across its platforms and products.
A good analogy for the LLM competition, Groqโs Ross says, is the operating system wars. โLinux is probably the best analogy that you can use for LLMs.โ While Windows dominated consumer computing, it was open source Linux that came to dominate enterprise systems and industrial computing. Intuitโs Srivastava sees the same pattern: โWe have seen time and again: open source operating systems versus non open source. We see what happened in the browser wars,โ when open source Chromium browsers beat closed models.
Walter Sun, SAPโs global head of AI, agrees: โI think that in a tie, people can leverage open source large language models just as well as the closed source ones, that gives people more flexibility.โ He continues: โIf you have a specific need, a specific use caseโฆ the best way to do it would be with open source.โ
Some observers like Groqโs Ross believe Meta may be in a position to commit $100 billion to training its Llama models, which would exceed the combined commitments of proprietary model providers, he said. Meta has an incentive to do this, he said, because it is one of the biggest beneficiaries of LLMs. It needs them to improve intelligence in its core business, by serving up AI to users on Instagram, Facebook, Whatsapp. Meta says its AI touches 185 million weekly active users, a scale matched by few others.ย
This suggests that open source LLMs wonโt face the sustainability challenges that have plagued other open source initiatives. โStarting next year, we expect future Llama models to become the most advanced in the industry,โ declared Meta CEO Mark Zuckerberg in his July letter of support for open source AI. โBut even before that, Llama is already leading on openness, modifiability, and cost efficiency.โ
Specialized models enrich the ecosystem
The open source LLM ecosystem is being further strengthened by the emergence of specialized industry solutions. IBM, for instance, has released its Granite models as fully open source, specifically trained for financial and legal applications. โThe Granite models are our killer apps,โ says Matt Candy, IBMโs global managing partner for generative AI. โThese are the only models where thereโs full explainability of the data sets that have gone into training and tuning. If youโre in a regulated industry, and are going to be putting your enterprise data together with that model, you want to be pretty sure whatโs in there.โ
IBMโs business benefits from open source, including from wrapping its Red Hat Enterprise Linux operating system into a hybrid cloud platform, that includes usage of the Granite models and its InstructLab, a way to fine-tune and enhance LLMs. The AI business is already kicking in. โTake a look at the ticker price,โ says Candy. โAll-time high.โ
Trust increasingly favors open source
Trust is shifting toward models that enterprises can own and control. Ted Shelton, COO of Inflection AI, a company that offers enterprises access to licensed source code and full application stacks as an alternative to both closed and open source models, explains the fundamental challenge with closed models: โWhether itโs OpenAI, itโs Anthropic, itโs Gemini, itโs Microsoft, they are willing to provide a so-called private compute environment for their enterprise customers. However, that compute environment is still managed by employees of the model provider, and the customer does not have access to the model.โ This is because the LLM owners want to protect proprietary elements like source code, model weights, and hyperparameter training details, which canโt be hidden from customers who would have direct access to the models. Since much of this code is written in Python, not a compiled language, it remains exposed.
This creates an untenable situation for enterprises serious about AI deployment. โAs soon as you say โOkay, well, OpenAIโs employees are going to actually control and manage the model, and they have access to all the companyโs data,โ it becomes a vector for data leakage,โ Shelton notes. โCompanies that are actually really concerned about data security are like โNo, weโre not doing that. Weโre going to actually run our own model. And the only option available is open source.’โ
The path forward
While closed-source models maintain a market share lead for simpler use cases, sophisticated enterprises increasingly recognize that their future competitiveness depends on having more control over their AI infrastructure. As Salesforceโs Govindarajan observes: โOnce you start to see value, and you start to scale that out to all your users, all your customers, then you start to ask some interesting questions. Are there efficiencies to be had? Are there cost efficiencies to be had? Are there speed efficiencies to be had?โ
The answers to these questions are pushing enterprises toward open models, even if the transition isnโt always straightforward. โI do think that there are a whole bunch of companies that are going to work really hard to try to make open source work,โ says Inflection AIโs Shelton, โbecause they got nothing else. You either give in and say a couple of large tech companies own generative AI, or you take the lifeline that Mark Zuckerberg threw you. And youโre like: โOkay, letโs run with this.’โ
source: https://venturebeat.com/ai/the-enterprise-verdict-on-ai-models-why-open-source-will-win/


