evolvedigital.ai blog

Tag: Meta

Meta Launches Muse Spark 1.1: A New Frontier Agentic Model Enters the Paid API Market

Meta Superintelligence Labs released Muse Spark 1.1 on July 9, 2026, a multimodal reasoning model built specifically for agentic tasks that marks a significant strategic shift for the company. For the first time, Meta is charging for access to a frontier AI model through the paid Meta Model API, putting it in direct competition with Anthropic’s Claude and OpenAI’s GPT lineup. The launch was punctuated by CEO Mark Zuckerberg’s return to X after three years away from the platform. Muse Spark 1.1 arrives with a 1 million token context window, native computer use capabilities, and parallel sub-agent execution, entering public preview immediately for developers globally.

What Was Announced

Muse Spark 1.1 was released by Meta Superintelligence Labs, the research division led by Alexandr Wang, on July 9, 2026. The model is designed to handle complex, multi-step agentic workflows — a class of AI task that requires reasoning over long sessions, executing actions across computer interfaces, and managing many subtasks in parallel.

Pricing for Muse Spark 1.1 is set at $1.25 per million input tokens and $4.25 per million output tokens. Developers can begin testing immediately with $20 in free API credits. The model is available through the Meta Model API in public preview, and is also accessible through the Meta AI app’s Thinking mode and at meta.ai, giving both enterprise developers and individual users access to the same underlying capability.

CEO Mark Zuckerberg announced the launch on X, marking his return to the platform for the first time in three years — his last engagement there was in July 2023, when the platform rebranded from Twitter. Zuckerberg described Muse Spark 1.1 as “a strong agentic and coding model at a very low price,” signaling that Meta intends to compete on cost as well as raw capability.

Alexandr Wang, who leads Meta Superintelligence Labs, said the new platform represents the company’s strongest model for agentic and coding work, with a focus on enabling autonomous multi-step task completion at enterprise scale.

Technical Details

Muse Spark 1.1 is built on a multimodal architecture trained for high performance on extended, multi-step tasks. The model supports a 1 million token context window, allowing it to retain information and reason across very long sessions without losing track of earlier context — an essential feature for enterprise workflows that may unfold over hours rather than minutes.

One of the model’s key technical differentiators is its approach to parallel execution. Rather than processing complex tasks sequentially, Muse Spark 1.1 is trained to spawn and coordinate parallel sub-agents, enabling it to complete more steps in less time on large projects. The model also ships with native computer use capabilities, allowing it to interact directly with desktop applications, mobile interfaces, and web browsers to complete multi-step digital workflows autonomously.

On benchmark evaluations, Muse Spark 1.1 tops professional and scaled tool-use benchmarks including JobBench and MCP Atlas. Meta reports major improvements over the original Muse Spark across tool use, computer use, coding, and multi-agent orchestration. The model trails Anthropic’s Opus 4.8 and OpenAI’s GPT-5.5 on pure coding and multimodal reasoning tasks, pointing to clear strengths in agentic and workflow automation scenarios.

Industry Impact and Reactions

The most significant aspect of the Muse Spark 1.1 release may not be the model itself, but what it signals about Meta’s business strategy. For years, Meta positioned itself as a champion of open-source AI, releasing its LLaMA model family freely and building a public reputation in contrast to closed API providers like Anthropic and OpenAI. The launch of a paid Meta Model API changes that equation directly. Meta is now entering the commercial frontier model market, offering a product that competes on price, capability, and a distinct technical focus on agentic tasks.

The timing of the launch is notable. The AI coding and agentic AI markets have been intensifying rapidly throughout 2026, with major releases from virtually every large AI lab. Meta’s entry into this space with a model specifically designed for agentic and tool-use tasks puts additional pressure on the pricing tiers that Anthropic and OpenAI have established. At $1.25 per million input tokens, Muse Spark 1.1 is positioned as a cost-competitive option for developers building applications that make heavy use of AI tool calls and computer use.

The fact that Zuckerberg personally returned to X to make the announcement underscores how significant Meta views this launch internally. The three-year absence from the platform made the post immediately visible to tech media and the developer community, amplifying the announcement beyond what a standard press release would achieve.

What Comes Next

Meta has indicated that Muse Spark 1.1 is the beginning of a new product line rather than a standalone model release. The Meta Model API is launching in public preview, suggesting the company plans to expand availability, add enterprise-grade features such as private deployment and usage analytics, and iterate on the model rapidly in the months ahead. Developers can expect additional SDK support, expanded documentation, and broader regional availability as the preview progresses.

The competitive landscape will almost certainly respond. Anthropic, OpenAI, and Google have each made significant investments in agentic AI capabilities throughout 2026, and Meta’s entry at an aggressive price point adds further urgency to their own development roadmaps. The next benchmark releases from all four labs will be closely watched by enterprise buyers weighing platform commitments.

Conclusion

Meta Muse Spark 1.1 marks a meaningful turning point for the company and for the AI industry. A company long associated with open-source AI is now competing directly in the paid frontier model market, with a model purpose-built for agentic workflows, computer use, and large-scale task automation. Whether Muse Spark closes the performance gap with top competitors on coding and multimodal tasks in future versions remains to be seen, but the commercial and strategic implications of this launch extend well beyond any single benchmark result.

Stay updated on the latest AI news at Evolve Digital.

July 10, 2026
Meta Launches Meta Compute: A New Cloud Business to Rival AWS, Google, and Microsoft

Meta Platforms made a landmark strategic announcement on July 1, 2026, revealing plans to launch Meta Compute, a dedicated business unit that will sell access to the company’s AI compute infrastructure and hosted AI models to paying external customers. The move sends Meta directly into competition with Amazon Web Services, Google Cloud, and Microsoft Azure — and sent Meta’s stock climbing nearly 10 percent in a single trading session. The announcement marks a fundamental shift in how Meta frames its massive AI infrastructure spending: from cost center to revenue engine.

What Was Announced

Meta’s new cloud division, Meta Compute, will offer two primary services: raw GPU compute capacity leased to external customers, and access to hosted AI models — including Meta’s recently released closed-weight model, Muse Spark. The business will be led by a high-profile leadership trio: Santosh Janardhan, Meta’s head of infrastructure; Daniel Gross, the leader of Meta Superintelligence Labs; and Dina Powell McCormick, Meta’s president.

The announcement was first reported by Bloomberg on July 1, 2026, and confirmed by Meta shortly after. CEO Mark Zuckerberg had previously indicated that a cloud computing business was “definitely on the table” as a mechanism for generating returns on infrastructure investment, but this marks the first formal organizational step toward that goal.

Meta has committed $182.9 billion to AI infrastructure build-out through the coming years. Major new data center campuses in Louisiana and Ohio are expected to come online in 2026, adding substantial compute capacity that Meta now plans to monetize externally rather than leave idle. The timing of this announcement was deliberate: investor pressure over Meta’s elevated capital expenditure had been building for months, and Meta Compute reframes that spending as an asset under development rather than a liability.

Meta raised its full-year capital expenditure guidance in April 2026 to between $125 billion and $145 billion — a range that alarmed some analysts at the time. With Meta Compute now on the table, the calculus for investors changed dramatically.

Technical Details

Meta’s compute infrastructure is built around Nvidia GPU clusters optimized for large-scale AI training and inference. The external-facing offering is expected to follow a model similar to CoreWeave, where customers lease dedicated GPU capacity for specific workloads rather than accessing shared cloud resources through traditional virtual machine abstractions. This approach is especially attractive to AI labs, enterprises running fine-tuning workloads, and research organizations that need predictable, high-performance access to accelerated compute.

On the model hosting side, Meta Compute will offer inference access to Meta’s proprietary models, including Muse Spark. This positions Meta as both an infrastructure provider and a model-as-a-service vendor — a combination already proven by AWS (via Bedrock), Google (via Vertex AI), and Microsoft (via Azure AI Studio). Meta’s advantage is that it is offering access to its own first-party models alongside raw compute, potentially at prices that undercut competitors due to the scale of Meta’s infrastructure investments.

The compute pools available through Meta Compute are expected to draw from multiple geographic regions as Meta’s new data centers come online, giving enterprise customers options for data residency and latency requirements. Specific API endpoints, pricing structures, and service-level agreements had not been publicly disclosed as of July 2, 2026, though announcements are expected in the coming weeks.

Industry Impact and Reactions

The market reaction was swift and unambiguous. Meta shares closed up nearly 9 to 10 percent on the day of the announcement, with investors welcoming the prospect of returns on an infrastructure buildout that had previously drawn skepticism. The move effectively reframed Meta’s $182.9 billion commitment from a liability into the foundation of a potential new business line worth billions in annual recurring revenue.

The announcement had the opposite effect on neocloud rivals. Shares of CoreWeave and Nebius Group both fell roughly 12 percent as investors anticipated new competition from a company with far greater infrastructure scale and financial resources. Both CoreWeave and Nebius have built businesses around selling GPU compute to AI companies, precisely the market Meta is now entering.

The strategy is not without precedent. SpaceX began leasing compute capacity from its Colossus 1 data center in May 2026, signing deals with Anthropic, Google, and AI startup Reflection AI. Elon Musk’s company has since become one of the largest third-party compute platforms in the world, with committed external revenues exceeding $80 billion through 2029. Meta’s announcement suggests that large infrastructure operators without traditional cloud businesses are increasingly looking to monetize their GPU capacity in the open market rather than keep it captive.

What Comes Next

Meta Compute is expected to begin accepting enterprise customers in the second half of 2026, with the Louisiana and Ohio data centers contributing additional capacity as they come online. The company has not announced a specific launch date for its public API or pricing tiers, but industry analysts expect a phased rollout beginning with select enterprise partners before a broader availability announcement. Developer-facing tooling, including integration with existing Meta AI products, is also anticipated.

The longer-term question is whether Meta Compute can establish itself as a credible alternative to the hyperscalers. AWS, Google Cloud, and Microsoft Azure collectively control the vast majority of enterprise cloud spending and have deep integrations with enterprise software ecosystems that will take years to replicate. Meta’s path to competitiveness likely runs through pricing, model quality, and the ability to offer tight integration with Meta’s own AI research output.

Conclusion

Meta’s launch of Meta Compute represents one of the most significant strategic pivots in the company’s history — a deliberate move to transform its AI infrastructure from a research enabler into a commercial product. With nearly $183 billion committed to compute infrastructure, a roster of proprietary AI models, and a leadership team drawn from Meta’s most senior technical and business ranks, Meta Compute arrives as a credible entrant in a market that is still defining itself. For enterprises, AI startups, and the broader cloud industry, the arrival of Meta as a compute vendor will reshape competitive dynamics in ways that are only beginning to become clear.

Stay updated on the latest AI news at Evolve Digital.

July 2, 2026
Google Limits Meta’s Gemini AI Access as Global Compute Shortage Reaches a Breaking Point

Google has restricted Meta’s access to its Gemini AI models after the social media giant requested more computing capacity than Google could supply, the Financial Times reported on June 28, 2026. The move has disrupted and delayed multiple internal Meta AI projects and signals a deepening global crisis in artificial intelligence infrastructure that is now affecting even the largest players in the industry.

What Was Announced

According to the Financial Times and subsequent reports from CNBC and other outlets, Google informed Meta around March 2026 that it was unable to fulfill the full volume of Gemini AI computing capacity Meta had sought to purchase. Meta, which had become one of Google’s largest enterprise Gemini customers, found its AI operations constrained as a result.

The fallout was immediate for Meta’s internal teams. The company instructed employees to use AI tokens more sparingly and to improve efficiency in how they consume computing resources. Meta has also begun shifting internal workloads from Google’s Gemini to its own internally developed Muse Spark model, a move that signals a strategic pivot toward reducing dependency on external AI providers.

The situation extends beyond Meta. Several other Google Cloud customers have reportedly been affected by compute constraints, though to a lesser extent than Meta. Google declined to comment on the specifics of any individual customer relationship, but the scope of the shortage is reflected in the company’s own financial disclosures and executive commentary.

Google Cloud posted more than $20 billion in quarterly revenue, a year-over-year increase of 63 percent. Despite that staggering growth, the company faces an estimated $460 billion in unmet infrastructure demand. Google CEO Sundar Pichai publicly acknowledged the challenge, stating: “We are compute-constrained in the near term.”

Technical Details

The core bottleneck is GPU supply. Training and serving large AI models requires massive quantities of specialized hardware, primarily NVIDIA GPUs, which remain in critically short supply across the industry. Google has committed $180 to $190 billion toward AI infrastructure investment in 2026, a figure that reflects the scale of the problem rather than a solution to it.

To bridge the gap between existing capacity and skyrocketing customer demand, Google has entered into an extraordinary arrangement with SpaceX, paying approximately $920 million per month for access to 110,000 NVIDIA GPUs. Google describes this as “bridge capacity,” a temporary measure to supplement its own data center buildout while new facilities come online. The SpaceX deal alone represents an annualized spend of roughly $11 billion on externally sourced compute.

For Meta specifically, the compute squeeze arrived at a difficult moment. The company has simultaneously been undergoing significant internal restructuring, including a reduction of approximately 8,000 positions, while also planning to invest up to $135 billion in its own AI infrastructure. Meta’s reliance on Google’s Gemini API for internal tooling made the compute limits particularly disruptive to engineering workflows that had been built around consistent access to that capacity.

Industry Impact and Reactions

The Google and Meta situation is being closely watched across the AI industry as a concrete example of the infrastructure constraints that have until recently been discussed in mostly theoretical terms. For months, analysts and executives have warned that demand for AI compute would outstrip supply. This episode confirms that the gap has become wide enough to affect major commercial relationships between two of the largest technology companies on the planet.

The competitive implications are significant. Meta’s accelerated investment in its own Muse Spark model and internal compute suggests that large-scale AI consumers are drawing lessons from this episode and moving toward greater self-sufficiency. Other hyperscalers and enterprise AI adopters who rely on third-party API access for critical workflows may now reconsider their dependence on any single compute provider.

For Google, the situation presents a paradox: its Gemini models are generating intense commercial demand, yet infrastructure limits are forcing the company to ration access to paying customers. While Google Cloud’s revenue growth is exceptional, the ability to translate that demand into revenue is constrained by hardware availability. Competitors including Microsoft Azure, AWS, and Oracle Cloud are facing similar pressures, though each has structured its infrastructure investments differently.

What Comes Next

Google has provided no specific public timeline for when compute capacity constraints will ease. The company’s bridge arrangement with SpaceX is expected to persist into late 2026 at minimum, as new Google-owned data center capacity requires 18 to 24 months from groundbreaking to full operation. The $180 to $190 billion infrastructure commitment suggests that Google is building toward a significant expansion of capacity, but the benefits of that investment are unlikely to reach enterprise customers in the near term.

Meta, for its part, has signaled that its long-term strategy involves far greater self-reliance on internally developed models and owned infrastructure. The Muse Spark transition and the planned $135 billion infrastructure investment are likely to reduce the company’s exposure to third-party compute rationing going forward. Whether Google can retain Meta as a major customer once its own capacity is online will be one of the more consequential enterprise AI business storylines of the next 12 months.

Conclusion

The restriction of Meta’s Gemini AI access is a milestone moment in the evolution of the AI industry, marking the first widely reported instance of a major provider rationing compute to a major customer due to infrastructure scarcity. As demand for AI services continues to accelerate faster than new data center capacity can be built, the industry should expect rationing, strategic pivots toward internal models, and intensified competition for GPU supply to become defining features of the AI landscape through 2026 and beyond.

Stay updated on the latest AI news at Evolve Digital.

June 29, 2026
Meta Launches AI Subscription Tiers Under New ‘Meta One’ Brand, Charging Up to $19.99 Per Month

Meta took a significant step toward monetizing its artificial intelligence investments on May 28, 2026, officially launching a new subscription brand called Meta One that introduces tiered paid AI plans across Instagram, Facebook, and WhatsApp. The announcement marks a fundamental shift in how the social media giant plans to generate revenue from the billions of dollars it has poured into AI infrastructure, complementing rather than replacing its advertising business.

What Was Announced

Meta One is the new umbrella brand for a family of subscription tiers that give users access to enhanced AI capabilities across Meta’s core apps. The initial rollout covers consumers globally, with simultaneous testing of professional and business tiers targeting creators and enterprise customers.

The two AI-focused consumer tiers are priced at $7.99 per month for Meta One Plus and $19.99 per month for Meta One Premium. Both tiers sit on top of existing free Meta AI access, which remains available to all users at no charge.

Meta is also launching app-level subscriptions for its individual platforms. Instagram and Facebook Plus plans are priced at $3.99 per month, while a WhatsApp Plus plan is available at $2.99 per month. These entry-level subscriptions focus on profile customization, analytics, and enhanced messaging features rather than AI capabilities specifically.

Professional tiers aimed at creators and businesses range from $14.99 to $49.99 per month, bundling verification badges, improved search visibility, advanced audience analytics, and AI-assisted content creation tools.

Technical Details

The distinction between the two AI subscription tiers centers on compute access and task complexity. Meta One Plus at $7.99 per month is designed for users who regularly generate images and videos using Meta AI, or who rely on the assistant for longer reasoning conversations. It provides expanded generation quotas and moderately extended reasoning capabilities.

Meta One Premium at $19.99 per month unlocks what Meta describes as “thinking mode,” a deeper reasoning mode that allows the AI model to spend more compute cycles working through complex queries before responding. This mirrors similar tiered reasoning approaches offered by OpenAI and Google, where standard responses are faster and lighter, while premium reasoning responses are slower but more thorough for tasks such as coding, analysis, and multi-step planning.

The AI underpinning Meta AI across all tiers is built on Meta’s open-weight Llama model family. Meta has not disclosed which specific Llama version powers the subscription-tier features, but the company has consistently used its proprietary Llama models for consumer-facing AI products since Meta AI launched in 2023.

Industry Impact and Reactions

The launch positions Meta as the latest major AI company to adopt a tiered subscription model for consumer AI. OpenAI has operated paid ChatGPT tiers since early 2023, and Google charges for expanded access to Gemini’s advanced capabilities. By introducing Meta One, Meta is aligning its monetization strategy with the broader industry approach of offering free base access while charging power users for increased compute capacity and more capable models.

The timing is notable. Meta announced capital expenditure guidance of $115 to $135 billion for 2026, nearly double its 2025 spending on AI infrastructure. At the same time, the company cut approximately 8,000 jobs in late May 2026 while redirecting resources toward AI development. The subscription revenue from Meta One is intended in part to offset the cost of providing AI services at scale to more than three billion monthly active users across Meta’s platforms.

Meta simultaneously faces growing competition in its core advertising business. Both OpenAI and xAI have publicly signaled intentions to compete with Meta in advertising, making it strategically important for Meta to develop direct subscription revenue streams that are insulated from that competitive pressure.

What Comes Next

Meta has indicated that the current Meta One launch represents the first phase of a broader subscription strategy. Additional tiers and features are expected to be introduced later in 2026, including more deeply integrated AI agents across the WhatsApp and Messenger platforms. The company has also hinted at subscription offerings specifically for business customers that would go beyond the current professional tiers.

The broader AI subscription market will be watching adoption figures closely. Meta’s distribution advantage is significant: with more than three billion users already inside its apps, the addressable market for even a small conversion rate to paid AI plans is substantial. How quickly consumers adopt paid AI tiers on social platforms, compared to dedicated AI assistants, will likely shape how other major platform companies approach their own AI monetization strategies in 2026 and beyond.

Conclusion

Meta’s launch of the Meta One subscription brand on May 28, 2026 signals the company’s intent to build a durable revenue stream from its AI investments beyond advertising. By introducing tiered access from $7.99 to $19.99 per month for AI features, and combining that with app-level and professional subscriptions, Meta is building a multi-layered business model that mirrors successful approaches already adopted by OpenAI and Google. As AI compute costs continue to rise and competition intensifies, the subscription approach gives Meta a direct pathway to recover infrastructure spending while offering users meaningful value through enhanced AI capabilities in the apps they already use every day.

Stay updated on the latest AI news at Evolve Digital.

May 28, 2026
Meta Launches Llama 4: Its First Natively Multimodal Open-Weight AI Models with Mixture-of-Experts Architecture

Meta has launched the Llama 4 model family, a significant leap forward in open-weight AI that introduces native multimodality and a mixture-of-experts (MoE) architecture to the widely-downloaded Llama ecosystem. The two initial models — Llama 4 Scout and Llama 4 Maverick — are available for download on Hugging Face and represent what Meta is calling the beginning of a new era of AI development centered on natively multimodal intelligence rather than text-first models retrofitted with vision capabilities.

What Was Announced

Meta’s AI research division announced Llama 4 Scout and Llama 4 Maverick as the first models in the Llama 4 herd, both of which can natively process and reason over text, images, and other modalities without relying on separate vision encoders or adapter modules tacked onto a text-only core. This architectural shift — building multimodality into the model from the ground up — is the defining characteristic of the Llama 4 generation and represents a different approach than the vision-language model (VLM) pipeline Meta and others used in earlier multimodal releases.

The models also introduce a mixture-of-experts architecture to the public Llama family. In a MoE design, the model’s parameters are divided into specialized “expert” sub-networks, and only a subset of experts is activated for any given input token. This allows MoE models to have a much larger total parameter count than a dense model of equivalent computational cost, enabling stronger performance without proportionally higher inference expenses. Scout and Maverick differ primarily in scale, with Maverick positioned as the higher-capability model targeting advanced reasoning and instruction following tasks.

Both models are available under a permissive license on Hugging Face, continuing Meta’s strategy of releasing open-weight models that developers can run locally, fine-tune, and deploy without per-token API fees. The Llama family has now surpassed 650 million cumulative downloads across all variants, reflecting the massive developer community that has built around the open-weight model ecosystem Meta has created.

Technical Details

The native multimodal architecture of Llama 4 is technically significant because it allows the model to develop more integrated representations of visual and textual information during training, rather than learning to bridge two separately trained modalities at inference time. Early evaluations suggest this produces more coherent responses to queries that combine text and visual context — such as analyzing a chart while answering a question about it in natural language, or performing multi-step reasoning that requires alternating between visual observation and textual inference.

The MoE architecture brings Llama 4 into alignment with the design choices made by leading closed models, including GPT-4 and some variants of Gemini, which have been suspected or confirmed to use sparse MoE designs. For developers building on Llama, this represents a capability jump that preserves the efficiency advantages of the open-weight ecosystem while offering a more competitive performance profile against frontier commercial models.

Context window length has also been substantially extended in the Llama 4 series, with Scout and Maverick supporting context windows that allow processing of lengthy documents, extended conversations, and complex multi-image inputs without truncation. This is particularly relevant for enterprise use cases that involve processing large volumes of unstructured data or maintaining long-horizon task context in agentic settings.

Industry Impact and Reactions

The Llama 4 release lands at a moment when the gap between open-weight and closed-weight AI models has been narrowing, and the announcement is likely to further accelerate that trend. Developers who have built production systems on Llama 3 will be evaluating a direct upgrade path, while enterprises that have been considering commercial API providers may find that the Llama 4 capability profile reduces the premium they are willing to pay for proprietary models.

For OpenAI, Anthropic, and Google, the continued advancement of Meta’s open-weight models creates competitive pressure in the developer tools and enterprise segments where open-source deployment flexibility is a meaningful procurement criterion. While closed models retain advantages in the highest-stakes enterprise applications requiring guarantees around reliability and support, the Llama ecosystem is becoming progressively more competitive across a wider range of use cases.

The broader open-source AI community has responded enthusiastically to the Llama 4 announcement, with fine-tuning efforts, evaluation results, and deployment guides appearing on Hugging Face, GitHub, and developer forums within hours of the release. Meta’s decision to maintain a permissive license for the Llama 4 herd — despite pressure from some quarters to restrict commercial use — reinforces the company’s position as the primary driver of open-weight frontier AI development.

What Comes Next

Meta has signaled that Scout and Maverick are the first members of a broader Llama 4 herd, with additional models targeting specific capability tiers and use cases expected to follow. The company is also preparing for its first dedicated developer conference, LlamaCon, where it is expected to share additional roadmap details, developer tools, and ecosystem announcements built around the Llama platform.

Fine-tuning infrastructure for Llama 4 is already being built out across the major cloud providers, and enterprise AI vendors including those offering retrieval-augmented generation and agent frameworks are updating their products to support the new models. The pace of adoption will be closely watched as an indicator of how the open-weight AI market responds to a generation of models that are simultaneously more capable and architecturally more complex than their predecessors.

Conclusion

Meta’s Llama 4 launch represents a genuine advance in open-weight AI — not just an incremental update to the Llama lineage, but a fundamental architectural shift toward native multimodality and sparse computation. With 650 million cumulative downloads behind it and a rapidly growing developer community ahead, the Llama 4 herd is positioned to become the foundation layer of a substantial portion of the world’s AI deployments in 2026 and beyond.

Stay updated on the latest AI news at Evolve Digital.

March 29, 2026