LLM · Other Companies

Liquid AI Releases LFM2.5-8B-A1B: An On-Device MoE Model With 8.3B Total and 1.5B Active Parameters
Featured
LLM

Liquid AI Releases LFM2.5-8B-A1B: An On-Device MoE Model With 8.3B Total and 1.5B Active Parameters

Liquid AI's LFM2.5-8B-A1B activates 1.5B of 8.3B parameters, offering 128K context, reasoning, and tool calling on consumer hardware. The post Liquid AI Releases LFM2.5-8B-A1B: An On-Device MoE Model...

MarkTechPost
Read more
Perplexity AI Open-Sources Unigram Tokenizer That Achieves 5x Lower p50 Latency Than Hugging Face tokenizers Crate
LLM

Perplexity AI Open-Sources Unigram Tokenizer That Achieves 5x Lower p50 Latency Than Hugging Face tokenizers Crate

Perplexity AI has open-sourced an optimized Unigram tokenizer that significantly improves performance compared to...

MarkTechPost
Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video
LLM

Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video

Amazon MGM Studios and AWS have launched a GenAI Creators' Fund providing filmmakers access to their proprietary AI...

The Decoder
ElevenLabs Music v2 promises opera-to-metal transitions without losing musical coherence
LLM

ElevenLabs Music v2 promises opera-to-metal transitions without losing musical coherence

ElevenLabs has released Music v2, an AI music generation model capable of seamlessly transitioning between different...

The Decoder
The AI boom drove Nvidia's yearly Taiwan spending from $15 billion to $150 billion
LLM

The AI boom drove Nvidia's yearly Taiwan spending from $15 billion to $150 billion

Nvidia's annual spending with Taiwan-based suppliers, primarily TSMC, has surged from $15 billion to $150 billion due...

The Decoder
Meet EAGLE 3.1: The Speculative Decoding Algorithm That Fixes Attention Drift in LLM Inference
LLM

Meet EAGLE 3.1: The Speculative Decoding Algorithm That Fixes Attention Drift in LLM Inference

EAGLE 3.1, a new speculative decoding algorithm developed by the EAGLE team, vLLM, and TorchSpec, addresses instability...

MarkTechPost
7 Ways to Get So Good at AI, People Will Think You Are AI
LLM

7 Ways to Get So Good at AI, People Will Think You Are AI

The article provides practical tips for mastering AI tools and prompting techniques to become proficient with AI...

Wired AI
Uber president says AI spending is getting ‘harder to justify’
LLM

Uber president says AI spending is getting ‘harder to justify’

Uber has exhausted its annual AI budget four months into 2026 and is questioning the return on its investments....

The Verge AI
Together AI Open-Sources OSCAR: An Attention-Aware 2-Bit KV Cache Quantization System for Long-Context LLM Serving
LLM

Together AI Open-Sources OSCAR: An Attention-Aware 2-Bit KV Cache Quantization System for Long-Context LLM Serving

Together AI has open-sourced OSCAR, an INT2 KV cache quantization method that reduces memory usage by 8× and improves...

MarkTechPost
AI models often give the right answers but point to the wrong sources
LLM

AI models often give the right answers but point to the wrong sources

Leading AI models like GPT and Gemini often cite incorrect source passages to support their answers, even when the...

The Decoder
StepFun Releases StepAudio 2.5 Realtime: An End-to-End Voice Model with Roleplay-Specific RLHF and Paralinguistic Comprehension
LLM

StepFun Releases StepAudio 2.5 Realtime: An End-to-End Voice Model with Roleplay-Specific RLHF and Paralinguistic Comprehension

StepFun released StepAudio 2.5 Realtime, an end-to-end real-time speech language model with customizable persona...

MarkTechPost
ByteDance study finds that asking LMMs questions beats making it transcribe text for long document training
LLM

ByteDance study finds that asking LMMs questions beats making it transcribe text for long document training

ByteDance Seed demonstrates that a 7B parameter model can effectively answer questions on long, image-heavy documents...

The Decoder
Cloudflare CEO Prince says builders and sellers are safe but AI is coming for the measurers
LLM

Cloudflare CEO Prince says builders and sellers are safe but AI is coming for the measurers

Cloudflare CEO Matthew Prince laid off over 20% of the workforce, attributing the cuts to AI replacing middle...

The Decoder
Google checks websites for llms.txt in new agentic browsing audit
LLM

Google checks websites for llms.txt in new agentic browsing audit

Google is testing how well websites handle AI agents through a new experimental category called "Agentic Browsing" in...

The Decoder
Cohere open-sources its strongest model yet
LLM

Cohere open-sources its strongest model yet

Cohere, a Canadian AI company, has released Command A+, its most powerful language model to date, as open source under...

The Decoder
US government takes $2 billion equity stake in nine quantum computing firms
LLM

US government takes $2 billion equity stake in nine quantum computing firms

Beneficiaries include startup backed by firm with links to the Trump family.

Ars Technica AI
One Model, Three Modalities: ByteDance Releases Lance for Image and Video Understanding, Generation, and Editing
LLM

One Model, Three Modalities: ByteDance Releases Lance for Image and Video Understanding, Generation, and Editing

ByteDance's Intelligent Creation Lab released Lance, an open-source multimodal model that performs image and video...

MarkTechPost
We’re announcing new community investments in Missouri.
LLM

We’re announcing new community investments in Missouri.

We’re helping build the state’s next-generation workforce and investing in energy programs.

Google AI Blog
Google publishes exploit code threatening millions of Chromium users
LLM

Google publishes exploit code threatening millions of Chromium users

Google publishes exploit code before patch, reported 42 months earlier, is fixed.

Ars Technica AI
Stability AI releases a new audio model that can create 6-minute songs
LLM

Stability AI releases a new audio model that can create 6-minute songs

Stability AI has released Stability Audio 3.0 small model, capable of generating two-minute audio tracks that can run...

TechCrunch AI

Stay Updated

Get the latest AI news delivered to your inbox every morning. No spam, unsubscribe anytime.