LLM · Other Companies

ChatGPT bleeds market share as Claude posts explosive monthly growth
LLM

ChatGPT bleeds market share as Claude posts explosive monthly growth

Claude has doubled its market share in a single month, surpassing DeepSeek and Grok, while ChatGPT maintains market...

The Decoder
Qwen Team Open-Sources Qwen3.6-35B-A3B: A Sparse MoE Vision-Language Model with 3B Active Parameters and Agentic Coding Capabilities
LLM

Qwen Team Open-Sources Qwen3.6-35B-A3B: A Sparse MoE Vision-Language Model with 3B Active Parameters and Agentic Coding Capabilities

Qwen Team has open-sourced Qwen3.6-35B-A3B, a sparse Mixture of Experts (MoE) vision-language model that uses only 3...

MarkTechPost
A Technical Deep Dive into the Essential Stages of Modern Large Language Model Training, Alignment, and Deployment
LLM

A Technical Deep Dive into the Essential Stages of Modern Large Language Model Training, Alignment, and Deployment

The article provides a technical overview of the complete pipeline for training modern large language models, covering...

MarkTechPost
Reid Hoffman weighs in on the ‘tokenmaxxing’ debate
LLM

Reid Hoffman weighs in on the ‘tokenmaxxing’ debate

Reid Hoffman discusses the debate around 'tokenmaxxing' and argues that while tracking AI token usage can indicate...

TechCrunch AI
NVIDIA and the University of Maryland Researchers Released Audio Flamingo Next (AF-Next): A Super Powerful and Open Large Audio-Language Model
LLM

NVIDIA and the University of Maryland Researchers Released Audio Flamingo Next (AF-Next): A Super Powerful and Open Large Audio-Language Model

NVIDIA and University of Maryland researchers released Audio Flamingo Next (AF-Next), an open large audio-language...

MarkTechPost
From LLMs to hallucinations, here’s a simple guide to common AI terms
LLM

From LLMs to hallucinations, here’s a simple guide to common AI terms

The article provides a glossary of common AI terminology that has emerged with the rise of large language models and...

TechCrunch AI
LLM

MiniMax Just Open Sourced MiniMax M2.7: A Self-Evolving Agent Model that Scores 56.22% on SWE-Pro and 57.0% on Terminal Bench 2

MiniMax has open-sourced MiniMax M2.7, a self-evolving agent model that achieves 56.22% on SWE-Pro and 57.0% on...

MarkTechPost
Liquid AI Releases LFM2.5-VL-450M: a 450M-Parameter Vision-Language Model with Bounding Box Prediction, Multilingual Support, and Sub-250ms Edge Inference
LLM

Liquid AI Releases LFM2.5-VL-450M: a 450M-Parameter Vision-Language Model with Bounding Box Prediction, Multilingual Support, and Sub-250ms Edge Inference

Liquid AI released LFM2.5-VL-450M, a 450M-parameter vision-language model featuring bounding box prediction,...

MarkTechPost
LLMs crush coding and math but choke on casual questions, and that's not a contradiction
LLM

LLMs crush coding and math but choke on casual questions, and that's not a contradiction

Large language models demonstrate strong performance on structured tasks like coding and mathematics but struggle with...

The Decoder
AI fundamentals
LLM

AI fundamentals

A beginner-friendly guide explaining artificial intelligence fundamentals, including how AI works and how large...

OpenAI Blog
Financial services
LLM

Financial services

This article explores AI resources specifically designed for financial services institutions, including prompt packs,...

OpenAI Blog
Zhipu AI's GLM-5.1 can rethink its own coding strategy across hundreds of iterations
LLM

Zhipu AI's GLM-5.1 can rethink its own coding strategy across hundreds of iterations

Zhipu AI released GLM-5.1, a new large language model under MIT license that can iteratively refine its coding strategy...

The Decoder
A Comprehensive Implementation Guide to ModelScope for Model Search, Inference, Fine-Tuning, Evaluation, and Export
LLM

A Comprehensive Implementation Guide to ModelScope for Model Search, Inference, Fine-Tuning, Evaluation, and Export

A comprehensive tutorial on using ModelScope for machine learning workflows, covering model search, inference,...

MarkTechPost
The US Army Is Building Its Own Chatbot for Combat
LLM

The US Army Is Building Its Own Chatbot for Combat

The US Army is developing its own chatbot powered by AI and trained on real military data to provide soldiers with...

Wired AI
One in four quotes in AI chatbot responses comes from journalism, Muckrack study finds
LLM

One in four quotes in AI chatbot responses comes from journalism, Muckrack study finds

A Muckrack study analyzing 15 million AI citations found that one in four source references in ChatGPT, Claude, and...

The Decoder
I can’t help rooting for tiny open source AI model maker Arcee
LLM

I can’t help rooting for tiny open source AI model maker Arcee

Arcee, a 26-person U.S. startup, has developed a high-performing open source large language model that is gaining...

TechCrunch AI
AI chatbot traffic grows seven times faster than social media but still trails by a factor of four
LLM

AI chatbot traffic grows seven times faster than social media but still trails by a factor of four

AI chatbot traffic is growing seven times faster than social media platforms, according to Similarweb analysis. Despite...

The Decoder
Alibaba's Qwen team makes AI models think deeper with new algorithm
LLM

Alibaba's Qwen team makes AI models think deeper with new algorithm

Alibaba's Qwen team developed a new algorithm that improves reinforcement learning for reasoning models by weighting...

The Decoder
New Rowhammer attacks give complete control of machines running Nvidia GPUs
LLM

New Rowhammer attacks give complete control of machines running Nvidia GPUs

GDDRHammer, GeForge and GPUBreach hammer GPU memory in ways that hijack the CPU.

Ars Technica AI