Large Language Models

LLM

New Rowhammer attacks give complete control of machines running Nvidia GPUs

GDDRHammer, GeForge and GPUBreach hammer GPU memory in ways that hijack the CPU.

Ars Technica AI
LLM, Microsoft

Microsoft takes on AI rivals with three new foundational models

Microsoft released three new foundational AI models capable of transcribing voice to text, generating audio, and...

TechCrunch AI
LLM, Google

Gemma 4: Byte for byte, the most capable open models

Google announces Gemma 4, its most capable open language models, designed for advanced reasoning and agentic...

DeepMind Blog
LLM, Anthropic

Anthropic Says That Claude Contains Its Own Kind of Emotions

Anthropic researchers have discovered internal representations within Claude that function similarly to human emotions....

Wired AI
LLM, OpenAI

Codex now offers more flexible pricing for teams

Codex introduces pay-as-you-go pricing options for ChatGPT Business and Enterprise tiers, enabling teams greater...

OpenAI Blog
LLM

IBM Releases Granite 4.0 3B Vision: A New Vision Language Model for Enterprise Grade Document Data Extraction

IBM has released Granite 4.0 3B Vision, a specialized vision-language model designed for enterprise document data...

MarkTechPost
LLM

Z.ai Launches GLM-5V-Turbo: A Native Multimodal Vision Coding Model Optimized for OpenClaw and High-Capacity Agentic Engineering Workflows Everywhere

Zhipu AI launches GLM-5V-Turbo, a native multimodal vision-language model designed to bridge visual perception and code...

MarkTechPost
LLM, OpenAI

STADLER reshapes knowledge work at a 230-year-old company

STADLER, a 230-year-old company, has implemented ChatGPT to transform knowledge work processes across its 650...

OpenAI Blog
LLM, Google

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Google has released Gemini 3.1 Flash Live, an improved voice model featuring enhanced precision and lower latency for more...

DeepMind Blog
LLM

ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text

The article discusses LLMs training other LLMs, a 72B parameter distributed training run, and comparative analysis...

Import AI
LLM, Google

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Google has released Gemini 3.1 Flash-Lite, the fastest and most cost-efficient model in the Gemini 3 series. This...

DeepMind Blog
LLM

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

Nano Banana 2 is a new image generation model that combines advanced capabilities with exceptional speed performance....

DeepMind Blog
LLM

Import AI 446: Nuclear LLMs; China's big AI benchmark; measurement and AI policy

The article covers multiple AI topics including nuclear-powered LLMs, China's major AI benchmark initiative, and the...

Import AI
LLM, Google

Gemini 3.1 Pro: A smarter model for your most complex tasks

Google announces Gemini 3.1 Pro, a new large language model designed to handle complex tasks that require more...

DeepMind Blog
LLM, Google

A new way to express yourself: Gemini can now create music

Google has integrated its advanced Lyria 3 music generation model into the Gemini app, allowing users to create...

DeepMind Blog
LLM

Import AI 444: LLM societies; Huawei makes kernels with AI; ChipBench

Import AI 444 covers multiple AI developments including LLM societies, Huawei's AI-assisted kernel development, and...

Import AI
LLM, DeepSeek

DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theorem Proving with Recursive Proof Search and a New Benchmark

DeepSeek AI releases DeepSeek-Prover-V2, an open-source LLM specialized for Lean 4 theorem proving that uses recursive...

Synced Review
LLM, DeepSeek

DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT

DeepSeek AI has published research on a new technique called SPCT for scaling general reward models during inference,...

Synced Review