LLM · Google

Google speeds up Gemma 4 threefold with multi-token prediction
LLMGoogle

Google speeds up Gemma 4 threefold with multi-token prediction

Google has released multi-token prediction drafters for its Gemma 4 open model that significantly accelerate text...

The Decoder
Google updates AI search to include quotes from Reddit and other sources
LLMGoogle

Google updates AI search to include quotes from Reddit and other sources

Google has updated its AI search feature to incorporate quotes and information from Reddit and other web forums and...

TechCrunch AI
Google AI Releases Multi-Token Prediction (MTP) Drafters for Gemma 4: Delivering Up to 3x Faster Inference Without Quality Loss
LLMGoogle

Google AI Releases Multi-Token Prediction (MTP) Drafters for Gemma 4: Delivering Up to 3x Faster Inference Without Quality Loss

Google AI has released Multi-Token Prediction (MTP) Drafters for the Gemma 4 family of language models, using...

MarkTechPost
Google Adds Event-Driven Webhooks to the Gemini API, Eliminating the Need for Polling in Long-Running AI Jobs
LLMGoogle

Google Adds Event-Driven Webhooks to the Gemini API, Eliminating the Need for Polling in Long-Running AI Jobs

Google has introduced event-driven webhooks to the Gemini API, enabling push-based notifications for long-running AI...

MarkTechPost
Google Gemini now generates full documents, spreadsheets, and presentations directly inside the chat
LLMGoogle

Google Gemini now generates full documents, spreadsheets, and presentations directly inside the chat

Google Gemini now has the ability to generate full documents, spreadsheets, and presentations directly within the chat...

The Decoder
Google rolls out Gemini memory in Europe and wants you to bring your ChatGPT data along
LLMGoogle

Google rolls out Gemini memory in Europe and wants you to bring your ChatGPT data along

Google has rolled out Gemini memory features in Europe, allowing the AI to remember user preferences. The update also...

The Decoder
Google's new AI tools put film scouting in Street View and promise to cut weeks of satellite analysis to minutes
LLMGoogle

Google's new AI tools put film scouting in Street View and promise to cut weeks of satellite analysis to minutes

Google unveiled three new AI imaging tools at Cloud Next, including capabilities for integrating AI-generated images...

The Decoder
Google plans nearly two million new AI chips as it turns to Marvell for custom designs
LLMGoogle

Google plans nearly two million new AI chips as it turns to Marvell for custom designs

Google is partnering with chip designer Marvell Technology to develop two specialized AI chips for its data centers,...

The Decoder
Google AI Launches Gemini 3.1 Flash TTS: A New Benchmark in Expressive and Controllable AI Voice
LLMGoogle

Google AI Launches Gemini 3.1 Flash TTS: A New Benchmark in Expressive and Controllable AI Voice

Google has released Gemini 3.1 Flash TTS, a preview text-to-speech model that enhances speech quality, expressive...

MarkTechPost
Gemini 3.1 Flash TTS: the next generation of expressive AI speech
LLMGoogle

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Google has released Gemini 3.1 Flash TTS, a new audio model featuring granular audio tags that enable precise control...

DeepMind Blog
An End-to-End Coding Guide to NVIDIA KVPress for Long-Context LLM Inference, KV Cache Compression, and Memory-Efficient Generation
LLMGoogle

An End-to-End Coding Guide to NVIDIA KVPress for Long-Context LLM Inference, KV Cache Compression, and Memory-Efficient Generation

A practical tutorial on NVIDIA's KVPress technology for optimizing long-context language model inference through KV...

MarkTechPost
Google’s Gemini AI can answer your questions with 3D models and simulations
LLMGoogle

Google’s Gemini AI can answer your questions with 3D models and simulations

Google has upgraded Gemini to generate interactive 3D models and simulations in response to user questions. Users can...

The Verge AI
Google Gemini now generates interactive visualizations you can tweak and explore right in the chat
LLMGoogle

Google Gemini now generates interactive visualizations you can tweak and explore right in the chat

Google Gemini now supports generating interactive visualizations directly within the chat interface that users can...

The Decoder
A Coding Guide to Build Advanced Document Intelligence Pipelines with Google LangExtract, OpenAI Models, Structured Extraction, and Interactive Visualization
LLMGoogle

A Coding Guide to Build Advanced Document Intelligence Pipelines with Google LangExtract, OpenAI Models, Structured Extraction, and Interactive Visualization

This tutorial demonstrates how to build advanced document intelligence pipelines using Google's LangExtract library...

MarkTechPost
Google's AI Overviews are correct nine out of ten times, study finds
LLMGoogle

Google's AI Overviews are correct nine out of ten times, study finds

A study found that Google's AI Overviews provide accurate responses 90% of the time, despite Google's disclaimer that...

The Decoder
AI benchmarks systematically ignore how humans disagree, Google study finds
LLMGoogle

AI benchmarks systematically ignore how humans disagree, Google study finds

A Google study reveals that standard AI benchmarks using only 3-5 human raters per example are insufficient for...

The Decoder
Google Launches Open Model Family Gemma 4
LLMGoogle

Google Launches Open Model Family Gemma 4

Google has launched Gemma 4, a new open model family designed for advanced reasoning and multimodal capabilities. The...

AI Business
Gemma 4: Byte for byte, the most capable open models
LLMGoogle

Gemma 4: Byte for byte, the most capable open models

Google announces Gemma 4, their most capable open-source language models designed for advanced reasoning and agentic...

DeepMind Blog
Gemini 3.1 Flash Live: Making audio AI more natural and reliable
LLMGoogle

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Google has released Gemini 3.1 Flash, an improved voice model featuring enhanced precision and lower latency for more...

DeepMind Blog