Research & Papers

Latest AI research

Meet mKernel: A Multi-GPU, Multi-Node Fused Kernel Library for GPU-Driven Communication
Featured
Research & Papers

Meet mKernel: A Multi-GPU, Multi-Node Fused Kernel Library for GPU-Driven Communication

UC Berkeley's UCCL team releases mKernel, fusing intra-node NVLink, inter-node RDMA, and dense compute into a single persistent CUDA kernel. The post Meet mKernel: A Multi-GPU, Multi-Node Fused...

MarkTechPost
Read more
RSI is the new AGI — and it’s just as hard to pin down
Research & Papers

RSI is the new AGI — and it’s just as hard to pin down

Multiple AI labs are pursuing recursive self-improvement (RSI) as a path toward artificial general intelligence, but...

TechCrunch AI
A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System
Research & Papers

A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System

This tutorial demonstrates how to build a vector search system using pgvector in PostgreSQL, integrating semantic,...

MarkTechPost
Sakana AI Proposes DiffusionBlocks: a Block-wise Training Framework That Converts Residual Networks into Independently Trainable Denoising Modules
Research & Papers

Sakana AI Proposes DiffusionBlocks: a Block-wise Training Framework That Converts Residual Networks into Independently Trainable Denoising Modules

Sakana AI proposes DiffusionBlocks, a novel framework that converts residual networks into independently trainable...

MarkTechPost
MEMO: A Modular Framework for Training a Dedicated Memory Model on New Knowledge Without Modifying LLM Parameters
Research & Papers

MEMO: A Modular Framework for Training a Dedicated Memory Model on New Knowledge Without Modifying LLM Parameters

Researchers from NUS, MIT, and A*STAR introduce MEMO, a modular framework that trains a separate memory model to encode...

MarkTechPost
Design a High-Precision Retrieve-and-Rerank Pipeline with ZeroEntropy Zerank-2 Reranker
Research & Papers

Design a High-Precision Retrieve-and-Rerank Pipeline with ZeroEntropy Zerank-2 Reranker

This tutorial demonstrates how to build a high-precision retrieve-and-rerank pipeline using the ZeroEntropy Zerank-2...

MarkTechPost
Stability AI Releases Stable Audio 3: A Family of Fast Latent Diffusion Models for Audio Generation and Editing
Research & Papers

Stability AI Releases Stable Audio 3: A Family of Fast Latent Diffusion Models for Audio Generation and Editing

Stability AI has released Stable Audio 3, a family of latent diffusion models for audio generation and editing with...

MarkTechPost
Import AI 458: Reckoning with the future; and a singularity story
Research & Papers

Import AI 458: Reckoning with the future; and a singularity story

An article from Import AI 458 discussing potential AI-driven breakthroughs and developments expected in the coming...

Import AI
Design a Complete Multimodal RLVR Pipeline with Open-MM-RL, Vision-Language Prompting, Reward Scoring, and GRPO Export
Research & Papers

Design a Complete Multimodal RLVR Pipeline with Open-MM-RL, Vision-Language Prompting, Reward Scoring, and GRPO Export

This tutorial demonstrates how to build a complete multimodal reinforcement learning pipeline using the Open-MM-RL...

MarkTechPost
Step by Step Guide to Build and Compare FedAvg and FedProx Federated Learning on Non-IID CIFAR-10 with NVIDIA FLARE
Research & Papers

Step by Step Guide to Build and Compare FedAvg and FedProx Federated Learning on Non-IID CIFAR-10 with NVIDIA FLARE

This tutorial demonstrates how to build and compare FedAvg and FedProx federated learning algorithms using NVIDIA FLARE...

MarkTechPost
Google Deepmind's AlphaProof Nexus solves decades-old math problems for a few hundred dollars
Research & PapersGoogle

Google Deepmind's AlphaProof Nexus solves decades-old math problems for a few hundred dollars

Google DeepMind's AlphaProof Nexus has autonomously solved nine open Erdős problems, including two unsolved for 56...

The Decoder
Deepmind's Hassabis sees humanity "in the foothills of the singularity" while LeCun says current AI isn't intelligent
Research & PapersGoogle

Deepmind's Hassabis sees humanity "in the foothills of the singularity" while LeCun says current AI isn't intelligent

DeepMind's Demis Hassabis and other AI leaders debate the current state of artificial intelligence, with Hassabis...

The Decoder
Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed
Research & Papers

Researchers let Claude Code discover AI scaling algorithms that humans probably wouldn't have designed

Researchers used AutoTTS with Claude Code to autonomously discover AI control algorithms that reduce compute...

The Decoder
NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule
Research & Papers

NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule

NVIDIA AI releases Gated DeltaNet-2, a new linear attention layer that improves upon previous delta-rule models by...

MarkTechPost
Nous Research Releases Contrastive Neuron Attribution (CNA): Sparse MLP Circuit Steering Without SAE Training or Weight Modification
Research & Papers

Nous Research Releases Contrastive Neuron Attribution (CNA): Sparse MLP Circuit Steering Without SAE Training or Weight Modification

Nous Research introduces Contrastive Neuron Attribution (CNA), a novel method for steering large language model...

MarkTechPost
Catch up on the Dialogues stage at Google I/O 2026.
Research & PapersGoogle

Catch up on the Dialogues stage at Google I/O 2026.

Google I/O 2026 featured the Dialogues stage where industry leaders discussed advancements and future directions in AI,...

Google AI Blog
Build Recurrent-Depth Transformers with OpenMythos for MLA, GQA, Sparse MoE, and Loop-Scaled Reasoning
Research & Papers

Build Recurrent-Depth Transformers with OpenMythos for MLA, GQA, Sparse MoE, and Loop-Scaled Reasoning

This tutorial demonstrates how to build advanced recurrent-depth transformer models using OpenMythos, including MLA and...

MarkTechPost
How CopilotKit Is Redefining the Agentic AI Stack in 2026
Research & Papers

How CopilotKit Is Redefining the Agentic AI Stack in 2026

An inside look at CopilotKit’s 2026 shipping cycle. Learn how the new AG-UI protocol, AIMock testing suite, and...

MarkTechPost
Qwen Introduces Qwen3.7-Max: A Reasoning Agent Model With a 1M-Token Context Window
Research & Papers

Qwen Introduces Qwen3.7-Max: A Reasoning Agent Model With a 1M-Token Context Window

Alibaba's Qwen team introduced Qwen3.7-Max at the 2026 Alibaba Cloud Summit, describing it as its most advanced and...

MarkTechPost
Cohere Releases Command A+: A 218B Sparse MoE Model for Agentic Workflows That Runs on as Few as Two H100 GPUs
Research & Papers

Cohere Releases Command A+: A 218B Sparse MoE Model for Agentic Workflows That Runs on as Few as Two H100 GPUs

Cohere releases Command A+, an open-source 218B Sparse Mixture-of-Experts model consolidating four prior Command A...

MarkTechPost

Stay Updated

Get the latest AI news delivered to your inbox every morning. No spam, unsubscribe anytime.