Goblin News
AI news by promptgoblins.ai
Filtered by:
vllm
April
6
VibeVoice: Open-Source Frontier Voice AI Suite with Long-Form ASR and Real-Time TTS
Open Source
1
Apr 28
March
8
NVIDIA Dynamo 1.0 Launches for Multi-Node AI Inference at Scale
Infra
1
Mar 17
7
Mistral Small 4: Unified 119B MoE Model Released Under Apache 2.0
Models
2
Mar 17
7
AWS + llm-d Bring Disaggregated LLM Inference to SageMaker and EKS
Infra
1
Mar 16
Last Week
6
TRL Adds Delta Weight Sync to Cut Async RL Transfer Costs by ~98%
Open Source
1
5d ago
2 Weeks Ago
6
Amazon SageMaker AI Adds Bidirectional Streaming for Real-Time Voice Apps
Products
1
May 20
Last Month
7
TokenSpeed: Compiler-Backed LLM Inference Engine Built for Agentic Coding Workloads
Infra
1
May 7
6
vLLM V1 Migration: Four Fixes Required for RL Training Parity
Open Source
1
May 6
7
Google Releases MTP Drafters for Gemma 4, Enabling Up to 3x Faster Inference
Updated
Products
2
May 6
7
DigitalOcean Launches AI-Native Cloud at Deploy 2026 with 15 New Products
Infra
1
May 5