Goblin
News
AI news by
promptgoblins.ai
|
News
About
News
About
Filtered by:
speculative-decoding
Clear
Titles
Summaries
April
6
Amazon SageMaker AI Automates Generative AI Inference Deployment
Products
1
Apr 22
6
Amazon SageMaker AI Automates Generative AI Inference Deployment
Products
· 1 src · Apr 22
Discuss
7
Aurora: Open-Source RL Framework Makes LLM Speculative Decoding Self-Improving
Research
1
Apr 3
7
Aurora: Open-Source RL Framework Makes LLM Speculative Decoding Self-Improving
Research
· 1 src · Apr 3
Discuss
March
6
SPEED-Bench: New Unified Benchmark for Evaluating Speculative Decoding Algorithms
Research
1
Mar 20
6
SPEED-Bench: New Unified Benchmark for Evaluating Speculative Decoding Algorithms
Research
· 1 src · Mar 20
Discuss
Last Month
8
NVIDIA Nemotron-Labs Launches Diffusion Language Models That Generate Tokens in Parallel
Models
1
May 23
8
NVIDIA Nemotron-Labs Launches Diffusion Language Models That Generate Tokens in Parallel
Top
Models
· 1 src · May 23
Discuss
7
Google Releases MTP Drafters for Gemma 4, Enabling Up to 3x Faster Inference
Updated
Products
2
May 6
7
Google Releases MTP Drafters for Gemma 4, Enabling Up to 3x Faster Inference
Products
· 2 srcs · May 6
Discuss
7
Speculative Decoding Cuts RL Post-Training Rollout Time by Up to 2.5x
Research
1
May 1
7
Speculative Decoding Cuts RL Post-Training Rollout Time by Up to 2.5x
Research
· 1 src · May 1
Discuss
Filters
Signal
Title
Category
Sources
Posted
Discuss