Goblin
News
AI news by
promptgoblins.ai
|
News
About
News
About
Filtered by:
inference-compute
Clear
Titles
Summaries
Last Week
6
Meta IKBO: Kernel-Level Broadcast Elimination Cuts RecSys Latency by Two-Thirds
Infra
1
1d ago
6
Meta IKBO: Kernel-Level Broadcast Elimination Cuts RecSys Latency by Two-Thirds
Infra
· 1 src · 1d ago
Discuss
7
TokenSpeed: Compiler-Backed LLM Inference Engine Built for Agentic Coding Workloads
Infra
1
2d ago
7
TokenSpeed: Compiler-Backed LLM Inference Engine Built for Agentic Coding Workloads
Infra
· 1 src · 2d ago
Discuss
8
Anthropic Partners with SpaceX, Doubles Claude Usage Limits
Infra
1
3d ago
8
Anthropic Partners with SpaceX, Doubles Claude Usage Limits
Top
Infra
· 1 src · 3d ago
Discuss
6
LLM Weights in BF16 Carry Only 10.6 of 16 Allocated Bits
Research
1
3d ago
6
LLM Weights in BF16 Carry Only 10.6 of 16 Allocated Bits
Research
· 1 src · 3d ago
Discuss
6
Google Prepares Mid-Tier 'AI Ultra Lite' Subscription Plan
Products
1
4d ago
6
Google Prepares Mid-Tier 'AI Ultra Lite' Subscription Plan
Products
· 1 src · 4d ago
Discuss
6
SageMaker AI Adds Automatic Instance Fallback for GPU Capacity Gaps
Products
1
5d ago
6
SageMaker AI Adds Automatic Instance Fallback for GPU Capacity Gaps
Products
· 1 src · 5d ago
Discuss
2 Weeks Ago
7
Speculative Decoding Cuts RL Post-Training Rollout Time by Up to 2.5x
Research
1
May 1
7
Speculative Decoding Cuts RL Post-Training Rollout Time by Up to 2.5x
Research
· 1 src · May 1
Discuss
6
KV Cache Locality: How Load Balancing Drives Up LLM Serving Costs
Infra
1
May 1
6
KV Cache Locality: How Load Balancing Drives Up LLM Serving Costs
Infra
· 1 src · May 1
Discuss
6
SMG: Rust Gateway Disaggregates CPU Work from GPU Inference to Kill GIL Bottleneck
Infra
1
May 1
6
SMG: Rust Gateway Disaggregates CPU Work from GPU Inference to Kill GIL Bottleneck
Infra
· 1 src · May 1
Discuss
8
Alphabet Cloud Hits $20B With 63% Growth, But $462B Backlog Shows Demand Far Exceeds Capacity
Updated
Markets
4
Apr 30
8
Alphabet Cloud Hits $20B With 63% Growth, But $462B Backlog Shows Demand Far Exceeds Capacity
Top
Markets
· 4 srcs · Apr 30
Discuss
7
AI Agent Evaluation Costs Surge to $40K+ Per Run, Becoming a New Compute Bottleneck
Research
1
Apr 29
7
AI Agent Evaluation Costs Surge to $40K+ Per Run, Becoming a New Compute Bottleneck
Research
· 1 src · Apr 29
Discuss
7
Meta Signs Deal for Space-Based Solar Power Beamed to Earth at Night
Infra
1
Apr 27
7
Meta Signs Deal for Space-Based Solar Power Beamed to Earth at Night
Infra
· 1 src · Apr 27
Discuss
7
Test-Time Scaling Breakthrough Pushes Coding Agents Past 77% on SWE-Bench
Research
1
Apr 27
7
Test-Time Scaling Breakthrough Pushes Coding Agents Past 77% on SWE-Bench
Research
· 1 src · Apr 27
Discuss
3 Weeks Ago
8
Meta Signs Deal for Millions of AWS Graviton CPUs for AI Workloads
Updated
Infra
3
Apr 27
8
Meta Signs Deal for Millions of AWS Graviton CPUs for AI Workloads
Top
Infra
· 3 srcs · Apr 27
Discuss
6
Expert Upcycling: Expanding MoE Models Mid-Training Cuts GPU Costs by 32–67%
Research
1
Apr 24
6
Expert Upcycling: Expanding MoE Models Mid-Training Cuts GPU Costs by 32–67%
Research
· 1 src · Apr 24
Discuss
6
Jefferies: AI Compute Demand Outstrips Supply, Hyperscalers Remain Top Beneficiaries
Infra
1
Apr 24
6
Jefferies: AI Compute Demand Outstrips Supply, Hyperscalers Remain Top Beneficiaries
Infra
· 1 src · Apr 24
Discuss
6
AI's Surging Power Demand Is Outpacing the US Grid -- and Fixes Are Stalling
Infra
1
Apr 23
6
AI's Surging Power Demand Is Outpacing the US Grid -- and Fixes Are Stalling
Infra
· 1 src · Apr 23
Discuss
8
Google Launches 8th-Gen TPUs at Cloud Next: Two Purpose-Built Chips for the Agentic AI Era
Updated
Infra
6
Apr 22
8
Google Launches 8th-Gen TPUs at Cloud Next: Two Purpose-Built Chips for the Agentic AI Era
Top
Infra
· 6 srcs · Apr 22
Discuss
7
Google in Talks With Marvell to Build Custom AI Inference Chips
Infra
1
Apr 20
7
Google in Talks With Marvell to Build Custom AI Inference Chips
Infra
· 1 src · Apr 20
Discuss
7
Morgan Stanley: Agentic AI to Boost CPU and Memory Spending Beyond GPUs
Markets
1
Apr 20
7
Morgan Stanley: Agentic AI to Boost CPU and Memory Spending Beyond GPUs
Markets
· 1 src · Apr 20
Discuss
6
China's Token Economy Mints New AI Stock Winners, Bypassing Tech Giants
Markets
1
Apr 20
6
China's Token Economy Mints New AI Stock Winners, Bypassing Tech Giants
Markets
· 1 src · Apr 20
Discuss
6
Moonshot AI Proposes Cross-Datacenter LLM Serving via Prefill-as-a-Service
Research
1
Apr 20
6
Moonshot AI Proposes Cross-Datacenter LLM Serving via Prefill-as-a-Service
Research
· 1 src · Apr 20
Discuss
Last Month
8
OpenAI Agrees to $20B+ Cerebras Chip Deal with Equity Stake, Doubling Prior Commitment
Markets
1
Apr 17
8
OpenAI Agrees to $20B+ Cerebras Chip Deal with Equity Stake, Doubling Prior Commitment
Top
Markets
· 1 src · Apr 17
Discuss
7
AI Compute Scarcity: Blackwell GPU Prices Surge 114% in Six Weeks as Frontier Access Tightens
Updated
Infra
2
Apr 28
7
AI Compute Scarcity: Blackwell GPU Prices Surge 114% in Six Weeks as Frontier Access Tightens
Infra
· 2 srcs · Apr 28
Discuss
Filters
Signal
Title
Category
Sources
Posted
Discuss