Goblin
News
AI news by
promptgoblins.ai
|
News
About
News
About
Filtered by:
gpu
Clear
Titles
Summaries
April
7
AutoSP: DeepSpeed Tool Automates Sequence Parallelism for Long-Context LLM Training
Infra
1
Apr 30
7
AutoSP: DeepSpeed Tool Automates Sequence Parallelism for Long-Context LLM Training
Infra
· 1 src · Apr 30
Discuss
6
Hyperscalers Raise Data-Center Capex, Boosting AI Chip Suppliers
Markets
1
Apr 30
6
Hyperscalers Raise Data-Center Capex, Boosting AI Chip Suppliers
Markets
· 1 src · Apr 30
Discuss
8
Meta Raises 2026 Capex to $125B–$145B Amid AI Spending Anxiety
Updated
Infra
5
Apr 30
8
Meta Raises 2026 Capex to $125B–$145B Amid AI Spending Anxiety
Top
Infra
· 5 srcs · Apr 30
Discuss
7
Microsoft Commits A$25B (~$18B USD) to Australia for AI, Cloud, and Digital Infrastructure
Infra
1
Apr 24
7
Microsoft Commits A$25B (~$18B USD) to Australia for AI, Cloud, and Digital Infrastructure
Infra
· 1 src · Apr 24
Discuss
6
Jefferies: AI Compute Demand Outstrips Supply, Hyperscalers Remain Top Beneficiaries
Infra
1
Apr 24
6
Jefferies: AI Compute Demand Outstrips Supply, Hyperscalers Remain Top Beneficiaries
Infra
· 1 src · Apr 24
Discuss
6
Amazon SageMaker AI Automates Generative AI Inference Deployment
Products
1
Apr 22
6
Amazon SageMaker AI Automates Generative AI Inference Deployment
Products
· 1 src · Apr 22
Discuss
7
Stargate AI: All Seven US Sites Under Active Development
Infra
1
Apr 21
7
Stargate AI: All Seven US Sites Under Active Development
Infra
· 1 src · Apr 21
Discuss
8
Multi-Agent System Achieves 38% CUDA Kernel Speedup in 3 Weeks for NVIDIA Blackwell GPUs
Infra
1
Apr 15
8
Multi-Agent System Achieves 38% CUDA Kernel Speedup in 3 Weeks for NVIDIA Blackwell GPUs
Infra
· 1 src · Apr 15
Discuss
6
Parasail Raises $32M Series A to Scale Cheap AI Inference
Markets
1
Apr 15
6
Parasail Raises $32M Series A to Scale Cheap AI Inference
Markets
· 1 src · Apr 15
Discuss
6
Allbirds Sells Shoe Brand for $39M, Rebrands as NewBird AI GPU Cloud Provider
Updated
Markets
3
Apr 16
6
Allbirds Sells Shoe Brand for $39M, Rebrands as NewBird AI GPU Cloud Provider
Markets
· 3 srcs · Apr 16
Discuss
7
AI Compute Scarcity: Blackwell GPU Prices Surge 114% in Six Weeks as Frontier Access Tightens
Updated
Infra
2
Apr 28
7
AI Compute Scarcity: Blackwell GPU Prices Surge 114% in Six Weeks as Frontier Access Tightens
Infra
· 2 srcs · Apr 28
Discuss
7
Stanford AI Index 2026: Compute Surge, Robotics Gap, and a Divided Public
Updated
Research
3
Apr 19
7
Stanford AI Index 2026: Compute Surge, Robotics Gap, and a Divided Public
Research
· 3 srcs · Apr 19
Discuss
6
Kepler Communications Opens Largest Orbital Compute Cluster for Business
Infra
1
Apr 13
6
Kepler Communications Opens Largest Orbital Compute Cluster for Business
Infra
· 1 src · Apr 13
Discuss
7
CoreWeave Signs Multi-Year Compute Agreement With Anthropic
Updated
Infra
2
Apr 13
7
CoreWeave Signs Multi-Year Compute Agreement With Anthropic
Infra
· 2 srcs · Apr 13
Discuss
7
CoreWeave Secures $21B Additional Meta Deal, Pushing Revenue Backlog to $87.8B
Infra
2
Apr 10
7
CoreWeave Secures $21B Additional Meta Deal, Pushing Revenue Backlog to $87.8B
Infra
· 2 srcs · Apr 10
Discuss
6
Monarch: PyTorch Framework Brings Supercomputer Control to Python API
Infra
1
Apr 9
6
Monarch: PyTorch Framework Brings Supercomputer Control to Python API
Infra
· 1 src · Apr 9
Discuss
7
Apple Signs Tiny Corp. Driver Enabling Nvidia GPUs on Apple Silicon Macs
Infra
1
Apr 8
7
Apple Signs Tiny Corp. Driver Enabling Nvidia GPUs on Apple Silicon Macs
Infra
· 1 src · Apr 8
Discuss
6
Warp Decode: 1.84x Faster MoE Inference by Flipping the Parallelism Axis on Blackwell GPUs
Research
1
Apr 8
6
Warp Decode: 1.84x Faster MoE Inference by Flipping the Parallelism Axis on Blackwell GPUs
Research
· 1 src · Apr 8
Discuss
7
Nvidia Acquires SchedMD: Slurm Ownership Raises Competitive Neutrality Concerns
Markets
1
Apr 7
7
Nvidia Acquires SchedMD: Slurm Ownership Raises Competitive Neutrality Concerns
Markets
· 1 src · Apr 7
Discuss
6
US vs. China AI Race: Brains vs. Bodies, Chips vs. Robots
Policy
1
Apr 7
6
US vs. China AI Race: Brains vs. Bodies, Chips vs. Robots
Policy
· 1 src · Apr 7
Discuss
6
Micron, Amazon, and Microsoft Sell Off Despite Strong AI-Driven Earnings
Markets
1
Apr 5
6
Micron, Amazon, and Microsoft Sell Off Despite Strong AI-Driven Earnings
Markets
· 1 src · Apr 5
Discuss
6
NVIDIA's 20x KV Cache Compression Breakthrough and Speculative Tesla FSD HW3 Application
Research
1
Apr 3
6
NVIDIA's 20x KV Cache Compression Breakthrough and Speculative Tesla FSD HW3 Application
Research
· 1 src · Apr 3
Discuss
March
7
ScaleOps Raises $130M Series C to Automate Kubernetes Resource Management
Markets
1
Mar 30
7
ScaleOps Raises $130M Series C to Automate Kubernetes Resource Management
Markets
· 1 src · Mar 30
Discuss
6
SageMaker Training Plans Now Reserve GPU Capacity for Inference Endpoints
Products
1
Mar 28
6
SageMaker Training Plans Now Reserve GPU Capacity for Inference Endpoints
Products
· 1 src · Mar 28
Discuss
7.12
SK Hynix Files for US IPO, Targeting $10B–$14B to Close Valuation Gap
Markets
1
Mar 27
7.12
SK Hynix Files for US IPO, Targeting $10B–$14B to Close Valuation Gap
Markets
· 1 src · Mar 27
Discuss
7
AI Supply Chain Faces Cascading Risk From Middle East War and Energy Shock
Infra
3
Mar 27
7
AI Supply Chain Faces Cascading Risk From Middle East War and Energy Shock
Infra
· 3 srcs · Mar 27
Discuss
Last Week
7
Kog AI Claims 3,000 Tokens/s Single-Request Inference on Standard GPUs
Infra
1
3d ago
7
Kog AI Claims 3,000 Tokens/s Single-Request Inference on Standard GPUs
Infra
· 1 src · 3d ago
Discuss
6
NVIDIA CompileIQ: AI-Powered Compiler Auto-Tuning Lands in CUDA 13.3
Infra
1
6d ago
6
NVIDIA CompileIQ: AI-Powered Compiler Auto-Tuning Lands in CUDA 13.3
Infra
· 1 src · 6d ago
Discuss
6
Analysis: AI Data Center Buildout Could Rival Railroad Era, Spark Neocloud Boom
Markets
1
May 25
6
Analysis: AI Data Center Buildout Could Rival Railroad Era, Spark Neocloud Boom
Markets
· 1 src · May 25
Discuss
7
Nvidia CFO: H100 GPU Rental Prices Up 20% in 2026 Amid Chip Shortage
Markets
1
May 24
7
Nvidia CFO: H100 GPU Rental Prices Up 20% in 2026 Amid Chip Shortage
Markets
· 1 src · May 24
Discuss
2 Weeks Ago
8
NVIDIA Nemotron-Labs Launches Diffusion Language Models That Generate Tokens in Parallel
Models
1
May 23
8
NVIDIA Nemotron-Labs Launches Diffusion Language Models That Generate Tokens in Parallel
Top
Models
· 1 src · May 23
Discuss
7
Frontier AI Labs Use Less Than Half of Global AI Compute, Epoch AI Estimates
Infra
1
May 22
7
Frontier AI Labs Use Less Than Half of Global AI Compute, Epoch AI Estimates
Infra
· 1 src · May 22
Discuss
8
Nvidia Q1 FY2026: $81.6B Revenue, $75.2B Data Center Record, $43B Startup Stakes, $80B Buyback
Updated
Markets
6
May 21
8
Nvidia Q1 FY2026: $81.6B Revenue, $75.2B Data Center Record, $43B Startup Stakes, $80B Buyback
Top
Markets
· 6 srcs · May 21
Discuss
7
NVIDIA Vera Rubin Enters Full Production: Pod-Scale AI Factories Ramping Worldwide
Updated
Infra
2
2d ago
7
NVIDIA Vera Rubin Enters Full Production: Pod-Scale AI Factories Ramping Worldwide
Infra
· 2 srcs · 2d ago
Discuss
6
HIVE Digital Plans 320MW AI Data Centre Near Toronto for CAD $3.5B
Infra
1
May 18
6
HIVE Digital Plans 320MW AI Data Centre Near Toronto for CAD $3.5B
Infra
· 1 src · May 18
Discuss
3 Weeks Ago
6
Async Continuous Batching Eliminates 24% GPU Idle Time in LLM Inference
Research
1
May 15
6
Async Continuous Batching Eliminates 24% GPU Idle Time in LLM Inference
Research
· 1 src · May 15
Discuss
6
US-China AI Race: Two Scenarios for 2028 Global Leadership
Policy
1
May 15
6
US-China AI Race: Two Scenarios for 2028 Global Leadership
Policy
· 1 src · May 15
Discuss
7
PyTorch 2.12: 100x Eigendecomp Speedup, Unified Graph API
Open Source
1
May 14
7
PyTorch 2.12: 100x Eigendecomp Speedup, Unified Graph API
Open Source
· 1 src · May 14
Discuss
7
Modal Explains Four Ingredients for Serverless GPU Scaling
Infra
1
May 13
7
Modal Explains Four Ingredients for Serverless GPU Scaling
Infra
· 1 src · May 13
Discuss
7
Nebius Revenue Jumps Nearly 8x on AI Infrastructure Demand
Markets
1
May 13
7
Nebius Revenue Jumps Nearly 8x on AI Infrastructure Demand
Markets
· 1 src · May 13
Discuss
7
NVIDIA-Backed Sparsity Technique Reported to Deliver 20% LLM Speedup on H100 GPUs
Research
1
May 12
7
NVIDIA-Backed Sparsity Technique Reported to Deliver 20% LLM Speedup on H100 GPUs
Research
· 1 src · May 12
Discuss
6
Nvidia Hits Record $219 Close as AI Trade Accelerates Pre-Earnings
Markets
1
May 12
6
Nvidia Hits Record $219 Close as AI Trade Accelerates Pre-Earnings
Markets
· 1 src · May 12
Discuss
8
Cerebras IPO: $5.55B Raised, Stock Doubles — Chip Is 58x Larger Than Nvidia's B200
Updated
Infra
9
May 22
8
Cerebras IPO: $5.55B Raised, Stock Doubles — Chip Is 58x Larger Than Nvidia's B200
Top
Infra
· 9 srcs · May 22
Discuss
6
CUDA: Why Nvidia's Real Moat Is Software, Not Hardware
Infra
1
May 11
6
CUDA: Why Nvidia's Real Moat Is Software, Not Hardware
Infra
· 1 src · May 11
Discuss
Last Month
6
Meta IKBO: Kernel-Level Broadcast Elimination Cuts RecSys Latency by Two-Thirds
Infra
1
May 8
6
Meta IKBO: Kernel-Level Broadcast Elimination Cuts RecSys Latency by Two-Thirds
Infra
· 1 src · May 8
Discuss
6
Unsloth + NVIDIA Collaboration Cuts LLM Training Time by ~25%
Infra
1
May 7
6
Unsloth + NVIDIA Collaboration Cuts LLM Training Time by ~25%
Infra
· 1 src · May 7
Discuss
8
Anthropic Signs $1.25B/Month Compute Deal With xAI's Colossus 1 Data Center
Updated
Infra
7
May 20
8
Anthropic Signs $1.25B/Month Compute Deal With xAI's Colossus 1 Data Center
Top
Infra
· 7 srcs · May 20
Discuss
7
AMD Forecasts Q2 Revenue Above Expectations on Strong AI Chip Demand
Markets
1
May 5
7
AMD Forecasts Q2 Revenue Above Expectations on Strong AI Chip Demand
Markets
· 1 src · May 5
Discuss
7
DigitalOcean Launches AI-Native Cloud at Deploy 2026 with 15 New Products
Infra
1
May 5
7
DigitalOcean Launches AI-Native Cloud at Deploy 2026 with 15 New Products
Infra
· 1 src · May 5
Discuss
6
KV Cache Locality: How Load Balancing Drives Up LLM Serving Costs
Infra
1
May 1
6
KV Cache Locality: How Load Balancing Drives Up LLM Serving Costs
Infra
· 1 src · May 1
Discuss
Filters
Signal
Title
Category
Sources
Posted
Discuss