Goblin News
AI news by promptgoblins.ai
Filtered by: datasets
April
7
Failed Startups Sell Slack Logs and Emails to Train AI Agents
Security · 1 src · Apr 20
6
Data Pruning at Training Time Boosts LLM Fact Memorization by 1.3X
Research · 1 src · Apr 14
7
Scale AI Workers Scraped Meta User Data and Explicit Content to Train AI
Safety · 1 src · Apr 7
8
Meta Pauses Mercor Work After Breach Exposes AI Training Data
Top · Security · 2 srcs · Apr 3
7
Humanoid Robot Training Fuels Global Gig Economy for At-Home Video Data Collection
Enterprise · 1 src · Apr 3
6
NomadicML Raises $8.4M to Structurize Autonomous Vehicle Fleet Data
Markets · 1 src · Apr 3
March
7
NanoGPT Slowrun Achieves 10x Data Efficiency via Ensembling
Research · 1 src · Mar 20
7
AI2 Releases MolmoPoint: Native Visual Grounding for Vision-Language Models
Research · 1 src · Mar 20
6
DoorDash Launches 'Tasks' App, Paying Couriers to Collect AI Training Data
Updated · Products · 2 srcs · Mar 21
8
Open-H-Embodiment Launches 778-Hour Healthcare Robotics Dataset and Models
Updated · Research · 2 srcs · Mar 17
7
NVIDIA Releases 2+ Petabytes of Open AI Training Data on HuggingFace
Open Source · 1 src · Mar 13
6
NVIDIA Releases 15M-Problem Synthetic Code Dataset, Boosts HumanEval by 6 Points
Open Source · 1 src · Mar 13
Last Week
7
NVIDIA LocateAnything: Parallel Box Decoding Breaks VLM Grounding Speed-Accuracy Tradeoff
Research · 1 src · 5d ago
6
Human Archive Raises $8.2M to Harvest Robot Training Data from India's Gig Economy
Markets · 1 src · May 26
6
Norway's National Library Builds Sovereign Norwegian LLM on 2PB Flash
Infra · 1 src · May 26
3 Weeks Ago
7
Meta's Mandatory Employee Laptop Surveillance Sparks Internal Petition and Unionization Push
Policy · 1 src · May 14
6
Internet Archive at 30: AI Era Threatens Web's Memory Bank
Policy · 1 src · May 14
Last Month
6
Microsoft Research Releases Open U.S. Power Grid Dataset for AI Planning
Infra · 1 src · May 8
8
AI2 Releases MolmoAct 2: Open Robotics Model Beats GPT-5 on Embodied Reasoning
Top · Models · 1 src · May 6
Signal · Title · Category · Sources · Posted