Goblin
News
AI news by
promptgoblins.ai
|
News
About
News
About
Filtered by:
reasoning
Clear
Titles
Summaries
April
6
DataPRM: Process Reward Model for Reliable Agentic Data Analysis
Research
1
Apr 30
6
DataPRM: Process Reward Model for Reliable Agentic Data Analysis
Research
· 1 src · Apr 30
Discuss
6
LaDiR: UC San Diego Team Proposes Latent Diffusion Framework to Overcome LLM Reasoning Limits
Research
1
Apr 30
6
LaDiR: UC San Diego Team Proposes Latent Diffusion Framework to Overcome LLM Reasoning Limits
Research
· 1 src · Apr 30
Discuss
8
Amateur Solves 60-Year-Old Math Problem Using a Single ChatGPT Prompt
Research
1
Apr 27
8
Amateur Solves 60-Year-Old Math Problem Using a Single ChatGPT Prompt
Research
· 1 src · Apr 27
Discuss
7
Test-Time Scaling Breakthrough Pushes Coding Agents Past 77% on SWE-Bench
Research
1
Apr 27
7
Test-Time Scaling Breakthrough Pushes Coding Agents Past 77% on SWE-Bench
Research
· 1 src · Apr 27
Discuss
9
DeepSeek Launches V4 Flash and V4 Pro, Claims Frontier-Level Performance
Updated
Models
5
May 4
9
DeepSeek Launches V4 Flash and V4 Pro, Claims Frontier-Level Performance
Top
Models
· 5 srcs · May 4
Discuss
6
"Learning Mechanics": Researchers Argue a Scientific Theory of Deep Learning Is Taking Shape
Research
1
Apr 24
6
"Learning Mechanics": Researchers Argue a Scientific Theory of Deep Learning Is Taking Shape
Research
· 1 src · Apr 24
Discuss
7
Ex-OpenAI Researcher Jerry Tworek Launches Core Automation AI Lab
Research
1
Apr 23
7
Ex-OpenAI Researcher Jerry Tworek Launches Core Automation AI Lab
Research
· 1 src · Apr 23
Discuss
7
RLVR Weak Supervision: When LLMs Can and Cannot Generalize
Research
1
Apr 22
7
RLVR Weak Supervision: When LLMs Can and Cannot Generalize
Research
· 1 src · Apr 22
Discuss
7
40 AI Researchers Warn Interpretability Window Is Closing as Models Grow More Opaque
Research
1
Apr 21
7
40 AI Researchers Warn Interpretability Window Is Closing as Models Grow More Opaque
Research
· 1 src · Apr 21
Discuss
7
Google Launches Deep Research Max for Enterprise Autonomous Research
Products
1
Apr 21
7
Google Launches Deep Research Max for Enterprise Autonomous Research
Products
· 1 src · Apr 21
Discuss
6
Qwen3.6 Released with Agentic Coding and Thinking Preservation Features
Models
1
Apr 17
6
Qwen3.6 Released with Agentic Coding and Thinking Preservation Features
Models
· 1 src · Apr 17
Discuss
8
Anthropic Releases Claude Opus 4.7: Hybrid Reasoning, 1M Context Window, and Cybersecurity Safeguards
Updated
Models
8
Apr 29
8
Anthropic Releases Claude Opus 4.7: Hybrid Reasoning, 1M Context Window, and Cybersecurity Safeguards
Top
Models
· 8 srcs · Apr 29
Discuss
3 Weeks Ago
9
Microsoft AI Launches 7-Model MAI Family and Declares Itself a Superintelligence Lab
Models
4
Jun 3
9
Microsoft AI Launches 7-Model MAI Family and Declares Itself a Superintelligence Lab
Top
Models
· 4 srcs · Jun 3
Discuss
7
JetBrains Releases Mellum2: Open-Source 12B MoE with Instruct and Thinking Variants for Developer AI
Updated
Models
2
Jun 2
7
JetBrains Releases Mellum2: Open-Source 12B MoE with Instruct and Thinking Variants for Developer AI
Models
· 2 srcs · Jun 2
Discuss
Last Month
7
Liquid AI Releases LFM2.5-8B-A1B: Edge MoE Model Trained on 38T Tokens
Models
1
May 30
7
Liquid AI Releases LFM2.5-8B-A1B: Edge MoE Model Trained on 38T Tokens
Models
· 1 src · May 30
Discuss
8
Claude Mythos Solves 1946 Erdős Conjecture With Simple Proof
Research
1
May 27
8
Claude Mythos Solves 1946 Erdős Conjecture With Simple Proof
Top
Research
· 1 src · May 27
Discuss
8
AlphaProof Nexus Cracks Decades-Old Math Problems for Hundreds of Dollars
Research
1
May 26
8
AlphaProof Nexus Cracks Decades-Old Math Problems for Hundreds of Dollars
Research
· 1 src · May 26
Discuss
9
OpenAI Model Produces Verified Disproof of 80-Year-Old Erdős Geometry Conjecture
Updated
Research
5
Jun 1
9
OpenAI Model Produces Verified Disproof of 80-Year-Old Erdős Geometry Conjecture
Top
Research
· 5 srcs · Jun 1
Discuss
7
Mode-Hopping: LLMs Oscillate Between Parroting and Reasoning During Pre-training
Research
1
May 19
7
Mode-Hopping: LLMs Oscillate Between Parroting and Reasoning During Pre-training
Research
· 1 src · May 19
Discuss
6
Gemini App Rolls Out Extended Thinking Levels and Three New Third-Party Integrations
Products
1
May 18
6
Gemini App Rolls Out Extended Thinking Levels and Three New Third-Party Integrations
Products
· 1 src · May 18
Discuss
7
Anthropic Maps Claude's Internal Reasoning with New Interpretability Tools
Research
1
May 16
7
Anthropic Maps Claude's Internal Reasoning with New Interpretability Tools
Research
· 1 src · May 16
Discuss
7
Perceptron Mk1: Video Analysis AI Model Priced 80-90% Below Frontier Rivals
Models
1
May 13
7
Perceptron Mk1: Video Analysis AI Model Priced 80-90% Below Frontier Rivals
Models
· 1 src · May 13
Discuss
7
Tsinghua Study: Visual Generation Boosts AI Spatial Reasoning
Research
1
May 12
7
Tsinghua Study: Visual Generation Boosts AI Spatial Reasoning
Research
· 1 src · May 12
Discuss
7
Mathematician Reports ChatGPT 5.5 Pro Produced PhD-Level Math Research in About an Hour
Models
1
May 11
7
Mathematician Reports ChatGPT 5.5 Pro Produced PhD-Level Math Research in About an Hour
Models
· 1 src · May 11
Discuss
8
OpenAI Launches GPT-5.5 Instant as New Default ChatGPT Model
Models
1
May 5
8
OpenAI Launches GPT-5.5 Instant as New Default ChatGPT Model
Top
Models
· 1 src · May 5
Discuss
7
Analysis: Autonomous AI R&D Likely by 2028, Author Argues
Updated
Research
2
May 5
7
Analysis: Autonomous AI R&D Likely by 2028, Author Argues
Research
· 2 srcs · May 5
Discuss
7
Anthropic Red-Teams 'Jupiter V1' Ahead of May 6 Dev Conference
Models
1
May 4
7
Anthropic Red-Teams 'Jupiter V1' Ahead of May 6 Dev Conference
Models
· 1 src · May 4
Discuss
6
Edit-R1: Verifier-Based Reinforcement Learning Framework Advances Image Editing
Research
1
May 4
6
Edit-R1: Verifier-Based Reinforcement Learning Framework Advances Image Editing
Research
· 1 src · May 4
Discuss
8
Google Gemini 3.1 Pro Preview Takes Top Spot on Artificial Analysis Intelligence Index
Models
1
May 1
8
Google Gemini 3.1 Pro Preview Takes Top Spot on Artificial Analysis Intelligence Index
Top
Models
· 1 src · May 1
Discuss
7
ARC Prize: GPT-5.5 and Opus 4.7 Score Below 1% on ARC-AGI-3
Research
1
May 1
7
ARC Prize: GPT-5.5 and Opus 4.7 Score Below 1% on ARC-AGI-3
Research
· 1 src · May 1
Discuss
Filters
Signal
Title
Category
Sources
Posted
Discuss