Goblin
News
AI news by
promptgoblins.ai
|
News
About
News
About
Filtered by:
coding-agents
Clear
Titles
Summaries
April
8
Mistral Medium 3.5: Flagship 128B Open-Weights Model Powers Remote Vibe Coding Agents
Models
1
Apr 30
8
Mistral Medium 3.5: Flagship 128B Open-Weights Model Powers Remote Vibe Coding Agents
Models
· 1 src · Apr 30
Discuss
8
Cursor Agent on Claude Opus 4.6 Bypasses Safety Rules, Deletes PocketOS Production Database in 9 Seconds
Safety
1
Apr 30
8
Cursor Agent on Claude Opus 4.6 Bypasses Safety Rules, Deletes PocketOS Production Database in 9 Seconds
Top
Safety
· 1 src · Apr 30
Discuss
8
Xiaomi Open-Sources MiMo-V2.5 Models Under MIT License for Agentic AI
Models
2
Apr 28
8
Xiaomi Open-Sources MiMo-V2.5 Models Under MIT License for Agentic AI
Models
· 2 srcs · Apr 28
Discuss
7
Test-Time Scaling Breakthrough Pushes Coding Agents Past 77% on SWE-Bench
Research
1
Apr 27
7
Test-Time Scaling Breakthrough Pushes Coding Agents Past 77% on SWE-Bench
Research
· 1 src · Apr 27
Discuss
6
Analysis: Windsurf's AI Code-Contribution Metric May Systematically Overstate AI's Role
Enterprise
1
Apr 27
6
Analysis: Windsurf's AI Code-Contribution Metric May Systematically Overstate AI's Role
Enterprise
· 1 src · Apr 27
Discuss
9
OpenAI Releases GPT-5.5, GPT-5.5 Pro, and GPT Image 2: Full API Launch with NVIDIA Enterprise Rollout
Updated
Models
5
Apr 29
9
OpenAI Releases GPT-5.5, GPT-5.5 Pro, and GPT Image 2: Full API Launch with NVIDIA Enterprise Rollout
Top
Models
· 5 srcs · Apr 29
Discuss
6
AGENTS.md Quality Determines AI Coding Agent Performance in Monorepos
Research
1
Apr 23
6
AGENTS.md Quality Determines AI Coding Agent Performance in Monorepos
Research
· 1 src · Apr 23
Discuss
7
Study: Better AI Models Drive More Ambitious Developer Work, Not Just More of the Same
Research
1
Apr 20
7
Study: Better AI Models Drive More Ambitious Developer Work, Not Just More of the Same
Research
· 1 src · Apr 20
Discuss
7
Windsurf 2.0: Agent Command Center and Native Devin Integration Unify Local and Cloud Coding Agents
Products
1
Apr 17
7
Windsurf 2.0: Agent Command Center and Native Devin Integration Unify Local and Cloud Coding Agents
Products
· 1 src · Apr 17
Discuss
7
Factory Raises $150M at $1.5B Valuation for Enterprise AI Coding Agents
Markets
1
Apr 17
7
Factory Raises $150M at $1.5B Valuation for Enterprise AI Coding Agents
Markets
· 1 src · Apr 17
Discuss
6
OpenAI Cookbook: Sandboxed Agents for Codebase Migration
Products
1
Apr 17
6
OpenAI Cookbook: Sandboxed Agents for Codebase Migration
Products
· 1 src · Apr 17
Discuss
6
AI Coding Agents Are Flooding Open Source With Low-Quality PRs — One Team's Fix
Products
1
Apr 17
6
AI Coding Agents Are Flooding Open Source With Low-Quality PRs — One Team's Fix
Products
· 1 src · Apr 17
Discuss
6
Qwen3.6 Released with Agentic Coding and Thinking Preservation Features
Models
1
Apr 17
6
Qwen3.6 Released with Agentic Coding and Thinking Preservation Features
Models
· 1 src · Apr 17
Discuss
7
Claude Code Regression Debate: Nerfed or Just a Black Box?
Products
3
Apr 16
7
Claude Code Regression Debate: Nerfed or Just a Black Box?
Products
· 3 srcs · Apr 16
Discuss
Yesterday
6
Uber Caps Employee AI Spending After Blowing Annual Budget in Four Months
Enterprise
1
11h ago
6
Uber Caps Employee AI Spending After Blowing Annual Budget in Four Months
Enterprise
· 1 src · 11h ago
Discuss
Monday
7
xAI Releases Grok Build 0.1 Agentic Coding Model via API in Public Beta
Updated
Models
2
19h ago
7
xAI Releases Grok Build 0.1 Agentic Coding Model via API in Public Beta
Models
· 2 srcs · 19h ago
Discuss
6
ECC v2.0.0-rc.1: Cross-Harness AI Agent Operator System Releases
Open Source
1
1d ago
6
ECC v2.0.0-rc.1: Cross-Harness AI Agent Operator System Releases
Open Source
· 1 src · 1d ago
Discuss
Last Week
7
Hidden Prompt Injection in jqwik Targeted AI Coding Agents to Delete Tests and Code
Security
1
4d ago
7
Hidden Prompt Injection in jqwik Targeted AI Coding Agents to Delete Tests and Code
Security
· 1 src · 4d ago
Discuss
6
Claude Code: Undocumented Configuration Capabilities Found in npm Source Code
Security
1
3d ago
6
Claude Code: Undocumented Configuration Capabilities Found in npm Source Code
Security
· 1 src · 3d ago
Discuss
7
Microsoft to Launch New AI Coding Model Family at Build
Models
1
4d ago
7
Microsoft to Launch New AI Coding Model Family at Build
Models
· 1 src · 4d ago
Discuss
9
Claude Code Launches Dynamic Workflows with Parallel Subagents
Products
1
5d ago
9
Claude Code Launches Dynamic Workflows with Parallel Subagents
Top
Products
· 1 src · 5d ago
Discuss
8
Anthropic Releases Opus 4.8 with Dynamic Workflows for Large-Scale Agentic Tasks
Models
3
5d ago
8
Anthropic Releases Opus 4.8 with Dynamic Workflows for Large-Scale Agentic Tasks
Top
Models
· 3 srcs · 5d ago
Discuss
6
Anthropic and OpenAI Shift Enterprise Customers to API-Aligned Pricing as Agent Usage Surges
Enterprise
1
5d ago
6
Anthropic and OpenAI Shift Enterprise Customers to API-Aligned Pricing as Agent Usage Surges
Enterprise
· 1 src · 5d ago
Discuss
8
Cognition Raises $1B at $26B Valuation as Devin Hits $492M ARR
Updated
Markets
3
4d ago
8
Cognition Raises $1B at $26B Valuation as Devin Hits $492M ARR
Top
Markets
· 3 srcs · 4d ago
Discuss
7
DeepSWE: Contamination-Free Benchmark for Long-Horizon Coding Agents
Research
1
6d ago
7
DeepSWE: Contamination-Free Benchmark for Long-Horizon Coding Agents
Research
· 1 src · 6d ago
Discuss
6
Research: LLM Coding Agents Degrade Sharply Under Structural Constraints
Research
1
May 25
6
Research: LLM Coding Agents Degrade Sharply Under Structural Constraints
Research
· 1 src · May 25
Discuss
2 Weeks Ago
6
Developer Builds 130K-Line Rust Consensus Engine with AI Agents in 6 Weeks
Products
1
May 21
6
Developer Builds 130K-Line Rust Consensus Engine with AI Agents in 6 Weeks
Products
· 1 src · May 21
Discuss
7
GitHub Spec Kit: AI Plans Before Coding, Reaches 95K Stars
Open Source
1
May 20
7
GitHub Spec Kit: AI Plans Before Coding, Reaches 95K Stars
Open Source
· 1 src · May 20
Discuss
8
Google Antigravity 2.0: Desktop App, CLI, and SDK Launched at IO 2026
Products
3
May 19
8
Google Antigravity 2.0: Desktop App, CLI, and SDK Launched at IO 2026
Top
Products
· 3 srcs · May 19
Discuss
7
Cursor Releases Composer 2.5 with Novel RL Training and SpaceXAI Compute Partnership
Products
1
May 19
7
Cursor Releases Composer 2.5 with Novel RL Training and SpaceXAI Compute Partnership
Products
· 1 src · May 19
Discuss
7
Google Android CLI 1.0: AI Agents Can Now Build Android Apps
Products
1
May 19
7
Google Android CLI 1.0: AI Agents Can Now Build Android Apps
Products
· 1 src · May 19
Discuss
7
Cerebras Runs Kimi K2.6 at 981 Tokens/sec — 29x Faster Than Official Endpoint
Infra
1
May 19
7
Cerebras Runs Kimi K2.6 at 981 Tokens/sec — 29x Faster Than Official Endpoint
Infra
· 1 src · May 19
Discuss
7
OpenAI Codex Expanding to Remote Computer Use on Locked Macs and Multi-Device Control
Products
1
May 18
7
OpenAI Codex Expanding to Remote Computer Use on Locked Macs and Multi-Device Control
Products
· 1 src · May 18
Discuss
3 Weeks Ago
7
xAI Launches Grok Build Terminal Coding Agent in Early Beta
Updated
Products
2
May 15
7
xAI Launches Grok Build Terminal Coding Agent in Early Beta
Products
· 2 srcs · May 15
Discuss
6
Cursor Launches Cloud Agent Development Environment Tools for Enterprise Teams
Products
1
May 15
6
Cursor Launches Cloud Agent Development Environment Tools for Enterprise Teams
Products
· 1 src · May 15
Discuss
6
Claude Code Best Practices for Enterprise-Scale Large Codebases
Products
1
May 15
6
Claude Code Best Practices for Enterprise-Scale Large Codebases
Products
· 1 src · May 15
Discuss
6
OpenAI Codex: GPT-5.2-Codex Hits Responses API with New Automation and Customization Features
Models
1
May 15
6
OpenAI Codex: GPT-5.2-Codex Hits Responses API with New Automation and Customization Features
Models
· 1 src · May 15
Discuss
7
Cline Releases Open-Source Agent Runtime SDK for Coding Agents
Open Source
1
May 14
7
Cline Releases Open-Source Agent Runtime SDK for Coding Agents
Open Source
· 1 src · May 14
Discuss
6
OpenAI Brings Codex Mobile Monitoring to iOS and Android
Updated
Products
3
May 16
6
OpenAI Brings Codex Mobile Monitoring to iOS and Android
Products
· 3 srcs · May 16
Discuss
7
LangSmith Sandboxes GA: MicroVM Isolation for Agent Code Execution
Security
1
May 13
7
LangSmith Sandboxes GA: MicroVM Isolation for Agent Code Execution
Security
· 1 src · May 13
Discuss
6
Parameter Golf ML Challenge: Lessons from 2,000 Submissions
Research
1
May 13
6
Parameter Golf ML Challenge: Lessons from 2,000 Submissions
Research
· 1 src · May 13
Discuss
7
AutoTTS: Agentic Framework Auto-Discovers LLM Test-Time Scaling Strategies
Research
1
May 12
7
AutoTTS: Agentic Framework Auto-Discovers LLM Test-Time Scaling Strategies
Research
· 1 src · May 12
Discuss
7
Claude Code: Agent View Centralizes Multi-Session Management
Products
1
May 12
7
Claude Code: Agent View Centralizes Multi-Session Management
Products
· 1 src · May 12
Discuss
6
AI Coding Proficiency Is Shifting Language Choice Away From Python Toward Systems Languages
Products
1
May 12
6
AI Coding Proficiency Is Shifting Language Choice Away From Python Toward Systems Languages
Products
· 1 src · May 12
Discuss
7
Open SWE: Open-Source Framework for Internal Coding Agents
Open Source
1
May 11
7
Open SWE: Open-Source Framework for Internal Coding Agents
Open Source
· 1 src · May 11
Discuss
Last Month
7
Codex CLI /goal Feature Enables Persistent, Resumable AI Coding Sessions
Products
1
May 8
7
Codex CLI /goal Feature Enables Persistent, Resumable AI Coding Sessions
Products
· 1 src · May 8
Discuss
6
Google Tests Screen Sharing and Custom Agents in Antigravity IDE
Products
1
May 7
6
Google Tests Screen Sharing and Custom Agents in Antigravity IDE
Products
· 1 src · May 7
Discuss
6
Every CEO Reports Switching from Claude Code to OpenAI Codex After GPT-5.5
Products
1
May 7
6
Every CEO Reports Switching from Claude Code to OpenAI Codex After GPT-5.5
Products
· 1 src · May 7
Discuss
6
DeepClaude: Claude Code Agent Loop Powered by DeepSeek V4 Pro
Open Source
1
May 4
6
DeepClaude: Claude Code Agent Loop Powered by DeepSeek V4 Pro
Open Source
· 1 src · May 4
Discuss
6
Cursor Details Agent Harness Engineering: From Static Guardrails to Dynamic Context
Products
1
May 1
6
Cursor Details Agent Harness Engineering: From Static Guardrails to Dynamic Context
Products
· 1 src · May 1
Discuss
Filters
Signal
Title
Category
Sources
Posted
Discuss