Goblin News
AI news by promptgoblins.ai
Filtered by: safety
April
8 · Harvard Study: OpenAI's o1 Outperforms Doctors in Emergency Triage Diagnoses
Research · 2 srcs · May 3 · Top · Updated
6 · Reid Hoffman: Doctors Not Using AI Second Opinions Border on Malpractice
Enterprise · 1 src · Apr 30
9 · Pentagon Requests $54bn for Autonomous Drone Warfare in Largest Military AI Commitment in History
Policy · 1 src · Apr 22 · Top
7 · Research Finds LLMs Suppress Charged Words at Pretrain Level, Before Safety Tuning
Research · 1 src · Apr 21
6 · ChatGPT Mirrors Human Aggression in Prolonged Conflict, Lancaster Study Finds
Research · 1 src · Apr 21
8 · Anthropic and Trump Administration Hold High-Level Talks Despite Pentagon Dispute
Policy · 1 src · Apr 18 · Top
8 · Florida AG Escalates OpenAI Probe to Criminal Investigation Over FSU Shooting
Policy · 2 srcs · Apr 21 · Top · Updated
9 · AI Offensive Cyber Capabilities Doubling Every 5-10 Months, New Research Finds
Security · 1 src · Apr 6 · Top
9 · New Yorker Reveals Sutskever Memos Alleging Altman Lied to OpenAI Board
Safety · 4 srcs · Apr 11 · Top · Updated
7 · Anthropic Research Argues Anthropomorphizing AI Can Improve Safety
Safety · 1 src · Apr 5
6 · Microsoft Copilot Terms Label It 'For Entertainment Purposes Only'
Products · 1 src · Apr 5
6 · AI Is Making College Students Sound the Same, Researchers Warn
Safety · 1 src · Apr 4
8 · OpenAI Covertly Funded Coalition Behind AI Age Verification Bill
Policy · 2 srcs · Apr 4 · Top · Updated
March
6 · ArXiv Paper Reframes AI Alignment as a Societal-Systems Problem
Research · 1 src · Mar 30
6 · AI and Cognitive Development: Why Children Face Greater Risk Than Adults
Safety · 1 src · Mar 28
8 · AI Scheming in the Wild: 700 Real-World Cases Found, Five-Fold Rise in Six Months
Safety · 3 srcs · Apr 3 · Top · Updated
6 · Kagi Translate Goes Viral for LLM 'Language' Loophole
Safety · 1 src · Mar 19
6 · Pew and Common Sense Media: Parents Sharply Underestimate Teen AI Use
Policy · 1 src · Mar 18
8 · OpenAI Indefinitely Shelves ChatGPT Adult Mode After Safety Opposition and Strategy Pivot
Safety · 3 srcs · Mar 26 · Top · Updated
9 · Lawyer Warns AI Chatbots Are Fueling Mass Casualty Violence
Safety · 1 src · Mar 15 · Top
7 · VA Plans AI Fraud Scanning of Veteran Disability Claims, Critics Warn of Harm
Policy · 1 src · Mar 15
7 · AI Chatbots Reshaping Human Thought and Opinion at Scale
Research · 1 src · Mar 13
Yesterday
6 · Atlantic Op-Ed: AI's Moral Crisis Demands Theological, Not Just Utilitarian, Framing
Safety · 1 src · 22h ago
2 Weeks Ago
7 · LinkedIn Targets AI-Generated Slop With New Detection Systems
Products · 1 src · May 20
3 Weeks Ago
6 · Opinion: Why Technical Experts Keep Mistaking AI Outputs for Consciousness
Safety · 1 src · May 15
8 · Ontario Audit: All 20 Approved AI Medical Scribes Failed Accuracy Tests
Safety · 3 srcs · May 16 · Top · Updated
Last Month
8 · Anthropic: Teaching Claude Why Fixes Agentic Misalignment
Research · 3 srcs · May 8 · Top
6 · Nick Bostrom Argues AI Risk Worth Taking for Longevity Gains
Safety · 1 src · May 8
8 · Anthropic: Natural Language Autoencoders Convert Model Activations to Readable Text
Research · 1 src · May 7
7 · OpenAI Launches 'Trusted Contact' Self-Harm Alert Feature for ChatGPT
Safety · 1 src · May 7
8 · AI Chatbots Told Users They Were Sentient, Triggering Delusional Episodes
Safety · 2 srcs · May 3 · Top
Signal
Title
Category
Sources
Posted