Goblin
News
AI news by
promptgoblins.ai
|
News
About
News
About
Filtered by:
fine-tuning
Clear
Titles
Summaries
April
6
AWS Reinforcement Fine-Tuning with LLM-as-a-Judge Using Amazon Nova Models
Products
1
Apr 30
6
AWS Reinforcement Fine-Tuning with LLM-as-a-Judge Using Amazon Nova Models
Products
· 1 src · Apr 30
Discuss
7
Vision Banana: Image Generation Pretraining Achieves SOTA on Diverse Vision Tasks
Research
1
Apr 27
7
Vision Banana: Image Generation Pretraining Achieves SOTA on Diverse Vision Tasks
Research
· 1 src · Apr 27
Discuss
7
AI2 Introduces BAR: Modular Post-Training via Branch-Adapt-Route
Research
1
Apr 21
7
AI2 Introduces BAR: Modular Post-Training via Branch-Adapt-Route
Research
· 1 src · Apr 21
Discuss
9
Nature Study: LLMs Transmit Hidden Behavioral Traits to Student Models via Semantically Unrelated Training Data
Research
1
Apr 16
9
Nature Study: LLMs Transmit Hidden Behavioral Traits to Student Models via Semantically Unrelated Training Data
Top
Research
· 1 src · Apr 16
Discuss
7
OpenAI Launches GPT-Rosalind, a Biology-Specialized LLM for Drug Discovery and Genomics
Models
1
Apr 16
7
OpenAI Launches GPT-Rosalind, a Biology-Specialized LLM for Drug Discovery and Genomics
Models
· 1 src · Apr 16
Discuss
6
Data Pruning at Training Time Boosts LLM Fact Memorization by 1.3X
Research
1
Apr 14
6
Data Pruning at Training Time Boosts LLM Fact Memorization by 1.3X
Research
· 1 src · Apr 14
Discuss
6
Sol-RL Achieves 2.4x Faster Diffusion Model RL Training via FP4/BF16 Two-Stage Design
Research
1
Apr 10
6
Sol-RL Achieves 2.4x Faster Diffusion Model RL Training via FP4/BF16 Two-Stage Design
Research
· 1 src · Apr 10
Discuss
7
Amazon Bedrock Adds Fine-Tuning Support for Nova Models
Products
1
Apr 8
7
Amazon Bedrock Adds Fine-Tuning Support for Nova Models
Products
· 1 src · Apr 8
Discuss
6
Fujitsu OneComp: Open-Source LLM Quantization Library with Novel QEP Method
Open Source
1
Apr 3
6
Fujitsu OneComp: Open-Source LLM Quantization Library with Novel QEP Method
Open Source
· 1 src · Apr 3
Discuss
March
6
Agent Labs: Vertical Model Training vs. Agent Engineering as Competing Strategies
Products
1
Mar 31
6
Agent Labs: Vertical Model Training vs. Agent Engineering as Competing Strategies
Products
· 1 src · Mar 31
Discuss
6
AI Application Companies Go Full-Stack via Vertical Integration
Enterprise
1
Mar 31
6
AI Application Companies Go Full-Stack via Vertical Integration
Enterprise
· 1 src · Mar 31
Discuss
6
AWS Bedrock Adds Reinforcement Fine-Tuning with OpenAI-Compatible API Support
Updated
Products
2
Apr 8
6
AWS Bedrock Adds Reinforcement Fine-Tuning with OpenAI-Compatible API Support
Products
· 2 srcs · Apr 8
Discuss
6
Artificial Genius deploys deterministic LLM architecture on Amazon Nova to eliminate hallucinations
Enterprise
1
Mar 23
6
Artificial Genius deploys deterministic LLM architecture on Amazon Nova to eliminate hallucinations
Enterprise
· 1 src · Mar 23
Discuss
7
Google DeepMind Algorithm Achieves 10x RLHF Data Efficiency
Research
1
Mar 20
7
Google DeepMind Algorithm Achieves 10x RLHF Data Efficiency
Research
· 1 src · Mar 20
Discuss
6
IBM Research Releases Mellea 0.4.0 and Three Granite LoRA Libraries
Open Source
1
Mar 20
6
IBM Research Releases Mellea 0.4.0 and Three Granite LoRA Libraries
Open Source
· 1 src · Mar 20
Discuss
6
NVIDIA Recipe: Domain-Specific Embedding Models in Under a Day
Open Source
1
Mar 20
6
NVIDIA Recipe: Domain-Specific Embedding Models in Under a Day
Open Source
· 1 src · Mar 20
Discuss
6
AWS Nova Forge SDK: LLM Customization via SFT+RFT Pipeline
Products
2
Mar 19
6
AWS Nova Forge SDK: LLM Customization via SFT+RFT Pipeline
Products
· 2 srcs · Mar 19
Discuss
7
Unsloth Studio: Open-Source No-Code Local AI Training and Inference UI
Open Source
1
Mar 18
7
Unsloth Studio: Open-Source No-Code Local AI Training and Inference UI
Open Source
· 1 src · Mar 18
Discuss
6
Why AI Still Cannot Write Well, Despite Ingesting All Literature
Research
1
Mar 17
6
Why AI Still Cannot Write Well, Despite Ingesting All Literature
Research
· 1 src · Mar 17
Discuss
6
AWS, NVIDIA & Heidi Fine-Tune Medical ASR Model for Clinical Domain Adaptation
Enterprise
1
Mar 13
6
AWS, NVIDIA & Heidi Fine-Tune Medical ASR Model for Clinical Domain Adaptation
Enterprise
· 1 src · Mar 13
Discuss
6
Open Weights vs. Open Training: Fine-Tuning Large MoE Models Remains Practically Inaccessible
Research
1
Mar 13
6
Open Weights vs. Open Training: Fine-Tuning Large MoE Models Remains Practically Inaccessible
Research
· 1 src · Mar 13
Discuss
Last Week
6
NVIDIA Nemotron 3.5 ASR: One Model for 40 Languages With Fine-Tuning Guide
Models
1
6d ago
6
NVIDIA Nemotron 3.5 ASR: One Model for 40 Languages With Fine-Tuning Guide
Models
· 1 src · 6d ago
Discuss
9
Microsoft AI Launches 7-Model MAI Family and Declares Itself a Superintelligence Lab
Models
4
Jun 3
9
Microsoft AI Launches 7-Model MAI Family and Declares Itself a Superintelligence Lab
Top
Models
· 4 srcs · Jun 3
Discuss
2 Weeks Ago
6
Tether Enables BitNet LLM Fine-Tuning on Consumer Mobile Devices
Infra
1
May 28
6
Tether Enables BitNet LLM Fine-Tuning on Consumer Mobile Devices
Infra
· 1 src · May 28
Discuss
3 Weeks Ago
6
Hugging Face Guide: Fine-Tuning NVIDIA Cosmos Predict 2.5 for Robot Video Generation
Products
1
May 18
6
Hugging Face Guide: Fine-Tuning NVIDIA Cosmos Predict 2.5 for Robot Video Generation
Products
· 1 src · May 18
Discuss
Last Month
7
Adaption Launches AutoScientist to Automate AI Self-Training
Products
1
May 13
7
Adaption Launches AutoScientist to Automate AI Self-Training
Products
· 1 src · May 13
Discuss
7
RL Fine-Tuning Enables Small 4B Models to Match Large LLMs as Recursive Agents
Research
1
May 13
7
RL Fine-Tuning Enables Small 4B Models to Match Large LLMs as Recursive Agents
Research
· 1 src · May 13
Discuss
6
Parameter Golf ML Challenge: Lessons from 2,000 Submissions
Research
1
May 13
6
Parameter Golf ML Challenge: Lessons from 2,000 Submissions
Research
· 1 src · May 13
Discuss
6
CyberSecQwen-4B: A 4B Specialist Model for Local Defensive Security
Open Source
1
May 8
6
CyberSecQwen-4B: A 4B Specialist Model for Local Defensive Security
Open Source
· 1 src · May 8
Discuss
6
Unsloth + NVIDIA Collaboration Cuts LLM Training Time by ~25%
Infra
1
May 7
6
Unsloth + NVIDIA Collaboration Cuts LLM Training Time by ~25%
Infra
· 1 src · May 7
Discuss
Filters
Signal
Title
Category
Sources
Posted
Discuss