Post-Training Methods for Language Models
Post-training adapts language models for specific, safe, and practical uses. This overview highlights key methods and the open-source training_hub library.
Customize reasoning models with synthetic data generation for enterprise deployment. Learn techniques from Red Hat's AI Innovation Team.
Discover inference-time scaling techniques that improve AI quality and reliability for enterprise applications beyond just speed optimization.
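For readers new to the idea, the snippet below is a minimal sketch of one common inference-time scaling pattern, best-of-N sampling: spend extra compute at inference by drawing several candidate answers and keeping the highest-scoring one. The `generate` and `score` callables are hypothetical placeholders (e.g., a model call and a reward model or verifier), not APIs from the post.

```python
# Illustrative best-of-N inference-time scaling sketch (hypothetical helpers).
from typing import Callable, List

def best_of_n(prompt: str,
              generate: Callable[[str], str],
              score: Callable[[str, str], float],
              n: int = 8) -> str:
    """Sample n candidate answers and return the one the scorer ranks highest."""
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    # In practice, `score` would come from a reward model or a task-specific verifier.
    return max(candidates, key=lambda c: score(prompt, c))
```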
Introducing Async-GRPO - an open-source library for scalable reinforcement learning with 42% efficiency gains over VERL and 11x over TRL for GRPO training.
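As a quick refresher on the objective that GRPO training optimizes, the sketch below computes group-relative advantages: each rollout's reward is normalized against the mean and standard deviation of the other rollouts for the same prompt, so no separate critic network is needed. This is a generic illustration, not code from the Async-GRPO library.

```python
# Generic GRPO-style advantage computation (illustrative, not Async-GRPO code).
import numpy as np

def group_relative_advantages(rewards: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """rewards has shape (num_prompts, rollouts_per_prompt).

    Each rollout is scored relative to the other rollouts generated for the
    same prompt, which replaces a learned value function in GRPO.
    """
    mean = rewards.mean(axis=1, keepdims=True)
    std = rewards.std(axis=1, keepdims=True)
    return (rewards - mean) / (std + eps)
```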
Learn how our adaptive SVD method enables continual learning in LLMs with near-zero catastrophic forgetting, achieving 7% higher accuracy than baselines.
Understanding the distinction between reasoning and inference-time scaling in LLMs - insights from our R1 reproduction experiments.
Second update on R1 reasoning research - new results on training small LLMs with synthetic reasoning data and particle filtering methods.
First update on R1-like reasoning experiments - Granite models show significant gains with particle filtering and new data quality experiments.
Learn how to reproduce R1-like reasoning in small LLMs using particle filtering, synthetic data, and GRPO - achieving GPT-4o accuracy with only 4 rollouts.
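To make the particle-filtering idea concrete, here is a toy sketch: partial reasoning traces ("particles") are extended step by step and resampled in proportion to a reward score, so inference compute concentrates on promising paths. The `extend` and `reward` callables are stand-ins for a generator and a process or outcome reward model, and are assumptions for illustration rather than the method's actual implementation.

```python
# Toy particle filtering over reasoning traces (hypothetical extend/reward helpers).
import random
from typing import Callable, List

def particle_filter_decode(prompt: str,
                           extend: Callable[[str], str],    # appends one reasoning step
                           reward: Callable[[str], float],  # scores a partial trace
                           num_particles: int = 4,
                           num_steps: int = 8) -> str:
    """Extend, weight, and resample partial generations; return the best final trace."""
    particles: List[str] = [prompt] * num_particles
    for _ in range(num_steps):
        particles = [extend(p) for p in particles]
        weights = [max(reward(p), 1e-6) for p in particles]
        # Resample in proportion to reward so weak traces are dropped early.
        particles = random.choices(particles, weights=weights, k=num_particles)
    return max(particles, key=reward)
```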