aayush garg
garg-aayush
AI & ML interests
None yet
Organizations
Training-LLMs
- Running on CPU UpgradeFeatured3.2k
The Smol Training Playbook
📚3.2kThe secrets to building world-class LLMs
- Running3.86k
The Ultra-Scale Playbook
🌌3.86kThe ultimate guide to training LLM on large GPU Clusters
- RunningFeatured1.35k
FineWeb: decanting the web for the finest text data at scale
🍷1.35kExplore and download the FineWeb web‑scale text dataset
- Running224
FineVision: Open Data is All You Need
📝224A new open-source dataset for training VLMs
RLHF Papers
-
Proximal Policy Optimization Algorithms
Paper • 1707.06347 • Published • 11 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 66 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 145 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 452
Llama papers and reports
List of papers and reports related to llama models
LLM Tech Reports
RLHF Papers
-
Proximal Policy Optimization Algorithms
Paper • 1707.06347 • Published • 11 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 66 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 145 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 452
Training-LLMs
- Running on CPU UpgradeFeatured3.2k
The Smol Training Playbook
📚3.2kThe secrets to building world-class LLMs
- Running3.86k
The Ultra-Scale Playbook
🌌3.86kThe ultimate guide to training LLM on large GPU Clusters
- RunningFeatured1.35k
FineWeb: decanting the web for the finest text data at scale
🍷1.35kExplore and download the FineWeb web‑scale text dataset
- Running224
FineVision: Open Data is All You Need
📝224A new open-source dataset for training VLMs
Llama papers and reports
List of papers and reports related to llama models