6 24

Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, ML4Science

Recent Activity

liked a model about 8 hours ago

mistralai/Mistral-Medium-3.5-128B

liked a model 8 days ago

Qwen/Qwen3.6-35B-A3B

liked a dataset 4 months ago

ILSVRC/imagenet-1k

View all activity

Organizations

None yet

liked a model about 8 hours ago

mistralai/Mistral-Medium-3.5-128B

128B • Updated 25 days ago • 324k • 344

liked a model 8 days ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 5.74M • • 1.94k

liked a dataset 4 months ago

ILSVRC/imagenet-1k

Viewer • Updated Sep 17, 2025 • 1.43M • 80.3k • 814

liked a dataset 11 months ago

LEAP/ClimSim_high-res

Updated Sep 29, 2023 • 2.42k • 12

upvoted an article 11 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo

•

Dec 19, 2024

• 742

liked a dataset about 1 year ago

mcherukara/PtychoNN_data

Updated Mar 18, 2025 • 65 • 2

liked 2 models about 1 year ago

allenai/ACE2-ERA5

Updated Apr 14 • 220 • 18

microsoft/aurora

Updated Jun 20, 2025 • 54

upvoted an article about 1 year ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

sirluk

•

Oct 7, 2024

• 71

liked a Space about 1 year ago

Memory Viz

🧠

Memory Viz

liked 2 Spaces over 1 year ago

Predict Memory

🧮

109

Estimate model memory usage and see detailed plots

The Ultra-Scale Playbook

🌌

3.86k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article over 1 year ago

Article

Open-R1: Update #1

open-r1

•

Feb 2, 2025

• 305

liked 2 datasets over 1 year ago

PleIAs/common_corpus

Viewer • Updated 24 days ago • 69.9k • 155k • 400

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 635k • 1.1k

liked 3 models over 1 year ago

upvoted a collection over 1 year ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 159

liked a model over 1 year ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15, 2025 • 2.21M • 1.05k

Thomas Bouvier

AI & ML interests

Recent Activity

Organizations

tbouvier's activity

Finally, a Replacement for BERT: Introducing ModernBERT

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Memory Viz

Predict Memory

The Ultra-Scale Playbook

Open-R1: Update #1