DataFlex: A Drop-In Upgrade That Makes LLM Training Data-Aware

Peking University’s DataFlex framework, built on LLaMA-Factory, adds dynamic data selection, mixture tuning, and reweighting to LLM training. It reached #1 on HuggingFace Daily Papers and has 160 GitHub stars.
artificial-intelligence
Author

Kabui, Charles

Published

2026-04-06

Keywords

dataflex, llm-training, data-centric-ai, llama-factory, dynamic-data-optimization