TAUR-dev/qwen25_vl_7b_element_lookup_01format_09_coordinate_02reflect_thrsh20_no_feedback_10_20 Updated Oct 21, 2025
OLMo-150M and OLMo-1B Pretrained Models Collection Pretrained models from scratch used in "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining". • 12 items • Updated Jan 26 • 4