Sftconfig documentation. 6. DPOTrainer(model: Module, ref_model: Module | None, optimizer...