@xu-song
Contributor

@xu-song xu-song commented Jun 20, 2024

DatasetInfoHook raises an error during DPO training.

A DPO dataset provides chosen_ids and rejected_ids instead of input_ids, so this lookup in the hook fails:

input_ids = dataset[0]['input_ids']
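
The mismatch can be illustrated with a minimal sketch. It assumes samples are plain dicts; the helper name `collect_token_id_fields` and both sample dicts are hypothetical, and the code is not taken from the merged change. It only shows one way a hook could accept either field layout:

```python
def collect_token_id_fields(sample):
    """Return every token-id field found in a sample, keyed by field name.

    Hypothetical helper for illustration only. SFT-style samples are assumed
    to carry `input_ids`; DPO-style samples are assumed to carry
    `chosen_ids` and `rejected_ids`.
    """
    candidate_keys = ("input_ids", "chosen_ids", "rejected_ids")
    found = {key: sample[key] for key in candidate_keys if key in sample}
    if not found:
        # Fail with a message that names the fields that were expected.
        raise KeyError(f"none of {candidate_keys} present in sample")
    return found


# Made-up samples that mimic the two layouts described above.
sft_sample = {"input_ids": [1, 2, 3]}
dpo_sample = {"chosen_ids": [1, 2, 3], "rejected_ids": [1, 2, 4]}

for name, sample in (("sft", sft_sample), ("dpo", dpo_sample)):
    for key, ids in collect_token_id_fields(sample).items():
        print(f"{name}: {key} has {len(ids)} tokens")
```

With this shape, the hook can log whichever fields exist rather than indexing `input_ids` unconditionally.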

To Reproduce

This reproduces with any DPO training config, such as internlm2_chat_1_8b_dpo_full.

@pppppM pppppM requested a review from HIT-cwh July 9, 2024 00:44
@HIT-cwh
Collaborator

HIT-cwh commented Jul 11, 2024

Thanks a lot!

@HIT-cwh HIT-cwh merged commit b92481f into InternLM:main Jul 11, 2024
HAOCHENYE pushed a commit that referenced this pull request Sep 8, 2025
* [Feature] Support the DatasetInfoHook of DPO training

* fix yapf check