fanqiNO1 (Contributor) commented Sep 12, 2024

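If the LLM is too large to fit on a single GPU, we need to pass `device_map='auto'` when loading it so that the model is spread across the available devices instead of triggering an out-of-memory (OOM) error.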

Related issue: #715.
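For illustration, here is a minimal sketch of the idea, not the actual diff in this PR. It assumes the model is loaded through Hugging Face `transformers` (with `accelerate` installed, which `device_map` requires). The helper names `build_load_kwargs` and `load_llm`, and the size-based check, are hypothetical and only there to show where the argument goes.

```python
from typing import Any, Dict


def build_load_kwargs(model_bytes: int, gpu_bytes: int) -> Dict[str, Any]:
    """Hypothetical helper: pick loading kwargs based on a rough size estimate."""
    kwargs: Dict[str, Any] = {}
    if model_bytes > gpu_bytes:
        # The weights do not fit on one GPU, so let the loader decide
        # how to place layers across the visible devices.
        kwargs["device_map"] = "auto"
    return kwargs


def load_llm(model_name: str, model_bytes: int, gpu_bytes: int):
    """Hypothetical loader: forwards the kwargs above to from_pretrained."""
    # Imported lazily so build_load_kwargs stays usable without transformers.
    from transformers import AutoModelForCausalLM

    kwargs = build_load_kwargs(model_bytes, gpu_bytes)
    return AutoModelForCausalLM.from_pretrained(model_name, **kwargs)
```

With `device_map='auto'` the model is placed at load time, so the caller should not move it again with `.to(device)` afterwards.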
