Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepSeek on AWS #2632

Merged
merged 9 commits into from
Jan 30, 2025
Merged

DeepSeek on AWS #2632

merged 9 commits into from
Jan 30, 2025

Conversation

pagezyhf
Copy link
Contributor

WIP document on how to deploy and fine tune DeepSeek R1 models on AWS

Outstanding items:

  • EC2 deployment on inferentia end up in timeout before succeeding cc @dacorvo
  • I need to do a thumbnail
  • IE deployment on Inferentia
  • Add a link to a notebook for fine tuning deepseek models in sagemaker cc @fgbelidji ? Or maybe wait for an official HF container?

cc @jeffboudier for viz and review

@dacorvo
Copy link
Contributor

dacorvo commented Jan 30, 2025

For the time-out of the 70B model, it may be related to a temporary slowdown of the hub: several CI jobs failed approximately at the same time because the models could not be fetched from the hub.

Copy link
Contributor

@dacorvo dacorvo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Apart from my small comment, this looks good to me, thanks !

Copy link
Member

@jeffboudier jeffboudier left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Love it, excited about this blog post!

@pagezyhf pagezyhf merged commit 7d85fdd into main Jan 30, 2025
1 check passed
@pagezyhf pagezyhf deleted the deepseek-aws branch January 30, 2025 17:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants