kminhta commented Nov 22, 2023

This PR integrates Intel Extension for PyTorch (IPEX) into TGIS so that users can apply its transformers optimizations to maximize performance on Intel CPUs.

The changes are to the Dockerfile, plus an additional IPEX deployment framework that applies these optimizations.

Signed-off-by: kta-intel <[email protected]>
kminhta marked this pull request as draft November 30, 2023 14:04
kminhta marked this pull request as ready for review January 17, 2024 16:27

kminhta commented Jan 17, 2024

Note: this PR comments out the nightly build because the IPEX version must align with the torch version (i.e. stable torch 2.1 with stable IPEX 2.1).
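
For illustration only, the alignment requirement could be checked at startup with something like the sketch below. It is not part of this PR. The helper names `major_minor` and `versions_aligned` are hypothetical, and the sketch assumes that "aligned" means the major and minor components match, as in the torch 2.1 / IPEX 2.1 example above.

```python
import re


def major_minor(version: str) -> tuple[int, int]:
    """Return (major, minor) from a version string such as '2.1.0+cpu'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def versions_aligned(torch_version: str, ipex_version: str) -> bool:
    """Assumption: versions are 'aligned' when major.minor are equal."""
    return major_minor(torch_version) == major_minor(ipex_version)


if __name__ == "__main__":
    # Example version strings, chosen for illustration only.
    print(versions_aligned("2.1.0", "2.1.100+cpu"))
    print(versions_aligned("2.2.0.dev20231120", "2.1.0"))
```

In a real image the two strings would presumably come from the installed packages (for example `torch.__version__`), so that a nightly torch build paired with a stable IPEX release fails fast instead of breaking at import time.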

JRosenkranz pushed a commit to JRosenkranz/text-generation-inference-server that referenced this pull request Jul 10, 2024
… padding (IBM#16)

AFAIK there is no torch device type called "gpu".