RFC: Improve developer experience by anchoring on multimodal use-case #8485
Replies: 8 comments
-
Another success story for iOS specifically (and hopefully for Android too) should look roughly like on this video, where the clients could just add some |
Beta Was this translation helpful? Give feedback.
-
yeah, that's cool. two additional comments:
|
Beta Was this translation helpful? Give feedback.
-
It's great to have this kind of experience in general, but we may need to think more on how the framework can really help. For multimodal we need to learn more on the common pattern. Note that different components may not work together directly out of box due to:
|
Beta Was this translation helpful? Give feedback.
-
I think it would be great if the issue/pain points, solution space and potential way to validate this actually came from users a level higher than framework devs. My fear is that we will do incremental improvements that aesthetically please us based on our own experience. Even if such issues and/or solution space is not driven by other users or product engineers, such personas have to be close part of iterating over solution space. Maybe this would be people from Paris hackathon. |
Beta Was this translation helpful? Give feedback.
-
@kimishpatel I imagine if for users it looks similar to HF transformers or OpenAI API, that should be good enough? |
Beta Was this translation helpful? Give feedback.
-
WHat is OpenAI API? Why do you believe users similar to HF transformers covers spans of users that interact with torchchat/ET? Any examples? |
Beta Was this translation helpful? Give feedback.
-
HF transformers or OpenAI API are sorta de-facto standards how devs and clients interact with LLMs these days. I guess if TC provides a similar interface it at least wouldn't be worse. |
Beta Was this translation helpful? Give feedback.
-
@shoumikhin OpenAI API, AFAIU, is related to end point APIs while building pipeline of components, with customizability across different aspects such as tokenizer, kv cache management, long context management etc. might be different. Dont know enough about HF in this space. I assume that would more closely align with some of the objectives here. And generally @mergennachin, I would probably want to also understand how different end-users envision deploying models. Requirements listed here make sense but I cant place them in larger context and where they are coming from. @shoumikhin's comment regarding HF users do make sense though but does the same exist for on-device use-cases? |
Beta Was this translation helpful? Give feedback.
-
🚀 The feature, motivation and pitch
Let's build an example demo app, perhaps in pytorch-labs, which will become a forcing function to improve developer experience from a user perspective. A positive outcome of this demo app is to define and build new higher level abstractions (e.g., similar to Pipelines).
On a high-level, here's the app we would like to build: LLM based on voice input and output. In terms of implementation, it's a three step process:
Here are the requirements:
Here's a positive outcome of this demo app:
Alternatives
No response
Additional context
Already another RFC, but specifically in the context of LLMs
RFC (Optional)
No response
cc @cccclai @helunwencser @dvorjackz
Beta Was this translation helpful? Give feedback.
All reactions