Provide API usage tracking information

### **Describe the solution you'd like**
Many API-based AI services offer a [built-in feature](https://cline.bot/), a [plugin](https://marketplace.visualstudio.com/items?itemName=Dwtexe.cursor-stats), or a [third-party tool](https://cursorusage.com/) for tracking API usage.

Since Ailoy supports using AI APIs, it should also report information about their usage:

- tokens sent (upward) / received (downward)
- estimated API costs
- (anything else the API provides)
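
To make the request more concrete, here is a minimal Python sketch of how token counts could be turned into an estimated cost. Everything in it is hypothetical: `Usage`, `Pricing`, and `estimate_cost` are illustrative names, not part of Ailoy's API, and the prices are placeholders rather than any provider's real rates.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    """Token counts for one API call (hypothetical structure, not Ailoy's API)."""
    input_tokens: int   # tokens sent "upward" in the request
    output_tokens: int  # tokens received "downward" in the response


@dataclass
class Pricing:
    """Price in USD per one million tokens (placeholder values, set per model)."""
    input_per_million: float
    output_per_million: float


def estimate_cost(usage: Usage, pricing: Pricing) -> float:
    """Estimate the cost of one call in USD from its token counts."""
    return (
        usage.input_tokens * pricing.input_per_million
        + usage.output_tokens * pricing.output_per_million
    ) / 1_000_000


def total_usage(calls: list[Usage]) -> Usage:
    """Sum the token counts of several calls, e.g. for a whole session."""
    return Usage(
        input_tokens=sum(c.input_tokens for c in calls),
        output_tokens=sum(c.output_tokens for c in calls),
    )


if __name__ == "__main__":
    pricing = Pricing(input_per_million=2.0, output_per_million=8.0)  # placeholder rates
    session = total_usage([Usage(1000, 500), Usage(3000, 1500)])
    print(session, f"${estimate_cost(session, pricing):.4f}")
```

Most API providers return token counts with each response, so the library would mainly need to collect them and keep a per-model price table.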

Other statistics would also help people understand how they are using the AI, for both local and API models:
- latency
  - first token
  - whole response (last token)
- tok/s
  - prefill / decode
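
These numbers can be derived from timestamps taken around a streamed response. The sketch below is one possible way to do it in Python, assuming the response arrives as an iterable of tokens. `measure_stream` and `StreamStats` are hypothetical names, not Ailoy's API, and prefill speed is approximated as prompt tokens divided by time to first token.

```python
import time
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass
class StreamStats:
    first_token_latency: Optional[float]  # seconds until the first token
    total_latency: float                  # seconds until the stream ended
    output_tokens: int
    prefill_tok_per_sec: Optional[float]  # prompt tokens / first-token latency
    decode_tok_per_sec: Optional[float]   # tokens after the first / decode time


def measure_stream(
    tokens: Iterable[str],
    prompt_tokens: Optional[int] = None,
    clock: Callable[[], float] = time.perf_counter,
) -> StreamStats:
    """Consume a token stream and record latency and throughput."""
    start = clock()
    first: Optional[float] = None
    count = 0
    for _ in tokens:
        if first is None:
            first = clock()  # timestamp of the first token
        count += 1
    end = clock()  # taken right after the stream is exhausted

    ttft = first - start if first is not None else None
    prefill = None
    if prompt_tokens is not None and ttft is not None and ttft > 0:
        prefill = prompt_tokens / ttft
    decode = None
    if first is not None and count > 1 and end > first:
        decode = (count - 1) / (end - first)
    return StreamStats(ttft, end - start, count, prefill, decode)


if __name__ == "__main__":
    def fake_stream():
        # Stand-in for a model response; sleeps simulate generation time.
        for tok in ["Hello", ",", " world"]:
            time.sleep(0.05)
            yield tok

    print(measure_stream(fake_stream(), prompt_tokens=12))
```

Because the timing wraps the token iterator, the same measurement would work for local and API models.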

---

(This is slightly off-topic.)
For local models, reporting GPU compute/memory utilization or the amount of memory being consumed could also be helpful.


### **Additional context**
Here are the code repositories of some tools that provide this kind of information; they may help with the implementation.
- https://github.com/Dwtexe/cursor-stats
- https://github.com/cline/cline (a demo gif for cline is below.)
![cline demo](https://media.githubusercontent.com/media/cline/cline/main/assets/docs/demo.gif)


Provide API usage tracking information #118
