VishwamAI

VishwamAI is an advanced language model based on the Transformer architecture, designed for various natural language processing tasks.

Installation

Clone the repository:

```
git clone https://github.com/VishwamAI/chat-agent.git
cd chat-agent
```

Install the required packages:
```
pip install -r requirements.txt
```

Usage

Training the model

To train the VishwamAI model, run:

```
python scripts/train.py
```

The script initializes the model and datasets, then starts training.

Generating text

To generate text using a trained model, use:

```
python scripts/generate_text.py --prompt "Your prompt here" --max_length 100
```

Evaluating the model

To evaluate the model on a test dataset, run:

```
python scripts/evaluate.py --test_file path/to/test/file.txt
```

Testing Sampling Parameters

To test how the VishwamAI model responds to different sampling parameters (temperature, top-p, top-k), use:

```
python scripts/sampling_test.py --prompt "Your prompt here" --temperature 0.7 --top_p 0.9 --top_k 50
```
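
For readers unfamiliar with these three parameters, the following is a minimal, standard-library sketch of how temperature, top-k, and top-p are commonly combined when picking the next token. It is an illustration only and is not taken from `scripts/sampling_test.py`; the function names `filter_and_normalize` and `sample_token` are hypothetical, and the toy logits stand in for real model output. Here top-p is measured against the full distribution before renormalization, which is one of several conventions in use.

```python
import math
import random


def filter_and_normalize(logits, temperature=1.0, top_k=0, top_p=1.0):
    """Return {token_index: probability} after temperature, top-k and top-p filtering.

    Hypothetical helper for illustration; not part of the VishwamAI code base.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")

    # Temperature scaling: values below 1 sharpen the distribution, above 1 flatten it.
    scaled = [x / temperature for x in logits]

    # Numerically stable softmax over the scaled logits.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank token indices from most to least likely.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most likely tokens (0 disables the filter).
    if top_k > 0:
        ranked = ranked[:top_k]

    # Top-p: keep the shortest prefix whose cumulative probability reaches top_p.
    kept = []
    cumulative = 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Renormalize so the surviving probabilities sum to 1.
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}


def sample_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=random):
    """Draw one token index from the filtered distribution."""
    dist = filter_and_normalize(logits, temperature, top_k, top_p)
    indices = list(dist)
    weights = [dist[i] for i in indices]
    return rng.choices(indices, weights=weights, k=1)[0]


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.5, -1.0]  # made-up scores for a 4-token vocabulary
    print(filter_and_normalize(toy_logits, temperature=0.7, top_k=2, top_p=0.9))
    print(sample_token(toy_logits, temperature=0.7, top_k=2, top_p=0.9))
```

Lower temperature and smaller top-k or top-p values make output more deterministic; larger values allow more varied text.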

Note

Before running any script that depends on the Hugging Face API, make sure the HUGGING_FACE_TOKEN environment variable is set as described in the "Setting Up Environment Variables" section.

Configuration

You can modify the model and training configuration by editing the configuration files in the configs/ directory.

Setting Up Environment Variables

To use the Hugging Face API without hard-coding credentials, set the HUGGING_FACE_TOKEN environment variable to your Hugging Face token. Add the following line to your shell profile (e.g., .bashrc or .zshrc):

```
export HUGGING_FACE_TOKEN=your_hugging_face_token
```

After adding the line, reload your shell profile:

```
source ~/.bashrc  # or source ~/.zshrc
```
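
To confirm from Python that the variable is visible before launching a long job, a check along these lines may help. This is a sketch under assumptions: `get_hf_token` is a hypothetical helper name, and the project's own scripts may read the variable differently.

```python
import os


def get_hf_token(env=os.environ):
    """Return the Hugging Face token from the environment, or fail with a clear message.

    Hypothetical helper for illustration; not part of the VishwamAI code base.
    """
    token = env.get("HUGGING_FACE_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "HUGGING_FACE_TOKEN is not set; export it in your shell profile "
            "and reload the shell."
        )
    return token


if __name__ == "__main__":
    # Report only the length so the secret itself is never printed.
    print(f"Token found ({len(get_hf_token())} characters).")
```

Failing early this way avoids discovering a missing token partway through training or an upload.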

Documentation

For more detailed information about the model architecture, training process, and API reference, please refer to the docs/ directory.

Contributing

Contributions to VishwamAI are welcome! Please refer to the CONTRIBUTING.md file for guidelines on how to contribute to this project.

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.
