- Triton: writing custom Triton kernels for better performance; working on some larger kernel projects
- CUDA: GPU architecture fundamentals for a deeper understanding of kernels and Triton
- Deep Learning: computer vision, NLP, etc. : )
- Languages: Python, CUDA, C++
- Frameworks & Libraries: PyTorch, pandas, Matplotlib, Triton, mpi4py
- Tools & Platforms: GitHub, Docker, Vercel, Neovim, VS Code, Jupyter Notebook, AWS
- Machine Learning: proficient in statistical analysis, predictive modeling (regression, decision trees, random forests), and gradient-based methods (CatBoost, SGD), with a strong focus on optimization and accuracy.
- GPU Sanghathan: small-scale distributed training of sequential deep learning models, built on NumPy and MPI.
- CUDA kernels: writing CUDA kernels from scratch, from vec_add up to flash attention, plus model implementations from scratch.
- Flash Attention: implementation of flash attention in Triton.
- PaliGemma: implemented Google's PaliGemma vision-language model from scratch, following the paper.
- Transformer: implemented the Transformer language model from scratch, following the original paper.
- Mixture of Experts: Mixture of Experts (MoE) model with a focus on efficient routing and expert utilization.
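The top-k routing idea behind the MoE project can be sketched in plain NumPy. This is an illustrative sketch, not the project's actual code; the function name, shapes, and the renormalize-over-selected-experts choice are all assumptions:

```python
import numpy as np

def moe_forward(x, gate_w, experts, k=2):
    """Route each token to its top-k experts and mix their outputs.

    x: (tokens, d) inputs; gate_w: (d, n_experts) gating weights;
    experts: list of callables mapping a (d,) vector to a (d,) vector.
    """
    logits = x @ gate_w                              # (tokens, n_experts)
    # softmax over experts, stabilized by subtracting the row max
    probs = np.exp(logits - logits.max(-1, keepdims=True))
    probs /= probs.sum(-1, keepdims=True)
    topk = np.argsort(probs, -1)[:, -k:]             # top-k expert indices per token
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        sel = topk[t]
        w = probs[t, sel]
        w = w / w.sum()                              # renormalize over the chosen experts
        for e, wt in zip(sel, w):
            out[t] += wt * experts[e](x[t])          # weighted mix of expert outputs
    return out
```

Real MoE layers batch tokens per expert instead of looping token by token; the loop here just keeps the routing logic readable.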
Triton/CUDA kernels in my free time : )
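For flavor, here is the online-softmax trick that flash attention builds on, sketched in NumPy for a single query vector. This is a sketch of the idea, not the Triton kernel; the function names and the single-query restriction are assumptions:

```python
import numpy as np

def naive_attention(q, K, V):
    """Reference: full softmax over all scores at once."""
    s = K @ q
    p = np.exp(s - s.max())
    p /= p.sum()
    return p @ V

def online_attention(q, K, V, block=4):
    """Process keys/values in blocks, keeping a running max and
    normalizer so the full softmax is never materialized."""
    m = -np.inf                  # running max of scores seen so far
    l = 0.0                      # running softmax normalizer
    acc = np.zeros(V.shape[1])   # running weighted sum of values
    for i in range(0, K.shape[0], block):
        s = K[i:i + block] @ q
        m_new = max(m, s.max())
        scale = np.exp(m - m_new)          # rescale old state to the new max
        p = np.exp(s - m_new)
        l = l * scale + p.sum()
        acc = acc * scale + p @ V[i:i + block]
        m = m_new
    return acc / l
```

The two functions agree exactly; the blocked version is what lets flash attention keep everything in fast on-chip memory instead of writing the attention matrix to HBM.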