Skip to content
View alexchen4ai's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Stanford University
  • US, CA

Highlights

  • Pro

Block or report alexchen4ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
alexchen4ai/README.md

Hi πŸ‘‹, I'm Alex Chen!

Founder, CEO & Chief Scientist at Nexa AI | Stanford PhD


πŸš€ About Me

I am an AI researcher and software architect specializing in on-device AI and efficient generative models.

Currently, I am building Nexa AI, where we enable Day-0 support for state-of-the-art generative AI models on edge devices (NPU, GPU, CPU). My work focuses on making AI friction-free, private, and production-ready for mobile, PC, automotive, and IoT platforms.

  • πŸ”­ I’m currently working on: NexaSDK and the NexaML inference engine.
  • 🌱 My research interests: Multimodal AI, Model Quantization, Hardware Acceleration (Qualcomm HTP, CUDA), and Agentic Workflows.
  • πŸ’Ό Previous Experience: Investment Scout at Sequoia Capital; PhD Researcher at Stanford.

πŸ› οΈ Featured Work

  • NexaML: A core inference engine enabling multimodal model deployment on Qualcomm NPU/GPU/CPU. Achieved 7.6K+ GitHub stars.
  • Octopus Model Series (V1-V4): On-device language models that outperform GPT-4o on function-calling benchmarks with 35x faster inference and 70x better energy efficiency.
  • Hyperlink: A fully local, private desktop app for agentic RAG file search.

πŸ“ Selected Publications

Popular repositories Loading

  1. blog blog Public

    My blog to share the AI tech & entrepreneurship

    HTML 3

  2. cs229-2019-summer cs229-2019-summer Public

    Forked from maxim5/cs229-2019-summer

    All notes and materials for the CS229: Machine Learning course by Stanford University

    HTML

  3. Deep-Reinforcement-Learning-Algorithms-with-PyTorch Deep-Reinforcement-Learning-Algorithms-with-PyTorch Public

    Forked from p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch

    PyTorch implementations of deep reinforcement learning algorithms and environments

    Python

  4. website website Public

  5. cs224u-1 cs224u-1 Public

    Forked from cgpotts/cs224u

    Code for Stanford CS224u

    Jupyter Notebook

  6. pykoi pykoi Public

    Forked from CambioML/pykoi

    pykoi: Active learning in one unified interface

    Python