navid72m/README.md

Hi, I'm Navid Mirnouri

Deep Learning Researcher & Engineer
Exploring the frontiers of Large Language Models, Natural Language Processing, and Reinforcement Learning.


🧠 About Me

I am a researcher with a deep interest in building intelligent systems that understand, reason, and interact with the world through language. My work spans foundational and applied aspects of LLMs, NLP, and deep RL, with a focus on alignment, generalization, and real-world deployment.

  • 🧠 Focus: LLMs, Transformers, RLHF, Model Evaluation, Prompt Engineering
  • 🧪 Research interests: interpretability, alignment, multi-agent systems, societal simulations
  • 🛠️ Tools: PyTorch, Hugging Face, RLlib, LangChain, Weights & Biases
  • 🎯 Goal: Contribute to responsible AI research and advance understanding of intelligent behavior

📂 Featured Projects

🧠 Simulating Society as a Neural Network

A conceptual and experimental framework modeling societal systems as neural networks, aiming to balance fairness, efficiency, and meritocracy using deep learning and RL.
Keywords: Social Simulation, Policy Optimization, Ethics in AI
📄 Paper Draft (available on request)
🔗 GitHub: navid72m/society-as-a-network

🤖 Real-Time LLMs on Embedded Devices

Research initiative exploring model quantization, compression, and distillation to deploy LLMs on edge hardware such as the Jetson Orin Nano.
Focus: Efficient inference, low-latency deployment, hardware-aware design.
🔗 GitHub: navid72m/llm-embedded

📈 Time Series & Forecasting Toolkit

End-to-end framework for visualizing, analyzing, and forecasting time series data using statistical and deep learning approaches.
Tools: Prophet, ARIMA, LSTM
🔗 GitHub: navid72m/time-series-lab

RLHF Experiments

Exploring how human feedback can guide the behavior of language models through reinforcement learning.
Includes policy optimization experiments and reward model training.
🔗 GitHub: navid72m/rlhf-lab


🔬 Ongoing Research Directions

  • Language models in multi-agent environments for emergent behavior analysis
  • 🎲 Game-theoretic approaches to fairness and alignment in RL settings
  • 🧩 Investigating hallucinations in LLMs and developing robust evaluation metrics
  • 🧠 Preparing for a potential PhD at the intersection of AI, philosophy, and decision theory

📚 Learning & Reading

Currently reading:

  • "Hands-On Large Language Models"
  • Papers on mechanistic interpretability
  • Stanford / DeepMind publications on RLHF and AI alignment

🔗 Connect with Me

LinkedIn


Pinned

  1. llm-twin-course Public

    Forked from decodingml/llm-twin-course

    🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 11 hands-on lessons

    Python

  2. LLMs-from-scratch Public

    Forked from rasbt/LLMs-from-scratch

    Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

    Jupyter Notebook

  3. time-series-platform Public

    Python

  4. pdf Public

    AI-Powered Document Intelligence System | Retrieval-Augmented Generation (RAG) Advanced document processing platform that combines semantic embedding, intelligent retrieval, and generative AI to …

    Python 13 3

  5. chatbot Public

    A repository for running LLMs locally on your laptop as a chatbot

    JavaScript 5

  6. sim-society Public

    Python 1