Skip to content
View mohammed840's full-sized avatar
:shipit:
:shipit:

Block or report mohammed840

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mohammed840/README.md

MasterHead

Hi 👋, I'm Mohammed Alshehri

AI/ML Research Engineer | Speech & Multimodal Systems | LLM Fine-tuning & Post-Training RL

Coding

abdullah-khaled0


👨‍💻 About Me:

  • 💬 Ask me about: AI/ML Research, Speech Systems (ASR), LLM Fine-tuning, Post-Training Reinforcement Learning, Multimodal AI, Prompt Engineering
  • Fun fact: Passionate about building production ML systems and AI solutions!

🌐 Connect with me:

LinkedIn - Mohammed


🎓 Education


🏫 Bachelor's Degree in Computer Science with Data science

  • University Badge


💼 Experience


🧑‍💻 Researcher & Engineer

Remote
August 2025 - Present

  • Designed an Arabic TTS adaptation pipeline: phoneme normalization, accent controls, prosody tuning; delivered real-time monitoring dashboards for quality and latency.
  • Built an evaluation harness (dataset curation, scoring, regression checks) to track quality across model versions.
  • Fine-tuned Arabic-first LLMs on domain datasets (instruction tuning + preference tuning); improved task accuracy on internal eval sets.

PyTorch LLM Fine-tuning TTS Evaluation Arabic NLP


🧑‍💻 CEO & Lead Research Engineer

Remote
December 2024 - August 2025 (9 months)

  • Co-founded and led ML + product engineering for an AI notetaker: transcription, summarization, and automation workflows.
  • Built summarization pipeline with caching + chunking + streaming; reduced end-to-end processing time by 40% and improved real-time note stability.
  • Actively engaged in sales activities conducting demos, handling client objections, and closing deals while continuing product development.
  • Built MCP-driven summarization algorithms, reducing processing time 40% and improving real-time note quality.

Summarization ASR Node.js React Product Engineering


🧑‍💻 Multimodal AI Engineer

IBM · Hybrid
June 2024 - December 2024 (7 months)

  • Built and optimized multimodal AI systems combining ASR and LLM prompt engineering to improve speech and text understanding in enterprise AI products.
  • Contributed to the Watsonx team, improving semantic accuracy and human–AI interaction across voice and text pipelines.
  • Designed evaluation frameworks, ran model validation, and collaborated on applied experiments with OpenAI researchers to bridge research ideas into production systems.

Multimodal AI ASR Prompt Engineering Watsonx Model Evaluation


🧑‍💻 NLP, AI & Software Engineer Intern

IBM · Hybrid
March 2023 - September 2023 (6 months)

  • Engineered an advanced chatbot utilizing large language models which simplified legal document interactions, achieving a 30% reduction in customer query handling time.
  • Implemented AI-driven algorithms using Python and TensorFlow to automate complex contractual language summarization, cutting document processing time by 40%.
  • Enhanced the chatbot's architecture to deliver 25% quicker and more precise human-like responses, significantly boosting real-time legal jargon interpretation using NLP techniques.

LLM NLP TensorFlow Python Chatbot


🖥️ Software Engineer

Self-employed
Nov 2021 - Dec 2022 (1 year 2 months)
Location: Saudi Arabia / Ireland

Key Projects:

  • FCAI App: Designed to assist classmates with academic materials, streamlining access to resources.
  • Tasbeeh App: Developed a digital tool for tasbeeh, enhancing users' spiritual practices.
  • Hoozgram: Built an app for mood tracking, promoting emotional awareness.
  • 3D Game: Created a 3D game using Unity Engine, offering engaging gameplay experiences.
  • AR App: Developed an augmented reality application, delivering immersive experiences.
  • Food Recipes App: Released a user-friendly app featuring a variety of food recipes.
  • Website Development: Built dynamic websites using various technologies, showcasing a full-stack development skill set.

Unity Problem Solving NoSQL Databases Augmented Reality SQL Web Development User Experience (UX) Agile Methodologies Version Control (Git)



🛠️ Technical Skills

🖥️ Programming Languages

Python R

🤖 Machine Learning & NLP

Scikit-Learn NumPy Pandas Matplotlib PySpark SpaCy NLTK Transformers ASR Pipelines

🧠 Deep Learning & GenAI

PyTorch TensorFlow LLM Fine-tuning Post-Training RLHF Langchain Cohere

🚀 Deployment & MLOps

Streamlit Flask FastAPI Docker MLFlow DVC CI/CD

📊 Data Science & Business Intelligence

SQL Airflow DBT SSIS ETL NoSQL Tableau Power BI Excel Snowflake Statistics Time Series Web Scraping

☁️ Familiar with Cloud Computing & Services

Azure Data Factory Google Cloud Storage Azure Blob Storage AWS Kinesis AWS Lambda BigQuery Amazon SageMaker Azure Databricks S3 Bucket


🌟 Soft Skills

🎯 Problem-Solving & Critical Thinking

  • Proficient in breaking down complex problems and designing efficient, scalable solutions.

🧑‍🤝‍🧑 Teamwork & Collaboration

  • Experience working in cross-functional teams.
  • Strong collaboration and communication skills with both technical and non-technical stakeholders.

💡 Creativity & Innovation

  • Innovative thinker with a passion for exploring new technologies and solving real-world problems through AI and data science.

📅 Time Management & Organization

  • Ability to manage multiple projects simultaneously, prioritizing tasks and meeting tight deadlines in fast-paced environments.

🎙️ Communication & Presentation

  • Skilled at conveying technical insights to diverse audiences, making complex data accessible and actionable.
  • Proficient in presenting data-driven findings clearly and persuasively.

📈 Continuous Learning

  • A mindset of continuous learning with an interest in staying updated with the latest trends in AI, machine learning, and data science.

📊 Recent Projects


🎙️ Voice-Enabled Patient Education Agent (redec.io)

Description:
Built a voice-enabled patient education agent using foundation models to simplify lab results and clinical conversations across speech and text. Implemented an evaluation framework to compare foundation model outputs across patient scenarios using rubrics, automated checks, and human review loops.

Tools & Technologies:
ASR LLM Evaluation Python



⚙️ Automated Machine Learning Pipeline for Data Preprocessing, Training, and Evaluation

Description:
Built a fully automated machine learning pipeline using Python and DVC. This pipeline handles data preprocessing, model training using XGBoost, and evaluation. Versioning and reproducibility are ensured via DVC.

Tools & Technologies:
Python DVC XGBoost


🚀 Comprehensive Sales Performance Enhancement

Description:
Analyzed sales data, leading to a 39% YoY growth in sales by visualizing channel performance and promotional effectiveness using Power BI. Provided actionable insights that optimized future sales strategies, driving revenue growth.

Tools & Technologies:
Power BI DAX Data Visualization


💼 Customer Personality Analysis

Description:
Performed customer segmentation using K-Means Clustering to categorize customers by behavioral patterns. These insights helped the company design personalized marketing strategies, improving customer engagement and revenue.

Tools & Technologies:
Python K-Means Clustering Pandas




🏅 Certifications

🚀 Machine Learning & Data Science

📊 Data Analysis & Visualization

🧮 Mathematics & Statistics

🛠️ Data Engineering

📈 Communication & Strategy



Top Languages

Top Languages

Current Streak

Current Streak

Popular repositories Loading

  1. RLM-implementation RLM-implementation Public

    RVAA: Recursive Vision-Action Agent for Long Video Understanding. Implementation of the RLM paradigm (Zhang, Kraska, Khattab 2025)

    Python 102 15

  2. Water-Quality-Prediction-machine-learning-python Water-Quality-Prediction-machine-learning-python Public

    predicting if the water is safe to consume with given dataset

    Jupyter Notebook 7 5

  3. Tumor-Detection-openai-oss Tumor-Detection-openai-oss Public

    Python 2

  4. VoiceChat-GPT-OSS VoiceChat-GPT-OSS Public

    Trying out GPT-OSS

    Python 1

  5. the-exploration-phase the-exploration-phase Public

    Just having fun with RL meeeeeeh

    Jupyter Notebook 1

  6. reward-hacking-detector reward-hacking-detector Public

    Reward-Hacking Detection via RL-Post-Training of LLMs - Detecting reward hacking in RL agents using language models as trajectory auditors

    Python 1