taorui-plus/README.md

👋 Hi, I'm Tao Rui (陶瑞, Rumi)

📧 Email: rumi@plaud.ai | 📝 CSDN: taorui.blog.csdn.net


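🎯 About Me

I have long worked on the research and development of speech recognition algorithms. I have solid experience in speech signal processing and in training and optimizing deep learning models, and I am familiar with the architectures and engineering implementations of mainstream industry speech recognition systems.
I have extensive project experience in speech transcription, multilingual recognition, and real-time recognition, and I am committed to bringing speech technology into real-world applications and optimizing it there.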


🛠 Research Areas

  • 🎧 Audio Understanding
  • 🔊 Sound Generation
  • 🗣️ Speech Recognition
  • 🧠 Speech Synthesis
  • 🚨 Sound Event Detection
  • 👥 Speaker Diarization

📝 Publications

First-author papers:

  • 📌 Interspeech 2022:
    Couple Learning for Semi-supervised Sound Event Detection

  • 📌 ICME 2024:
    Frame Pairwise Distance Loss for Weakly-supervised Sound Event Detection

Second-author papers:

  • 📌 AAAI 2024:
    Audio Generation with Multiple Conditional Diffusion Model

  • 📌 ICASSP 2024:
    Semi-supervised Sound Event Detection with Local and Global Consistency Regularization

  • 📌 DCASE Workshop 2022:
    A Hybrid System of Sound Event Detection Transformer and Frame-wise Model

  • 📌 DCASE Workshop 2021:
    Sound Event Detection Using Metric Learning and Focal Loss for DCASE 2021 Task 4


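💼 Work Experience

  • Toshiba (China) Co., Ltd.
    Research Department · Speech Technology Division · Researcher
    📅 2020.8 - 2024.3

  • 十一贝科技有限公司 (Shiyibei Technology Co., Ltd.)
    Intelligent Speech Department · Team Leader
    📅 2018.10 - 2020.7

  • Baidu Finance (now Du Xiaoman Financial)
    Credit Risk Control Technology Department · Senior R&D Engineer (T4)
    📅 2017.6 - 2018.10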


Pinned

  1. OpenNRE (Public)

    Forked from thunlp/OpenNRE

    Fine-tuned from the HIT (哈工大) BERT model; reaches 0.97 accuracy on a Chinese person-relation extraction task

    Python · 119 stars · 33 forks

  2. Chinese-ASR-gitbook (Public)

    An e-book on industrial-grade Chinese speech recognition systems

    13 stars · 3 forks

  3. NVIDIA/TensorRT-LLM (Public)

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

    Python · 13.5k stars · 2.3k forks

  4. langchain (Public)

    Forked from langchain-ai/langchain

    🦜🔗 Build context-aware reasoning applications

    Python