Skip to content
View ydli-ai's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@jenkins-zh @CLUEbenchmark @CVI-SZU

Block or report ydli-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ydli-ai/README.md

Hi there 👋

I am currently a Post-Doc researcher at NGNLab, Tsinghua University. My research focuses on the application of large language models (LLMs).

Education

2021.9 - 2024.12, College of Computer Science & Software Engineering, Shenzhen University
2018.9 - 2021.6, School of Information Engineering, China University of Geosciences (Beijing)
2014.9 - 2018.6, School of Computer Science and Engineering, Central South University

Publications

  • Li, Y., Hou, X., Zheng, D., Shen, L., Zhao, Z. FLIP-80M: 80 Million Visual-Linguistic Pairs for Facial Language-Image Pre-Training. In Proceedings of the 32th ACM International Conference on Multimedia. [paper][code]
  • Xie. J., Ye. K., Li. Y., et al. Learning Visual Prior via Generative Pre-training. Advances in Neural Information Processing Systems. [paper]
  • Li, Y., Feng, Y., Zhou, W., Zhao, Z., Shen, L., Hou, C., Hou, X. Dynamic data sampler for cross-language transfer learning in large language models. In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing [paper][code]
  • Zhao, Z., Li, Y., Hou, C., et. al. TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics [paper][code]
  • Li, Y., Hou, X., Zhao, Z. et. al. Talk2face: A unified sequence-based framework for diverse face generation and analysis tasks. In Proceedings of the 30th ACM International Conference on Multimedia, 25, 3409-3419. [paper][code]
  • Li, Y., Zhang, Y., Zhao, et. al. CSL: A Large-scale Chinese Scientific Literature Dataset. In Proceedings of the 29th International Conference on Computational Linguistics (pp. 3917-3923). [paper][code]
  • Hou. X., Zhang. X., Li. Y., et al. Textface: Text-to-style mapping based face generation and manipulation[J]. IEEE Transactions on Multimedia, 2022, 25: 3409-3419. [paper]
  • Xu L, Hu H, Zhang X, Li L, Cao C, Li Y, et al.  CLUE: A Chinese Language Understanding Evaluation Benchmark. In Proceedings of the 28th International Conference on Computational Linguistics. [paper][code]

Pinned Loading

  1. FLIP FLIP Public

    [ACM MM 2024 (Oral)] FLIP-80M: A Large-Scale Facial Language-Image Dataset 用于预训练的大规模人脸图文数据集

    5 1

  2. CVI-SZU/Linly CVI-SZU/Linly Public

    Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

    Python 3k 236

  3. CSL CSL Public

    [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集

    Python 595 59

  4. Tencent/TencentPretrain Tencent/TencentPretrain Public

    Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

    Python 1.1k 142

  5. dbiir/UER-py dbiir/UER-py Public

    Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

    Python 3k 525

  6. CLUEbenchmark/CLUE CLUEbenchmark/CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python 4.1k 545