Organizations

@thunlp

Starred repositories

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,799 1,385 Updated Feb 1, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,969 362 Updated Mar 1, 2025

Textbook on reinforcement learning from human feedback

TeX 462 34 Updated Mar 1, 2025

Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

Python 534 47 Updated Feb 24, 2025

🙌 OpenHands: Code Less, Make More

Python 48,313 5,307 Updated Feb 28, 2025

Simple Python interface for Graphviz

Python 1,687 212 Updated May 13, 2024

Powerful menu bar manager for macOS

Swift 17,012 301 Updated Jan 26, 2025

Build & Optimize your RAG.

Python 442 32 Updated Feb 27, 2025

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 240 8 Updated Feb 23, 2025

Lime: Explaining the predictions of any machine learning classifier

JavaScript 11,777 1,824 Updated Jul 25, 2024

Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"

Python 49 3 Updated Dec 20, 2024

Fast State-of-the-Art Static Embeddings

Python 1,075 49 Updated Feb 28, 2025

Helper for managing arXiv papers in Zotero

TypeScript 156 4 Updated Feb 20, 2025

Common Crawl fork of Apache Nutch

Java 32 2 Updated Jan 8, 2025

Universal Python binding for the LMDB 'Lightning' Database

C 670 109 Updated Jan 9, 2025

A library that provides an embeddable, persistent key-value store for fast storage.

C++ 29,207 6,420 Updated Feb 27, 2025

GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

Scala 1,024 244 Updated Feb 27, 2025

A scalable, mature and versatile web crawler based on Apache Storm

Java 901 262 Updated Feb 28, 2025

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Java 2,912 762 Updated Feb 14, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 31,894 2,661 Updated Feb 28, 2025

Zstandard - Fast real-time compression algorithm

C 24,429 2,196 Updated Feb 28, 2025

Puzzles for learning Triton, play it with minimal environment configuration!

Python 247 21 Updated Dec 3, 2024

The uncompromising Python code formatter

Python 39,858 2,546 Updated Mar 1, 2025

A Visual Studio Code extension with support for the Ruff linter.

TypeScript 1,266 57 Updated Feb 28, 2025

Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]

TypeScript 79 11 Updated Jan 18, 2025

This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".

Python 15 2 Updated Oct 28, 2024

NanoGPT (124M) in 3 minutes

Python 2,332 249 Updated Feb 21, 2025

The repository for the code of the UltraFastBERT paper

Python 518 31 Updated Mar 24, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,385 306 Updated Jan 24, 2025