Organize project repo #29

deanwampler · 2024-12-12T23:20:28Z

Our main repo: https://github.com/The-AI-Alliance/trust-safety-evals

We need to have everything nicely organized in the above. Currently things are all over the place:

BenchBench forked here: https://github.com/The-AI-Alliance/tse-ibm-benchbench

HF repos:
SafetyBAT: https://huggingface.co/spaces/aialliance/safetybat
SafetyArena (leaderboard): https://huggingface.co/spaces/aialliance/safetyarena
SafetyArena (backend): https://huggingface.co/spaces/aialliance/safetyarena-backend

deanwampler · 2024-12-12T23:28:53Z

For the HF repos, we can set up forks to copies in the Alliance.

For these repos and for BenchBench, it feels to me like these are standalone projects for deploying tools that might dependencies on https://github.com/The-AI-Alliance/trust-safety-evals, but don't need to live within the repo. True?

When it's better to have the content of one repo nested within another, we should consider using submodules first, then consider if that's sufficient or we should just move the contents into the other repo.

deanwampler · 2025-01-13T23:34:12Z

@bnayahu Thoughts on this? I take it https://github.com/The-AI-Alliance/tse-ibm-benchbench is repo for what's running on HF. What else is deployed to HF that isn't in this repo?

I see a few things we need to do offhand to better engineer all this:

Document all the repos where code is kept, including what's used to deploy to HF.
Document operational procedures, e.g., what do to when something has stopped running. (How can we prevent that or get alerted immediately?)

bnayahu · 2025-01-14T10:25:51Z

We also need to decide on how we govern our code, our dependencies and the lifecycle.

Lets consider SafetyBAT as an example: the "production" environment is a HF space, which is essentially a repo they host (https://huggingface.co/spaces/aialliance/safetybat/tree/main). This space is a clone of the IBM BenchBench space (https://huggingface.co/spaces/ibm/benchbench/tree/main), with some small modifications. The BenchBench space uses the BenchBench library (https://github.com/ibm/benchbench).

The way it SHOULD work (I think): Both IBM-owned repos should be forked into the AIA org (currently the BB library is forked into tse-ibm-benchbench), and serve as our baseline. Any changes or enhancements we need should be done in these forks, potentially (where it makes sense) contributed back to the origins. The HF repo should be configured as an additional remote for our repo, so there's a single source for the code. If this makes sense conceptually, and if it can be done technically, lets do it.

deanwampler · 2025-01-14T14:10:02Z

That makes sense to me. I see from the settings for safetybat that it is associated with the benchbench repo and it appears there are updates to the latter that could be merged. What's not clear to me is how to automate things like this, but we could at least document the process for manual steps. I also don't see a way to move the upstream repo from benchbench to tse-ibm-benchmark, but it might be doable.

Something else to manage, I see in the README editing view that the streamlit dependency can be upgraded. https://huggingface.co/spaces/aialliance/safetybat/edit/main/README.md

bnayahu added this to Trust and Safety Evaluations Dec 9, 2024

deanwampler assigned deanwampler and bnayahu Dec 12, 2024

deanwampler converted this from a draft issue Dec 12, 2024

deanwampler removed this from Trust and Safety Evaluations Dec 12, 2024

deanwampler added this to FA2: TSEI Tasks Dec 12, 2024

deanwampler moved this to Todo in FA2: TSEI Tasks Dec 12, 2024

deanwampler moved this from Todo to Planning in FA2: TSEI Tasks Dec 12, 2024

deanwampler added the administration Project management, etc. label Jan 4, 2025

deanwampler added this to the 2025-01-31 milestone Jan 13, 2025

deanwampler moved this from Planning to Done in FA2: TSEI Tasks Jan 15, 2025

deanwampler moved this from Done to In Progress in FA2: TSEI Tasks Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Organize project repo #29

Organize project repo #29

deanwampler commented Dec 12, 2024

deanwampler commented Dec 12, 2024

deanwampler commented Jan 13, 2025 •

edited

Loading

bnayahu commented Jan 14, 2025

deanwampler commented Jan 14, 2025

Organize project repo #29

Organize project repo #29

Comments

deanwampler commented Dec 12, 2024

deanwampler commented Dec 12, 2024

deanwampler commented Jan 13, 2025 • edited Loading

bnayahu commented Jan 14, 2025

deanwampler commented Jan 14, 2025

deanwampler commented Jan 13, 2025 •

edited

Loading