Organize project repo #29

Open
deanwampler opened this issue Dec 12, 2024 · 4 comments
Assignees
Labels
administration Project management, etc.
Milestone

Comments

@deanwampler
Contributor

Our main repo: https://github.com/The-AI-Alliance/trust-safety-evals

We need to have everything nicely organized in the above. Currently things are all over the place:

BenchBench forked here: https://github.com/The-AI-Alliance/tse-ibm-benchbench

HF repos:
SafetyBAT: https://huggingface.co/spaces/aialliance/safetybat
SafetyArena (leaderboard): https://huggingface.co/spaces/aialliance/safetyarena
SafetyArena (backend): https://huggingface.co/spaces/aialliance/safetyarena-backend

@deanwampler
Contributor Author

For the HF repos, we can set up forks to copies in the Alliance.

For these repos and for BenchBench, it feels to me like these are standalone projects for deploying tools that might have dependencies on https://github.com/The-AI-Alliance/trust-safety-evals, but don't need to live within the repo. True?

When it's better to have the content of one repo nested within another, we should consider using submodules first, then consider if that's sufficient or we should just move the contents into the other repo.

@deanwampler deanwampler moved this to Todo in FA2: TSEI Tasks Dec 12, 2024
@deanwampler deanwampler moved this from Todo to Planning in FA2: TSEI Tasks Dec 12, 2024
@deanwampler deanwampler added the administration Project management, etc. label Jan 4, 2025
@deanwampler
Contributor Author

deanwampler commented Jan 13, 2025

@bnayahu Thoughts on this? I take it https://github.com/The-AI-Alliance/tse-ibm-benchbench is the repo for what's running on HF. What else is deployed to HF that isn't in this repo?

I see a few things we need to do offhand to better engineer all this:

  • Document all the repos where code is kept, including what's used to deploy to HF.
  • Document operational procedures, e.g., what to do when something has stopped running. (How can we prevent that or get alerted immediately?)
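
On the alerting question, one possible starting point is a small reachability check run on a schedule. This is only a sketch: the function names and the timeout are illustrative assumptions, and a 2xx response from the Space page shows that the page is reachable, not that the app inside it is healthy.

```python
import urllib.error
import urllib.request
from typing import Callable

# Space URL taken from this thread; the timeout below is an assumed value.
SPACE_URL = "https://huggingface.co/spaces/aialliance/safetybat"


def http_status(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code for url, or 0 if the request fails outright."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (4xx/5xx).
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, and similar.
        return 0


def is_up(url: str, fetch: Callable[[str], int] = http_status) -> bool:
    """Treat any 2xx response as reachable; fetch is injectable for testing."""
    return 200 <= fetch(url) < 300


if __name__ == "__main__":
    print("reachable" if is_up(SPACE_URL) else "NOT reachable")
```

Run from a scheduler (cron, a CI job, etc.), a failing check could then trigger whatever notification channel we agree on.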

@deanwampler deanwampler added this to the 2025-01-31 milestone Jan 13, 2025
@bnayahu
Contributor

bnayahu commented Jan 14, 2025

We also need to decide on how we govern our code, our dependencies and the lifecycle.

Let's consider SafetyBAT as an example: the "production" environment is a HF space, which is essentially a repo they host (https://huggingface.co/spaces/aialliance/safetybat/tree/main). This space is a clone of the IBM BenchBench space (https://huggingface.co/spaces/ibm/benchbench/tree/main), with some small modifications. The BenchBench space uses the BenchBench library (https://github.com/ibm/benchbench).

The way it SHOULD work (I think): Both IBM-owned repos should be forked into the AIA org (currently the BB library is forked into tse-ibm-benchbench), and serve as our baseline. Any changes or enhancements we need should be made in these forks and, where it makes sense, contributed back upstream. The HF repo should be configured as an additional remote for our repo, so there's a single source for the code. If this makes sense conceptually, and if it can be done technically, let's do it.
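
A minimal sketch of the "additional remote" idea, assuming a local clone of our GitHub repo for the Space code. The remote name `hf`, the branch `main`, and the helper names are illustrative assumptions; the Space URL is the one listed earlier in this thread. The sketch only builds the Git commands and prints them. Authentication and any history differences between the two repos are not handled.

```python
import subprocess
from typing import List

# Space URL taken from this thread; "hf" and "main" are assumed names.
HF_SPACE_URL = "https://huggingface.co/spaces/aialliance/safetybat"


def hf_remote_commands(
    space_url: str, remote: str = "hf", branch: str = "main"
) -> List[List[str]]:
    """Build the git commands that register the Space as a remote and push to it."""
    return [
        ["git", "remote", "add", remote, space_url],
        ["git", "fetch", remote],
        ["git", "push", remote, f"{branch}:{branch}"],
    ]


def run_all(commands: List[List[str]], repo_dir: str) -> None:
    """Run each command inside the local clone, stopping at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, cwd=repo_dir, check=True)


if __name__ == "__main__":
    # Print instead of executing so the sketch is safe to run anywhere.
    for cmd in hf_remote_commands(HF_SPACE_URL):
        print(" ".join(cmd))
```

If the Space and the GitHub repo have diverged, the push will be rejected until the histories are reconciled, so the first sync would likely need a manual merge.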

@deanwampler
Contributor Author

That makes sense to me. I see from the settings for safetybat that it is associated with the benchbench repo, and it appears there are updates to the latter that could be merged. What's not clear to me is how to automate things like this, but we could at least document the manual steps. I also don't see a way to move the upstream repo from benchbench to tse-ibm-benchbench, but it might be doable.

Something else to manage: I see in the README editing view that the streamlit dependency can be upgraded: https://huggingface.co/spaces/aialliance/safetybat/edit/main/README.md

@deanwampler deanwampler moved this from Planning to Done in FA2: TSEI Tasks Jan 15, 2025
@deanwampler deanwampler moved this from Done to In Progress in FA2: TSEI Tasks Jan 15, 2025
Projects
Status: In Progress
Development

No branches or pull requests

2 participants