garmin923/CensusxD_graphExplore

A Workstudy Project with the Census xD Group

Attempting to make the Census Data more accessible and understandable

Gareth Minson-Efimov

2025

To Implement

Packages

Install the gpt4all and streamlit Python packages.

Load Data

Place "acs2023_1yr_variables_LABEL_CONCEPT_Btables.json" in the 01_raw folder.
Place "acs2023_1yr_c_tables_with_vars_LLM_cleaned.json" in the 03_processed folder.
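
Before running the scripts, it can help to confirm that both input files are where the pipeline expects them. The following is a minimal sketch, assuming the 01_raw and 03_processed folders sit directly under the directory you run it from; the `missing_inputs` helper is hypothetical and not part of this repository.

```python
from pathlib import Path

# Folder -> file name, taken from the "Load Data" instructions above.
# Assumption: both folders live directly under the project root.
EXPECTED_INPUTS = {
    "01_raw": "acs2023_1yr_variables_LABEL_CONCEPT_Btables.json",
    "03_processed": "acs2023_1yr_c_tables_with_vars_LLM_cleaned.json",
}


def missing_inputs(root="."):
    """Return the expected input files that are not present under `root`."""
    root = Path(root)
    return [
        Path(folder) / name
        for folder, name in EXPECTED_INPUTS.items()
        if not (root / folder / name).is_file()
    ]

# Example usage from the project root:
#   for path in missing_inputs():
#       print("Missing input file:", path)
```

An empty list means both files were found.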

Scripts

  1. Run the "cleaning_script_1.py" file, which takes the raw B Table information as input.
  2. Run the "Merging_B_C_tables.py" file, which combines the processed B Table with the pre-cleaned C Table.
  3. Run the "llm_dev_03.py" file with Streamlit: streamlit run < insert path > llm_dev_03.py [ARGUMENTS]

About this demo

LLM

This demo uses the Meta Llama 3 8B Instruct model, one of the best-performing open-source models (Hugging Face Meta Llama 3). It runs without a GPU, though it may run more efficiently with one.
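
As a rough illustration of how the model can be loaded and queried through the gpt4all Python package, here is a minimal sketch. It is not the code in "llm_dev_03.py". The model file name is an assumption (the name under which gpt4all is believed to list this model), and the `load_model` and `ask` helpers and the prompt layout are hypothetical.

```python
# Assumption: file name under which gpt4all distributes Llama 3 8B Instruct.
MODEL_FILE = "Meta-Llama-3-8B-Instruct.Q4_0.gguf"


def load_model(device="cpu"):
    """Load the model with gpt4all; pass device="gpu" to try GPU inference."""
    # Imported lazily so the rest of this sketch works without gpt4all installed.
    from gpt4all import GPT4All

    # On first use, gpt4all downloads the model file if it is not cached locally.
    return GPT4All(MODEL_FILE, device=device)


def ask(model, question, context="", max_tokens=256):
    """Send one question to the model, optionally preceded by table context."""
    prompt = f"{context}\n\nQuestion: {question}" if context else question
    with model.chat_session():
        return model.generate(prompt, max_tokens=max_tokens)

# Example usage (downloads several GB on the first run):
#   llm = load_model()
#   print(ask(llm, "Which table reports sex by age?"))
```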

ACS Context

The ACS B and C tables are available for the LLM to evaluate. Because of the model's context limits, the demo passes only 5 tables, along with the variables they cover, to the LLM.
The selection is controlled by the random seed, so the same 5 tables are chosen on every run.
The tables currently contain only information about which variables they cover. This could be expanded to include each table's universe and geography limits.
The context limits could be eased with a vector database, which would allow retrieving only the tables relevant to a given question.
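
The seeded selection described above can be sketched as follows. This is an illustration under stated assumptions, not the demo's actual code: the table data is assumed to be a dictionary mapping table IDs to lists of variable labels, the helper names `select_tables` and `build_context` are hypothetical, and the sample data is a toy example.

```python
import random


def select_tables(tables, k=5, seed=0):
    """Pick k tables reproducibly: the same seed always yields the same tables."""
    rng = random.Random(seed)      # private generator, independent of global state
    table_ids = sorted(tables)     # fixed order so the seed fully determines the draw
    chosen = rng.sample(table_ids, min(k, len(table_ids)))
    return {table_id: tables[table_id] for table_id in chosen}


def build_context(selected):
    """Format the chosen tables as one line per table for the LLM prompt."""
    return "\n".join(
        f"{table_id}: {', '.join(variables)}"
        for table_id, variables in selected.items()
    )


# Toy data (hypothetical structure and labels, for illustration only).
toy_tables = {f"T{i:02d}": [f"variable_{i}_a", f"variable_{i}_b"] for i in range(20)}
context = build_context(select_tables(toy_tables, k=5, seed=42))
```

Changing the seed changes which tables are shown to the model; keeping it fixed makes runs comparable.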
