📦 StratMem-Bench Dataset

This repository contains the StratMem-Bench dataset, a benchmark for evaluating memory use in virtual character dialogue.

📊 Overview

StratMem-Bench is a dataset for single-turn dialogue generation with an emphasis on how models utilize a provided memory pool.

Each instance includes:

a user query
a virtual character persona
a set of candidate memories

Models are expected to generate responses based on these inputs.

📁 Data Format

The dataset is stored as a JSON file:

StratMemBench.json

Each entry has the following structure:

{
  "id": "unique_id",
  "query": "...",
  "query_time": "...",
  "history": "...",
  "roles": {
    "human": "...",
    "virtual_person": "..."
  },
  "memory": {
    "must": [{"fact": "..."}],
    "nice": [{"fact": "..."}],
    "irr": [{"fact": "..."}]
  },
  "virtual_person_persona": {
    "virtual_person_name": "...",
    "summary": "...",
    "traits": [...],
    "style": [...],
    "domains": [...],
    "values": [...]
  }
}

🔍 Field Description

id

Unique identifier for each instance.

query

The current user question.

query_time

Timestamp of the query.

history

Dialogue history (may be empty).

roles

Defines the participants:

human: user name
virtual_person: character name

memory

A set of candidate memory items:

must: essential information
nice: optional supportive information
irr: irrelevant information

Each memory item is a short factual statement: {"fact": "..."}

virtual_person_persona

Describes the virtual character, including:

summary
personality traits
speaking style
domains of knowledge
values

🧪 Example

{
  "query": "When did you last go swimming with your kids?",
  "memory": {
    "must": [
      {"fact": "Melanie went swimming with her kids on 2023-05-08."}
    ],
    "nice": [],
    "irr": [
      {"fact": "Melanie painted a lake sunrise in 2022."}
    ]
  }
}

📥 Usage

Load the dataset

import json

with open("StratMemBench.json", "r") as f:
    data = json.load(f)

print(len(data))

Typical input construction

[Persona] ...

[Dialogue History] ...

[Memory Pool] ...

[User Query] ...

⚠️ Notes

Memory items are provided as unlabeled text during inference (no role annotations should be exposed to models).
The dataset is designed for single-turn generation tasks.
All data is synthetic or anonymized.

📜 License

(To be added)

📬 Contact

(To be added)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
me.py		me.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📦 StratMem-Bench Dataset

📊 Overview

📁 Data Format

🔍 Field Description

id

query

query_time

history

roles

memory

virtual_person_persona

🧪 Example

📥 Usage

Load the dataset

Typical input construction

⚠️ Notes

📜 License

📬 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

📦 StratMem-Bench Dataset

📊 Overview

📁 Data Format

🔍 Field Description

id

query

query_time

history

roles

memory

virtual_person_persona

🧪 Example

📥 Usage

Load the dataset

Typical input construction

⚠️ Notes

📜 License

📬 Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages