
Update .gitignore to exclude virtual environment directories and enhance documentation on adding datasets with h5py #2032

Open: wants to merge 1 commit into base: dev

Conversation

bendichter (Contributor) commented Feb 6, 2025

Motivation

Added a new section to the editing tutorial demonstrating how to add custom datasets using h5py. This enhances the documentation by showing users how to add new datasets to existing groups in NWB files when the standard PyNWB API does not provide direct methods for doing so. The example shows adding a genotype dataset to the Subject object, a common use case in neuroscience data management. This came up recently in the following Slack message: https://nwb-users.slack.com/archives/C5XKC14L9/p1738791649800719

I don't think there is a way to do this directly with pynwb. Is that right, @rly?

Adrian Duszkiewicz
Hi all 🙂 I'm wondering about possible solutions to the problem of blinding some of the 'subject' information in an NWB file.
We are moving our ephys processing pipeline to the NWB format, and in our case the experimenter is blind to the genotype of the animal during pre-processing and initial data analysis. Ideally this info would not be accessible to them in the NWB files they are working with and would only be added at a later stage. However, as I understand it, the subject info can only be added when creating the NWB file and cannot be edited later. Is there anything obvious I'm missing, or is recreating the whole NWB file from scratch after unblinding the only solution to this issue?
Thank you in advance for all the tips!
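The h5py approach the PR motivation describes can be sketched as follows. This is a minimal, self-contained sketch, not the tutorial's actual text: the file name and genotype value are made up, and it assumes the standard NWB HDF5 layout, in which the Subject is stored at /general/subject. Note that writing with h5py directly bypasses PyNWB's validation.

```python
import h5py

# Hypothetical file name; assumes the NWB file stores the Subject at
# /general/subject (the standard NWB HDF5 layout) and that no genotype
# dataset was written at creation time.
with h5py.File("blinded.nwb", "a") as f:
    subject = f.require_group("/general/subject")
    if "genotype" not in subject:
        # Example value; adding a dataset this way bypasses PyNWB validation
        subject.create_dataset("genotype", data="Sst-IRES-Cre")

# Read the new dataset back to confirm it was written
with h5py.File("blinded.nwb", "r") as f:
    genotype = f["/general/subject/genotype"][()].decode()
```

Because the dataset is created outside PyNWB, it is the user's responsibility to match the names and dtypes the NWB schema expects for that group.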


codecov bot commented Feb 6, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.73%. Comparing base (716e3ce) to head (730ed7b).

Additional details and impacted files
@@           Coverage Diff           @@
##              dev    #2032   +/-   ##
=======================================
  Coverage   91.73%   91.73%           
=======================================
  Files          27       27           
  Lines        2722     2722           
  Branches      710      710           
=======================================
  Hits         2497     2497           
  Misses        149      149           
  Partials       76       76           
Flag         Coverage Δ
integration  72.96% <ø> (ø)
unit         82.29% <ø> (ø)

Flags with carried forward coverage won't be shown.


rly (Contributor) commented Feb 7, 2025

I don't think there is a way to do this directly with pynwb. Is that right, @rly?

If the dataset has not been created yet, it can be added using PyNWB, but subject.set_modified() has to be called. (If it has already been written, it can be replaced only through h5py, because it is a scalar dataset and PyNWB does not provide direct access to the h5py.Dataset object for scalar datasets; the replacement must have the same shape.)

import pynwb
from pynwb.testing.mock.file import mock_NWBFile

# Write a file whose Subject has no genotype yet
nwb = mock_NWBFile()
nwb.subject = pynwb.file.Subject(subject_id="test")
with pynwb.NWBHDF5IO("test.nwb", "w") as io:
    io.write(nwb)

# Reopen in append mode and add the genotype dataset
with pynwb.NWBHDF5IO("test.nwb", "a") as io:
    nwb = io.read()
    nwb.subject.genotype = "test"
    nwb.subject.set_modified()  # mark the container dirty so the change is written
    io.write(nwb)

# Verify the new dataset round-trips
with pynwb.NWBHDF5IO("test.nwb", "r") as io:
    nwb = io.read()
    print(nwb.subject.genotype)  # prints "test"

set_modified should really be called any time a field setter executes successfully, but there might be some strange edge cases. I'll look into that in hdmf-dev/hdmf#1244
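For the already-written case mentioned above, replacing the scalar dataset in place has to go through h5py. A minimal, self-contained sketch (the file name and genotype values are hypothetical; it assumes the standard /general/subject/genotype path and bypasses PyNWB validation):

```python
import h5py

# Create a stand-in file with a scalar genotype dataset already written.
# In practice this would be an NWB file produced by PyNWB.
with h5py.File("unblind.nwb", "w") as f:
    f.create_dataset("/general/subject/genotype", data="unknown")

# Overwrite the scalar dataset in place; the new value must be
# compatible with the existing dataset's dtype and shape.
with h5py.File("unblind.nwb", "r+") as f:
    ds = f["/general/subject/genotype"]
    ds[()] = "Sst-IRES-Cre"  # direct h5py write, PyNWB is bypassed

# Read the replaced value back
with h5py.File("unblind.nwb", "r") as f:
    genotype = f["/general/subject/genotype"][()].decode()
```

Since PyNWB never sees this change, no set_modified() call is needed, but the file should be re-validated afterwards.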

bendichter (Contributor, Author) commented:
OK, we should definitely add this to the editing tutorial. Can optional Attributes and optional non-scalar Datasets also be added in this way?
