FeatureRequest: Assigning what is metadata
versus feature`
#510
Labels
enhancement
New feature or request
metadata
versus feature`
#510
Feature type
Add new functionality
Change existing functionality
General description of the proposed functionality
When columns are outputted from CellProfiler and processed in CytoTable, we get what can be considered
metadata
(e.g., plate and well) andfeature
, which are currently based on the prefix of the name of the column (e.g., features are prefixed with the compartment ornuclei
,cells
, andcytoplasm
).CellProfiler takes many measurements, but some I have noticed tend to not reflect the morphology of the cells that are included as
features
due to the naming convention that pycytominer expects as feature versus metadata. This is reflected in the functioninfer_cp_features
, which dictates a column as a feature if it starts with the compartment.This has become an issue that I have noticed, as I am finding center x,y coordinates or parent/child indexes being included in my feature space. I have had to manually add a "Metadata_" prefix to these columns to avoid this issue, but I am finding that I am not catching every instance of these "non-morphological features".
There is a discussion to be had if these features should be considered metadata or not. Personally, I think that we should have a simple if statement when running
annotate, and have a parameter for if
TRUEthen it will find columns with
Location,
Parentand
Childin the name, then it should be defined as
metadata`.Feature example
I am not a fan of the parameter name here, but it gives the general idea.
In the annotate code, there could be something like:
Alternative Solutions
This issue is very complicated and could require a more in-depth solution that will be more robust.
This issue is open for all to discuss and come up with a solution that works for all.
CC: @MikeLippincott, @MattsonCam, @axiomcura, @d33bs, @gwaybio
The text was updated successfully, but these errors were encountered: