Toolbox for analysis of capillary electrophoresis foot printing data
BoXFP is a platform independent software package designed for quick, efficient, and reliable analysis of capillary electrophoresis data of footprinted nucleic acid samples.
BoXFP has been developed as an add-on package to the QuShape software packages developed by Dr Fethullah Karabiber of the Weeks Laboratory (https://weekslab.com/software/). It utilises several algorithms to effectively provide nucleotide specific reactivity information from both single experiments and over several replicates. Capillary electrophoresis data from SHAPE, Hydroxyl-radical and enzyme-based reactivity probing experiments can all be analysed using the package. Unlike previous reactivity profile calculation software, BoXFP utilises capillary electrophoresis dye size standards to effectively determine nucleotide positions in the reactivity profiles.
It should be noted that BoXFP has been designed specifically around viral RNA X-ray foot printing, and as such a lot of the features regarding data labelling in the current release relate to viruses. Future releases will be designed more generally around reactivity profile generation using capillary electrophoresis.
BoXFP
package requires only a standard computer with enough RAM to support the in-memory operations.
This package is supported for macOS and Linux. The package has been tested on the following systems:
- macOS: Mojave (10.14.1)
- Linux: Ubuntu 16.04
BoXFP
runs on python 2.7 and requires the following packages:
NumPy
: 1.16.3Pandas
: 0.24.2Matplotlib
: 2.2.3Pickle
: 72223SciPy
: 1.2.1Bio
: 1.73pyqtgraph
: 0.10.0
BoXFP can be installed from source from the GitHub Repo
_.
You can either clone the public repository:
.. code-block:: console
$ git clone git://github.com/PsamClark/BoXFP
Or download the tarball
_:
.. code-block:: console
$ curl -OL https://github.com/PSamClark/BoXFP
Given the requirements outlined above are met installation time will take less than 30 seconds
Reactivity profile determination using BoXFP can be broken down into 3 steps:
- Preprocessing
- Position determination
- Reactivity profile calculation
The preprocessing step involves unpacking of the chromatograph data, signal quality determination, and data tidying processes such as smoothing and baseline and decay correction.
The position determination step involves analysis of sequencing ladders in the chromatograph to determine the exact position in the sample sequence of the reactivity profiles.
The reactivity profile calculation step uses replicate datasets for the X-ray footprinted samples and background samples, to generate an average, background corrected reactivity profile with errors. Information on normalisation factors and replicate correlations is also generated.
To reduce the errors created by improper handling of the chromatograph data during preprocessing, BoXFP utilises an innovative windowing method, in which preprocessing and reactivity calculation is performed on several different overlapping regions of the chromatographs, with the resulting reactivity profiles being compared with each other and a consensus generated.
BoXFP is designed to work with 1 to 3 replicates in both the X-ray footprinted and background samples.
General rules of usage can be found in the [general rules] (https://github.com/PsamClark/BoXFP/tree/python3_update/Tutorial/General_rules.md) section of the tutorial.
For greater user input options, and for processing multiple datasets at once, the BoXFP
module can be imported.
For more information about scripting using the imported BoXFP module see the Tutorial.
-
The first point of contact for any issues: https://github.com/PsamClark/BoXFP/issues
-
Further information can be obtained from the following email address: [email protected]