Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 8 additions & 1 deletion data/docs/lhcb-about/lhcb-about.md
Original file line number Diff line number Diff line change
@@ -1 +1,8 @@
The Large Hadron Collider (LHC) is not only a tool with which to search for exotic new particles, but also a factory of particles whose existence is in no doubt but whose precise properties are not yet known well enough. LHCb, unlike the other experiments ATLAS and CMS, which search for new particles produced directly in the proton–proton collisions, is devoted to the study of well established particles, the beauty (B) and charmed (C) mesons. These particles, which comprise a bottom or charm quark or antiquark plus one other quark with a different “flavour”, only exist for about a trillionth of a second before decaying into lighter particles. But subtle quantum effects called loops, in which virtual particles are temporarily created by borrowing energy from the vacuum, can influence the behaviour of B and C mesons and give researchers a handle on particles that are too heavy to be produced directly.
The Large Hadron Collider beauty (LHCb) experiment is one of the four big experiments at the Large Hadron Collider (LHC). In contrast to its siblings, LHCb is an asymmetric detector that covers events at forward rapidities ($2<\eta<5$). The spirit behind this particular design is to exploit the large production of heavy-flavour quarks in the forward rapidity region.

The **main goal** of LHCb is to study sources of CP violation, via rare decays of *B* (beauty) and *D* mesons, in order to explain the current asymmetry of matter-antimatter in the universe.

The [LHCb Open Data Guide](https://lhcb-opendata-guide.web.cern.ch/) contains information about:

- Analysis of datasets of Run 1 that are found in the Open Data Portal.
- Usage of the LHCb Ntupling Service.
36 changes: 12 additions & 24 deletions data/docs/lhcb-getting-started/lhcb-getting-started.md
Original file line number Diff line number Diff line change
@@ -1,27 +1,15 @@
In 2022, LHCb has released the first 200 terabytes of the data via CERN OpenData Portal, making it available to the public.
By the end of 2023, LHCb released all of its Run I data, via CERN Open Data Portal, to the general public. The data comes in `.DST` and `.MDST` format which is the same format used by LHCb internally.

- The data comes in `.DST` and `.MDST` format, the same format is used by LHCb internally.
- Every data set released is narrated by an "Open Data Record" accessible through Open Data Portal.
- Open Data Records contain various bits of information about the selected data set (this is called metadata). An example of the types of metadata provided in the record is:
- Number of events in the dataset
- Number of files in the dataset
- Combined size in TB of the dataset
- Production ID
- Production Type
- Detector conditions (condb, dddb tags)
- List of Trigger Configuration Keys (TCKs)
- Scripts used for each production step
- List of Logical File Names (LFNs) on [LHCb DIRAC](https://lhcb-dirac.readthedocs.io/en/latest/).

The metadata provided should help the user to navigate, select and work with with LHCb Open Data.

Index of files is accessible both via a GUI or as a machine readable file.

- Some instructions on how to use open data are pointed out in the records themselves.
- As well as the data records, an extensive list of LHCb stripping lines and their descriptions is provided as well.
- After selecting the desired stream, a stripping line description can be followed to obtain a number of cuts/conditions which could be used to filter the data further.
- Data can be accessed directly (eg. using [xrootd](https://xrootd.slac.stanford.edu/) protocol) or downloaded locally.
- It is suggested to further filter and categorize the data by writing out smaller data files in `.root` format (called ntuples).
- This is done in LHCb with the help of software called [DaVinci](https://lhcbdoc.web.cern.ch/lhcbdoc/davinci/).
- 'DaVinci' and other LHCb Software is available through [CVMFS](https://cernvm.cern.ch/fs/).
- Some initial instructions on working with DaVinci are provided in [LHCb Starterkit](https://lhcb.github.io/starterkit-lessons/first-analysis-steps/minimal-dv-job.html) web page.
- Number of events in the dataset
- Number of files in the dataset
- Combined size in TB of the dataset
- Production ID
- Production Type
- Detector conditions (condb, dddb tags)
- List of Trigger Configuration Keys (TCKs)
- Scripts used for each production step
- List of Logical File Names (LFNs) on [LHCb DIRAC](https://lhcb-dirac.readthedocs.io/en/latest/)

The [LHCb Open Data Guide](https://lhcb-opendata-guide.web.cern.ch/) and the metadata provided on the Open Data Portal should help the user to navigate, select, and work with LHCb Open Data.