CyrilJl/README.md


Welcome to my GitHub! 👋

My name is Cyril. I am an applied mathematics engineer interested in machine learning, data assimilation, time series and, more generally, cartographic data. On my GitHub you will find several Python packages, available on PyPI and conda-forge:

Machine Learning
  • optimask: For managing missing data in arrays. Finds the largest NaN-free sub-matrix (not necessarily contiguous), so that as much valid data as possible is kept before fitting a model.
  • timefiller: For imputing missing data in a block of correlated time series, or for forecasting with covariates that contain missing data. An easy-to-use and efficient package, built on optimask.
  • apyxl: A simple wrapper around xgboost, shap, and hyperopt to produce explainable non-linear regressions in one line of code. apyxl is not intended for production but rather as an aid to understanding or a first approach to a dataset.
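To make the problem optimask addresses more concrete, here is a minimal, dependency-free sketch of a naive greedy heuristic: it repeatedly drops the row or column with the largest share of NaN until none remain. This is only an illustration under my own assumptions. The function name `greedy_nan_free` is hypothetical, it is not optimask's API or algorithm, and a greedy pass is not guaranteed to find the largest NaN-free sub-matrix.

```python
import math
from typing import List, Tuple


def greedy_nan_free(grid: List[List[float]]) -> Tuple[List[int], List[int]]:
    """Hypothetical helper: return (rows, cols) indexing a NaN-free sub-matrix.

    Naive greedy heuristic, not optimask's method: drop the row or column
    with the highest fraction of NaN until no NaN is left.
    """
    rows = list(range(len(grid)))
    cols = list(range(len(grid[0]))) if grid else []
    while rows and cols:
        # Count NaN per remaining row and per remaining column.
        row_nans = {r: sum(math.isnan(grid[r][c]) for c in cols) for r in rows}
        col_nans = {c: sum(math.isnan(grid[r][c]) for r in rows) for c in cols}
        worst_row = max(rows, key=lambda r: row_nans[r])
        worst_col = max(cols, key=lambda c: col_nans[c])
        if row_nans[worst_row] == 0:
            break  # no NaN left in the remaining sub-matrix
        # Drop whichever of the two has the larger fraction of NaN.
        if row_nans[worst_row] / len(cols) >= col_nans[worst_col] / len(rows):
            rows.remove(worst_row)
        else:
            cols.remove(worst_col)
    return rows, cols


nan = float("nan")
grid = [
    [1.0, nan, 3.0],
    [4.0, 5.0, 6.0],
    [7.0, 8.0, nan],
]
rows, cols = greedy_nan_free(grid)
sub = [[grid[r][c] for c in cols] for r in rows]
```

The kept rows and columns do not need to be adjacent, which is what "not necessarily contiguous" means here.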
Large Data
  • batchstats: An extension of numpy for computing statistics on data sets too large to fit in memory, or on data arriving in batches.
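The idea of computing statistics over data that arrives in batches can be sketched in a few lines of plain Python. The class below keeps a running count, mean, and sum of squared deviations, and merges each new batch into that state with the standard pairwise update, so no batch has to be kept in memory. The class `RunningMeanVar` and its methods are hypothetical names chosen for illustration; this is not batchstats' API.

```python
from typing import Iterable


class RunningMeanVar:
    """Hypothetical helper: accumulates mean and variance batch by batch."""

    def __init__(self) -> None:
        self.n = 0       # number of samples seen so far
        self.mean = 0.0  # running mean
        self.m2 = 0.0    # running sum of squared deviations from the mean

    def update_batch(self, batch: Iterable[float]) -> "RunningMeanVar":
        values = list(batch)
        if not values:
            return self
        # Statistics of the incoming batch on its own.
        b_n = len(values)
        b_mean = sum(values) / b_n
        b_m2 = sum((v - b_mean) ** 2 for v in values)
        # Merge the batch into the accumulated state (pairwise update).
        total = self.n + b_n
        delta = b_mean - self.mean
        self.mean += delta * b_n / total
        self.m2 += b_m2 + delta ** 2 * self.n * b_n / total
        self.n = total
        return self

    def variance(self, ddof: int = 0) -> float:
        # ddof=0 gives the population variance, ddof=1 the sample variance.
        return self.m2 / (self.n - ddof)


stats = RunningMeanVar()
for batch in ([1.0, 2.0, 3.0], [4.0, 5.0], [6.0]):
    stats.update_batch(batch)
```

After the loop, `stats.mean` and `stats.variance()` match what a single pass over all six values would give, even though the values were seen in three separate batches.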
Weather Data ☁️

I aim to produce well-written, documented, and easy-to-use open-source packages. Do not hesitate to open an issue if you encounter a bug or difficulty. 🙂

Pinned

  1. TimeFiller

    A Python package for imputing missing data in time series, compatible with scikit-learn estimators

    Python

  2. MeteoFetch

    Easily fetch the latest Arome and Arpege forecasts from MétéoFrance

    Python 2

  3. OptiMask

    Efficiently finding the largest (not necessarily contiguous) sub-matrix without NaN in Python

    Jupyter Notebook 1 1

  4. BatchStats

    Python package for efficient, online statistical computations on streaming or large-scale data

    Python 1

  5. isd-fetch

    Python package for loading NOAA's ISD Lite dataset

    Python

  6. apyxl

    apyxl simplifies non-linear regressions/classifications and model explainability for all users

    Jupyter Notebook