Introduction

PipelineX is a Python package designed to make Machine Learning projects efficient with modular, reusable, and easy-to-use features for experimentation.

Please refer here to find out how PipelineX differs from other pipeline/workflow packages: Airflow, Luigi, Gokart, Metaflow, and Kedro.

PipelineX provides the following options, which can be used separately or together.

  • HatchDict option, which provides enhancements to YAML/JSON that are useful for parameter management, summarized as follows:

    • Import-less Python object: Include (nested) Python classes and functions in a YAML/JSON file

    • Anchor-less aliasing: Look up another key in the same YAML/JSON file

    • Python expression in YAML/JSON files

  • Kedro context to define Kedro pipelines in a YAML file with more options

  • Integration of Kedro with MLflow as Kedro DataSets and Hooks. Note: You do not need to install MLflow if you do not use it.

  • Integration of Kedro with additional Python packages as Kedro DataSets, Hooks, and wrappers.
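
To make the first two HatchDict ideas above more concrete (import-less Python objects and anchor-less aliasing), here is a minimal standard-library sketch of the general technique. It is not PipelineX's implementation and does not use its API: the `hatch`, `load_object`, and `lookup` helpers are hypothetical, and the marker keys `=` (object path), `_` (positional arguments), and `$` (key lookup) are assumptions chosen for illustration, so please check the HatchDict documentation for the actual syntax. Python expressions in YAML/JSON are not covered here.

```python
import importlib
import json


def load_object(dotted_path):
    """Resolve a dotted path such as 'datetime.timedelta' to the object it names."""
    module_path, _, attr = dotted_path.rpartition(".")
    return getattr(importlib.import_module(module_path), attr)


def lookup(root, dotted_key):
    """Follow a dotted key such as 'train.days' through nested dicts."""
    node = root
    for part in dotted_key.split("."):
        node = node[part]
    return node


def hatch(node, root):
    """Recursively replace marker dicts with the values they describe."""
    if isinstance(node, list):
        return [hatch(item, root) for item in node]
    if isinstance(node, dict):
        if "$" in node:
            # Aliasing (assumed marker): reuse the value stored under another key.
            return hatch(lookup(root, node["$"]), root)
        if "=" in node:
            # Import-less object (assumed marker): import the target and call it.
            target = load_object(node["="])
            args = [hatch(arg, root) for arg in node.get("_", [])]
            kwargs = {
                key: hatch(value, root)
                for key, value in node.items()
                if key not in ("=", "_")
            }
            return target(*args, **kwargs)
        return {key: hatch(value, root) for key, value in node.items()}
    return node


# JSON is used here only because the standard library can parse it.
config = json.loads("""
{
  "train": {"days": 7},
  "window": {"=": "datetime.timedelta", "days": {"$": "train.days"}},
  "ratio": {"=": "fractions.Fraction", "_": [3, 4]}
}
""")

params = hatch(config, config)
print(params["window"])  # 7 days, 0:00:00
print(params["ratio"])   # 3/4
```

In this sketch the configuration file, rather than the Python code, names the classes to instantiate, so swapping one object for another only requires editing the parameter file.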