pasteur#
Description
Pasteur is a library for performing end-to-end data synthesis. Gather your raw data and preprocess, synthesize, and evaluate it within a single project. Use the tools you’re familiar with (numpy, pandas, scikit-learn, scipy) and when your dataset grows, scale to out-of-core data by using Pasteur’s parallelization primitives without code changes or using different libraries.
Functions
|
Allows loading ipython functionality with load_ext pasteur |
Module-System Modules
Contains the module definitions in Pasteur, the base classes all Pasteur modules extend from. |
|
This module holds the definitions for the Dataset module, the initial entrypoint for data in Pasteur. |
|
This module holds the definitions for the View module, which appropriately preprocesses Datasets in Pasteur. |
|
Contains the definition for Transformer and ReferenceTransformer modules. |
|
Provides the base definition for Encoder modules |
|
Contains the base definition for Synth(esizer modules). |
|
This module provides the definitions for Metric Modules. |
Transformation-Related Modules
This module implements the base abstractions of Attribute and Value, which are used to encapsulate the information of complex types. |
|
Highly experimental and unpublished class for rebalancing Stratified Values with Differential Privacy. |
|
Contains the logic for handling multiple tables, and holding transformers and encoders. |
Other Modules
This module implements the base abstractions of Attribute and Value, which are used to encapsulate the information of complex types. |
|
Provides the cli entrypoint for kedro. |
|
This module holds the definitions for the Dataset module, the initial entrypoint for data in Pasteur. |
|
Provides the base definition for Encoder modules |
|
This package contains reference implementations for Pasteur modules, which may be extracted to a separate package in the future. |
|
Highly experimental and unpublished class for rebalancing Stratified Values with Differential Privacy. |
|
This module contains all kedro-related logic. |
|
This module provides a system for marginal calculation named MarginalOracle. |
|
This module contains a base class Metadata which is used to wrap, type, and check all View parameters provided to kedro. |
|
This module provides the definitions for Metric Modules. |
|
Contains the module definitions in Pasteur, the base classes all Pasteur modules extend from. |
|
Contains the base definition for Synth(esizer modules). |
|
Contains the logic for handling multiple tables, and holding transformers and encoders. |
|
Contains the definition for Transformer and ReferenceTransformer modules. |
|
Base utility module for Pasteur. |
|
This module holds the definitions for the View module, which appropriately preprocesses Datasets in Pasteur. |
Miscellaneous Modules
This module contains all kedro-related logic. |
|
Base utility module for Pasteur. |
|
This package contains reference implementations for Pasteur modules, which may be extracted to a separate package in the future. |
|
Provides the cli entrypoint for kedro. |