pasteur.extras.views.boston.BostonView#

class pasteur.extras.views.boston.BostonView(**_)[source]#

Attributes

dataset

The name of the View's Dataset.

dataset_tables

Returns the dataset tables required by the View.

deps

Defines the Tables of the View and their Dataset dependencies, ex.:

fit_global

name

parameters

If true, transformers and encoders for this view will be fit on the global dataset.

tables

Returns the table names of the view.

trn_deps

Methods

filter_table(name, keys, **tables)

Filters the table using the keys provided.

ingest(name, **tables)

query(name, **tables)

Equivalent to ingest in Dataset.

split_keys(keys, req_splits, splits, ...)

Takes the key frame and splits it into the portions specified by splits.

dataset: str = 'boston'#

The name of the View’s Dataset. If the Dataset is not loaded, the View is disabled.

property dataset_tables#

Returns the dataset tables required by the View.

deps: dict[str, list[str]] = {'table': ['table']}#

Defines the Tables of the View and their Dataset dependencies, ex.:

`python {"table1": ["master_table1", "master_table2"], "table2": ["master_table3"]} `

filter_table(name, keys, **tables)#

Filters the table using the keys provided.

fit_global: bool = False#
ingest(name, **tables)#
name: str = 'boston'#
parameters: dict[str, Any] | str | None = '/home/docs/checkouts/readthedocs.org/user_builds/pasteur/checkouts/latest/src/pasteur/extras/views/boston/parameters.yml'#

If true, transformers and encoders for this view will be fit on the global dataset. Resolves encoding errors that stem from sampling the partial view. When true, subsampling the view is not possible during transformation and encoding, which may add significant overhead.

query(name, **tables)#

Equivalent to ingest in Dataset.

split_keys(keys, req_splits, splits, random_state)#

Takes the key frame and splits it into the portions specified by splits. Then, return the split with names in req_splits.

Should produce the same results each run regardless of the value of split, because it will be ran once per split.

Return type:

dict[str, Union[DataFrame, LazyDataset[DataFrame]]]

property tables#

Returns the table names of the view.

trn_deps: dict[str, list[str]] = {}#