pasteur.extras.views.texas.TexasBillionView#
- class pasteur.extras.views.texas.TexasBillionView(**_)[source]#
Attributes
The name of the View's Dataset.
Returns the dataset tables required by the View.
Defines the Tables of the View and their Dataset dependencies, ex.:
If true, transformers and encoders for this view will be fit on the global dataset.
Returns the table names of the view.
Methods
filter_table(name, keys, **tables)Filters the table using the keys provided.
ingest(name, charges)query(name, **tables)Equivalent to ingest in Dataset.
split_keys(keys, req_splits, splits, ...)Takes the key frame and splits it into the portions specified by splits.
- dataset: str = 'texas'#
The name of the View’s Dataset. If the Dataset is not loaded, the View is disabled.
- property dataset_tables#
Returns the dataset tables required by the View.
- deps: dict[str, list[str]] = {'table': ['charges']}#
Defines the Tables of the View and their Dataset dependencies, ex.:
`python {"table1": ["master_table1", "master_table2"], "table2": ["master_table3"]} `
- filter_table(name, keys, **tables)#
Filters the table using the keys provided.
- fit_global: bool = False#
- name: str = 'texas_billion'#
- parameters: dict[str, Any] | str | None = '/home/docs/checkouts/readthedocs.org/user_builds/pasteur/checkouts/latest/src/pasteur/extras/views/texas/./parameters_billion.yml'#
If true, transformers and encoders for this view will be fit on the global dataset. Resolves encoding errors that stem from sampling the partial view. When true, subsampling the view is not possible during transformation and encoding, which may add significant overhead.
- query(name, **tables)#
Equivalent to ingest in Dataset.
- split_keys(keys, req_splits, splits, random_state)#
Takes the key frame and splits it into the portions specified by splits. Then, return the split with names in req_splits.
Should produce the same results each run regardless of the value of split, because it will be ran once per split.
- Return type:
dict[str,Union[DataFrame,LazyDataset[DataFrame]]]
- property tables#
Returns the table names of the view.
- tabular = True#
- trn_deps: dict[str, list[str]] = {}#