AutoNormalize is a Python library for automated datatable normalization. It allows you to build an EntitySet
from a single denormalized table and generate features for machine learning using Featuretools.
shell
pip install featuretools[autonormalize]
shell
pip uninstall autonormalize
auto_entityset
shell
auto_entityset(df, accuracy=0.98, index=None, name=None, time_index=None)
Creates a normalized entityset from a dataframe.
Arguments:
df
(pd.Dataframe) : the dataframe containing data
accuracy
(0 < float <= 1.00; default = 0.98) : the accuracy threshold required in order to conclude a dependency (i.e. with accuracy = 0.98, 0.98 of the rows must hold true the dependency LHS --> RHS)
index
(str, optional) : name of column that is intended index of df
name
(str, optional) : the name of created EntitySet
time_index
(str, optional) : name of time column in the dataframe.
Returns:
entityset
(ft.EntitySet) : created entity setfind_dependencies
shell
find_dependencies(df, accuracy=0.98, index=None)
Finds dependencies within dataframe with the DFD search algorithm.
Returns:
dependencies
(Dependencies) : the dependencies found in the data within the contraints providednormalize_dataframe
shell
normalize_dataframe(df, dependencies)
Normalizes dataframe based on the dependencies given. Keys for the newly created DataFrames can only be columns that are strings, ints, or categories. Keys are chosen according to the priority:
Returns:
new_dfs
(list[pd.DataFrame]) : list of new dataframesmake_entityset
shell
make_entityset(df, dependencies, name=None, time_index=None)
Creates a normalized EntitySet from dataframe based on the dependencies given. Keys are chosen in the same fashion as for normalize_dataframe
and a new index will be created if any key has more than a single attribute.
Returns:
entityset
(ft.EntitySet) : created EntitySetnormalize_entityset
shell
normalize_entityset(es, accuracy=0.98)
Returns a new normalized EntitySet
from an EntitySet
with a single entity.
Arguments:
es
(ft.EntitySet) : EntitySet with a single entity to normalizeReturns:
new_es
(ft.EntitySet) : new normalized EntitySetBumps ipython from 7.16.3 to 8.10.0.
Sourced from ipython's releases.
See https://pypi.org/project/ipython/
We do not use GitHub release anymore. Please see PyPI https://pypi.org/project/ipython/
15ea1ed
release 8.10.0560ad10
DOC: Update what's new for 8.10 (#13939)7557ade
DOC: Update what's new for 8.10385d693
Merge pull request from GHSA-29gw-9793-fvw7e548ee2
Swallow potential exceptions from showtraceback() (#13934)0694b08
MAINT: mock slowest test. (#13885)8655912
MAINT: mock slowest test.a011765
Isolate the attack tests with setUp and tearDown methodsc7a9470
Add some regression tests for this changefd34cf5
Swallow potential exceptions from showtraceback()Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase
.
Bumps nbconvert from 6.4.5 to 6.5.1.
Sourced from nbconvert's releases.
Release 6.5.1
No release notes provided.
6.5.0
What's Changed
- Drop dependency on testpath. by
@anntzer
in jupyter/nbconvert#1723- Adopt pre-commit by
@blink1073
in jupyter/nbconvert#1744- Add pytest settings and handle warnings by
@blink1073
in jupyter/nbconvert#1745- Apply Autoformatters by
@blink1073
in jupyter/nbconvert#1746- Add git-blame-ignore-revs by
@blink1073
in jupyter/nbconvert#1748- Update flake8 config by
@blink1073
in jupyter/nbconvert#1749- support bleach 5, add packaging and tinycss2 dependencies by
@bollwyvl
in jupyter/nbconvert#1755- [pre-commit.ci] pre-commit autoupdate by
@pre-commit-ci
in jupyter/nbconvert#1752- update cli example by
@leahecole
in jupyter/nbconvert#1753- Clean up pre-commit by
@blink1073
in jupyter/nbconvert#1757- Clean up workflows by
@blink1073
in jupyter/nbconvert#1750New Contributors
@pre-commit-ci
made their first contribution in jupyter/nbconvert#1752Full Changelog: https://github.com/jupyter/nbconvert/compare/6.4.5...6.5
7471b75
Release 6.5.1c1943e0
Fix pre-commit8685e93
Fix tests0abf290
Run black and prettier418d545
Run test on 6.x branchbef65d7
Convert input to string prior to escape HTML0818628
Check input type before escapingb206470
GHSL-2021-1017, GHSL-2021-1020, GHSL-2021-1021a03cbb8
GHSL-2021-1026, GHSL-2021-102548fe71e
GHSL-2021-1024Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase
.
AutoNormalize should be available for download via conda-forge.
Documentation on how to contribute a package: https://conda-forge.org/docs/maintainer/adding_pkgs.html Example PR of contributing a package: https://github.com/conda-forge/staged-recipes/pull/16033
If the input dataframe has been initialized with logical types, does autonormalize lose this typing information on the output dataframes?
release.md
file that follows our other librariesautonormalize
and some parts will not* Changes
* Remove python-dateutil dependency requirement (#48)
* Testing Changes
* Add ``test_version.py`` and release notes updated CI check (#49)
Thanks to the following people for contributing to this release:
@rwedge, @thehomebrewnerd
normalize_entity
to normalize_entityset
(#41)Thanks to the following people for contributing to this release: @dvreed77
Thanks to the following people for contributing to this release: @gsheni, @tuethan1999
automatic normalization automatic-normalization