Contents Menu Expand
Data Profiler
v0.4.7
Light Logo Dark Logo
Data Profiler
v0.4.7

Getting Started:

  • Intro
  • Install
  • Data Readers
  • Profiler
  • Labeler (Sensitive Data)

User Guide:

  • Examples
    • Overview of Data Profiler
    • Data Reader
    • Profiler
    • Labeler
    • Adding Models to a Labeler Pipeline
  • API
    • Data Readers
      • Avro Data
      • Base Data
      • CSV Data
      • Data
      • Data Utils
      • JSON Data
      • Parquet Data
      • Structured Mixins
      • Text Data
    • Profilers
      • Helpers
        • Report Helpers
      • Base Column Profilers
      • Categorical Column Profile
      • Column Profile Compilers
      • Data Labeler Column Profile
      • Datetime Column Profile
      • Float Column Profile
      • Histogram Utils
      • Int Column Profile
      • Numerical Column Stats
      • Order Column Profile
      • Profile Builder
      • Profiler Options
      • Text Column Profile
      • Unstructured Data Labeler Column Profile
      • Unstructured Labeler Profile
      • Unstructured Text Profile
      • Utils
    • Labelers
      • Base Data Labeler
      • Base Model
      • Character Level Cnn Model
      • Classification Report Utils
      • Data Labelers
      • Data Processing
      • Labeler Utils
      • Regex Model
    • Validators
      • Base Validators

Community:

  • Changelog
  • Feedback
  • GitHub

Examples¶

These examples provide a more in-depth look into the details of the Data Profiler library.

Basics¶

  • Overview of Data Profiler
    • Basic Usage of the Data Profiler
    • Data reader class
    • Profiler options
    • Update profiles
    • Merge profiles
    • Conclusion
  • Data Reader
    • Automatically reading and detecting data
    • Specifying detection options of Data and loading pandas.DataFrame
    • Accessing data and attributes
    • Checking data file types with is_match
    • Reloading data after altering options with reload
    • A deeper dive into CSVData
    • Intro to the CSVData data reader
    • CSVData Options
    • Options: delimiter and quotechar
    • Options: header
    • Options: data_format
    • Options: selected columns
  • Profiler
    • Reporting
    • Profiler options
    • Updating Profiles
    • Merging Profiles
    • Saving and Loading a Profile
  • Labeler
    • What is the Labeler
    • Structured Data Prediction
    • Unstructured Data Prediction
    • Train the Labeler from Scratch
    • Transfer Learning a Labeler
    • Saving and Loading a Labeler
    • Building a Labeler from the Ground Up
  • Adding Models to a Labeler Pipeline
    • Dataset
    • Implement a new character-level LSTM model
    • Integrate the new LSTM model to the DataLabeler
Next
Data Profiler - What’s in your data?
Previous
Labeler (Sensitive Data)
Copyright © 2020, Jeremy Goodsitt, Austin Walters, Anh Truong, Grant Eden, and Chris Wallace | Built with Sphinx and @pradyunsg's Furo theme. | Show Source
Contents
  • Examples
    • Basics