Skip to content
@capitalone

Capital One

We’re an open source-first organization — actively using, contributing to and managing open source software projects.

Pinned Loading

  1. DataProfiler DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    Python 1.4k 157

  2. datacompy datacompy Public

    Pandas, Polars, and Spark DataFrame comparison for humans and more!

    Python 430 124

  3. locopy locopy Public

    locopy: Loading/Unloading to Redshift and Snowflake using Python.

    Python 102 46

  4. rubicon-ml rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    Jupyter Notebook 125 30

  5. dataCompareR dataCompareR Public

    dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.

    R 75 24

  6. global-attribution-mapping global-attribution-mapping Public

    GAM (Global Attribution Mapping) explains the landscape of neural network predictions across subpopulations

    Python 32 23

Repositories

Showing 10 of 45 repositories
  • rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    capitalone/rubicon-ml’s past year of commit activity
    Jupyter Notebook 125 Apache-2.0 30 10 1 Updated Jul 16, 2024
  • acronym-decoder Public

    Acronym Decoder

    capitalone/acronym-decoder’s past year of commit activity
    TypeScript 43 Apache-2.0 26 0 5 Updated Jul 13, 2024
  • locopy Public

    locopy: Loading/Unloading to Redshift and Snowflake using Python.

    capitalone/locopy’s past year of commit activity
    Python 102 Apache-2.0 46 6 (1 issue needs help) 1 Updated Jul 12, 2024
  • datacompy Public

    Pandas, Polars, and Spark DataFrame comparison for humans and more!

    capitalone/datacompy’s past year of commit activity
    Python 430 Apache-2.0 124 11 (3 issues need help) 2 Updated Jul 11, 2024
  • federated-model-aggregation Public

    The Federated Model Aggregation (FMA) Service is a collection of installable python components that make up the generic workflow/infrastructure needed for federated learning.

    capitalone/federated-model-aggregation’s past year of commit activity
    Python 28 Apache-2.0 11 16 (1 issue needs help) 0 Updated Jul 8, 2024
  • DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    capitalone/DataProfiler’s past year of commit activity
    Python 1,389 Apache-2.0 157 66 (8 issues need help) 5 Updated Jul 6, 2024
  • dataCompareR Public

    dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.

    capitalone/dataCompareR’s past year of commit activity
  • ablation Public

    Evaluating XAI methods through ablation studies.

    capitalone/ablation’s past year of commit activity
    Python 15 Apache-2.0 5 2 0 Updated Jul 2, 2024
  • global-attribution-mapping Public

    GAM (Global Attribution Mapping) explains the landscape of neural network predictions across subpopulations

    capitalone/global-attribution-mapping’s past year of commit activity
    Python 32 Apache-2.0 23 8 0 Updated Jun 17, 2024
  • synthetic-data Public

    Generating complex, nonlinear datasets appropriate for use with deep learning/black box models which 'need' nonlinearity


    capitalone/synthetic-data’s past year of commit activity
    Python 42 Apache-2.0 27 3 1 Updated Jun 12, 2024