data-cleaning

Here are 2,944 public repositories matching this topic...

This assignment contains information on contributions to the campaigns of US politicians at the state and federal levels. The contribution data was collected from various sources and covers the period 1989-2017.

  • Updated Jul 16, 2024
  • Jupyter Notebook

This repository contains the code, documentation, and datasets for an exploration of machine learning techniques that address class imbalance. The project investigates how methods such as ADASYN, KMeansSMOTE, and Deep Learning Generator affect classification performance, and demonstrates the benefits of pipelining.

  • Updated Jul 15, 2024
  • Jupyter Notebook
desbordante-core

Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also lets you run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

  • Updated Jul 16, 2024
  • C++

The project uses Python and libraries such as pandas, matplotlib, and seaborn to analyze credit card data and the factors that influence customer spending patterns and repayment behavior. The aim is to make revenue generation processes more effective and to provide insightful business suggestions for improvement.

  • Updated Jul 15, 2024
  • Jupyter Notebook
