1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
Updated
Jul 16, 2024 - Python
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Deep data introspection for the National Solar Radiation Database
This project delves into Swiggy's restaurant data, exploring various aspects and trends using Pandas and exploratory data analysis techniques.
Always know what to expect from your data.
This project aims to show capabilities of Alteryx software on the Kaggle data named "Superstore". The project includes creation of an analytic application, spatial analysis, creating report visualisations, exploratory data analysis, and a unique Alteryx approach to the traveling salesman problem.
To analyze the factors influencing student acceptance at the University of Rochester
Descriptive And Inferential Data Analysis Using Python Projects
This project uses the Reaction Time Survey dataset to develop a linear regression model for accurately predicting student reaction times based on various predictors. Tech: R (RStudio)
Instructional materials (course files) for the BBT4206 course (Business Intelligence II) using R. Topic: Exploratory (and Explanatory) Data Analysis (EDA).
This repository showcases my data science skills, including EDA, Python, data cleaning/wrangling, and visualization. It demonstrates my problem-solving abilities through interactive insights. Explore the notebooks and provide feedback.
A repository of Data Science Principles' demonstration of workflows and how to's in analyzing datasets.
This repository is a collection of code, documentation, and other resources that support the management and automation of a Data Science project.
EXPLORATORY DATA ANALYSIS (Exploratory Data Analysis refers to the critical process of performing initial investigations on data so as to discover patterns to spot anomalies to test hypothesis and to check assumptions with the help of summary statistics and graphical representations.)
Rating severity and Multilingual toxicity classification using advanced transformer models with TPU support for faster training and robust performance.
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
This repository contains a comprehensive analysis of time series data (stock prices), forecasted using various statistical and deep learning models.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
projeto de machine learning para classificação binária de doenças cardíacas.
Data Science Roadmap from Scratch to Professional
We harness the power of machine learning and data analysis to real challenges in the copper industry. Our documentation covers data preprocessing, feature engineering, classification, regression, and model selection. Discover how we've optimized predictive capabilities for manufacturing solutions.
Add a description, image, and links to the exploratory-data-analysis topic page so that developers can more easily learn about it.
To associate your repository with the exploratory-data-analysis topic, visit your repo's landing page and select "manage topics."