
What is data science? - Posted By ishan (ishan09) on 15th Jan 25 at 2:21pm
Data science is a multidisciplinary field that combines statistics, computer science, and domain-specific knowledge to extract meaningful insights from data. Its main goal is to turn raw data into actionable insights, predictions, and decisions. Here's a breakdown of what data science involves:

1. Data Collection & Acquisition
What it involves: Gathering data from various sources like databases, web scraping, IoT sensors, APIs, surveys, or existing datasets.
Purpose: To have relevant and sufficient data to work with, which is essential for building models and making decisions.
2. Data Cleaning & Preprocessing
What it involves: Preparing the data for analysis by handling missing values, detecting and treating outliers (removing them only when they are genuine errors), correcting mistakes, and transforming data into a usable format.
Purpose: Raw data is often noisy and incomplete, so cleaning and preprocessing lead to more reliable analysis and better model performance.
3. Exploratory Data Analysis (EDA)
What it involves: Using statistical techniques and visualizations (like histograms, scatter plots, and box plots) to understand the distribution, patterns, and relationships within the data.
Purpose: To uncover underlying patterns, trends, and anomalies that can inform further analysis or modeling.
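
To make steps 2 and 3 a bit more concrete, here is a minimal sketch in plain Python (standard library only). Everything in it is an assumption for illustration: the sample values in raw_heights_cm are made up, the helper names (impute_median, drop_outliers, summarize, histogram_counts) are hypothetical, and median imputation plus the 1.5 x IQR rule are just one common choice among many, not the only way to clean data.

```python
import statistics

# Made-up sample data for illustration; None marks a missing value
# and 999.0 stands in for an obvious data-entry error.
raw_heights_cm = [170.0, 165.5, None, 180.2, 172.3, 999.0, 168.8, None, 175.1]


def impute_median(values):
    """Step 2: replace missing entries (None) with the median of the observed ones."""
    observed = [v for v in values if v is not None]
    median = statistics.median(observed)
    return [median if v is None else v for v in values]


def drop_outliers(values, k=1.5):
    """Step 2: keep only values inside the fences Q1 - k*IQR and Q3 + k*IQR."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def summarize(values):
    """Step 3: basic descriptive statistics for a first look at the data."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),
    }


def histogram_counts(values, bin_width):
    """Step 3: count values per bin; each key is the lower edge of a bin."""
    counts = {}
    for v in values:
        edge = int(v // bin_width) * bin_width
        counts[edge] = counts.get(edge, 0) + 1
    return dict(sorted(counts.items()))


imputed = impute_median(raw_heights_cm)
cleaned = drop_outliers(imputed)
summary = summarize(cleaned)
histogram = histogram_counts(cleaned, bin_width=5)

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value}")
    # A crude text histogram: one '#' per value in each bin.
    for edge, count in histogram.items():
        print(f"{edge}-{edge + 5}: {'#' * count}")
```

In real projects this kind of work is usually done with libraries such as pandas and matplotlib, but the idea is the same: fill or drop missing values, deal with suspicious extremes, then look at summary numbers and distributions before building any model.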

