|
Dec 26, 2024
|
|
|
|
DATA 7003 - DATA ANALYSIS This course is the third of four courses that count toward the IBM Data Science Professional Certificate. It offers a deep dive into data manipulation and analysis using Python and SQL. Learners will master creating relational databases on the Cloud, working with tables, and constructing various SQL statements. Advanced SQL techniques like views, transactions, stored procedures, and joins will be discussed for constructing complex queries. The course emphasizes the practical application of Python for data cleaning and preparation, with a focus on handling missing values, formatting, normalizing, and binning data. Learners will engage in exploratory data analysis using key libraries such as Pandas, Numpy, and Scipy, developing the skills to manipulate data using dataframes, summarize data, understand data distribution, and create data pipelines. The course culminates in building and evaluating regression models using the machine learning scikit-learn library, preparing learners for real-world predictive analytics and decision-making.
Prerequisites & Notes DATA 6003
Credits: 3
Add to Portfolio (opens a new window)
|
|