Dec 26, 2024  
2023-2024 Southeastern University - Graduate Catalog [ARCHIVED CATALOG]


DATA 7003 - DATA ANALYSIS


This course is the third of four that count toward the IBM Data Science Professional Certificate. It offers a deep dive into data manipulation and analysis using Python and SQL. Learners will create relational databases in the cloud, work with tables, and construct a variety of SQL statements. Advanced SQL techniques, including views, transactions, stored procedures, and joins, are covered as tools for building complex queries. The course emphasizes the practical application of Python to data cleaning and preparation, with a focus on handling missing values and on formatting, normalizing, and binning data. Learners will carry out exploratory data analysis with key libraries such as pandas, NumPy, and SciPy, developing the skills to manipulate data in DataFrames, summarize data, understand data distributions, and create data pipelines. The course culminates in building and evaluating regression models with the scikit-learn machine learning library, preparing learners for real-world predictive analytics and decision-making.

Prerequisites & Notes
DATA 6003

Credits: 3


