Data Science is unfortunately unavailable

One other data science class is available. See the top choice below.

Course Details
Start Date:

This class isn't on the schedule at the moment, but save it to your Wish List to find out when it comes back!
If you're enrolled in an upcoming date, this simply means that date has now sold out.

Skips Nov 24
Class Level: All levels
Age Requirements: 18 and older
Average Class Size: 20

Flexible Reschedule Policy: This provider has flexible, free rescheduling for any in-person workshop. Please see the cancellation policy for more details.

What you'll learn in this data science course:

This is a part-time course.

In this 11-week course, students learn to build robust predictive models, test their validity, and clearly communicate the resulting insights.

Unit 1: Research Design and Exploratory Data Analysis

What is Data Science 
  • Describe course syllabus and establish the classroom environment 
  • Answer the questions: "What is Data Science? What roles exist in Data Science?" 
  • Define the workflow, tools and approaches data scientists use to analyze data
Research Design and Pandas 
  • Define a problem and identify appropriate data sets using the data science workflow 
  • Walk through the data science workflow using a case study in the Pandas library 
  • Import, format and clean data using the Pandas library
Statistics Fundamentals I 
  • Use the NumPy and Pandas libraries to analyze datasets using basic summary statistics: mean, median, mode, max, min, quartiles, interquartile range, variance, standard deviation and correlation 
  • Create data visualizations (scatter plots, scatter matrices, line graphs, box plots, and histograms) to discern characteristics and trends in a dataset 
  • Identify a normal distribution within a dataset using summary statistics and visualization
Statistics Fundamentals II 
  • Explain the difference between causation and correlation 
  • Test a hypothesis within a sample case study 
  • Validate your findings using statistical analysis (p-values, confidence intervals)
Instructor Choice 
  • Focus on a topic selected by the instructor/class in order to provide deeper insight into exploratory data analysis
Unit 2: Foundations of Data Modeling

Introduction to Regression 

  • Define data modeling and linear regression 
  • Differentiate between categorical and continuous variables 
  • Build a linear regression model using a dataset that meets the linearity assumption using the scikit-learn library
Evaluating Model Fit 
  • Define regularization, bias, and error metrics 
  • Evaluate model fit by using loss functions including mean absolute error, mean squared error, root mean squared error 
  • Select regression methods based on fit and complexity
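The three loss functions named above can be sketched in a few lines. This is a minimal illustration in plain Python, not course material: the function names and the sample values are made up for the example, and in practice a library such as scikit-learn would normally be used instead.

```python
import math

def mean_absolute_error(y_true, y_pred):
    """Average size of the errors, ignoring their sign."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def mean_squared_error(y_true, y_pred):
    """Average squared error; large misses are penalized more heavily."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def root_mean_squared_error(y_true, y_pred):
    """Square root of MSE, expressed in the same units as the target."""
    return math.sqrt(mean_squared_error(y_true, y_pred))

# Made-up observed values and model predictions, for illustration only.
y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.0, 5.0, 4.0, 8.0]

mae = mean_absolute_error(y_true, y_pred)       # 1.0
mse = mean_squared_error(y_true, y_pred)        # 1.5
rmse = root_mean_squared_error(y_true, y_pred)  # about 1.22
```

Because MSE squares each error, the single miss of 2 contributes more to it than the two misses of 1 combined, which is why MSE and RMSE react more strongly to outliers than MAE does.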
Introduction to Classification 
  • Define a classification model 
  • Build a K-Nearest Neighbors model using the scikit-learn library 
  • Evaluate and tune the model by using metrics such as classification accuracy/error
Introduction to Logistic Regression 
  • Build a logistic regression classification model using the scikit-learn library 
  • Describe the sigmoid function, odds, and odds ratios and how they relate to logistic regression 
  • Evaluate a model using metrics such as classification accuracy/error, confusion matrix, ROC/AUC curves, and loss functions
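The relationship between the sigmoid function, odds, and odds ratios can be shown with a small numeric sketch. The intercept and coefficient below are hypothetical values chosen for illustration, not the output of any fitted model.

```python
import math

def sigmoid(z):
    """Map a log-odds value to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def odds(p):
    """Convert a probability to odds: p / (1 - p)."""
    return p / (1.0 - p)

# Hypothetical one-feature logistic regression (assumed values).
intercept = -1.0
coef = 0.5

x = 2.0
log_odds = intercept + coef * x   # the linear part of the model: 0.0
p = sigmoid(log_odds)             # predicted probability: 0.5

# Raising x by one unit multiplies the odds by exp(coef).
p_next = sigmoid(intercept + coef * (x + 1.0))
odds_ratio = odds(p_next) / odds(p)
```

Here `odds_ratio` matches `math.exp(coef)` up to floating-point error, which is why exponentiated coefficients of a logistic regression are usually read as odds ratios.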
Communicate Results from Logistic Regression 
  • Explain the tradeoff between the precision and recall of a model and articulate the cost of false positives vs. false negatives 
  • Identify the components of a concise, convincing report and how they relate to specific audiences/stakeholders 
  • Describe the difference between visualization for presentations vs. exploratory data analysis
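The precision and recall tradeoff mentioned in this session can be illustrated by scoring the same predictions at two decision thresholds. This is a minimal sketch: the labels, scores, thresholds, and helper names are all invented for the example.

```python
def predict(scores, threshold):
    """Label an example positive (1) when its score reaches the threshold."""
    return [1 if s >= threshold else 0 for s in scores]

def precision_recall(y_true, y_pred):
    """Compute precision and recall from true/false positive and negative counts."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

# Made-up true labels and model scores, for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1, 0.7, 0.55]

low = precision_recall(y_true, predict(scores, 0.35))   # (0.666..., 1.0)
high = precision_recall(y_true, predict(scores, 0.65))  # (1.0, 0.75)
```

The low threshold catches every positive but raises two false alarms; the high threshold raises none but misses one positive. Which setting is better depends on whether a false positive or a false negative is more costly for the audience in question.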
Flexible Class Session 
  • Focus on a topic selected by the instructor/class in order to provide deeper insight into data modeling
Unit 3: Data Science in the Real World

Decision Trees and Random Forest 
  • Describe the difference between classification and regression trees and how to interpret these models 
  • Explain and communicate the tradeoffs of decision trees vs regression models 
  • Build decision trees and random forests using the scikit-learn library
Natural Language Processing 
  • Demonstrate how to tokenize natural language text using NLTK 
  • Categorize and tag unstructured text data 
  • Explain how to build a text classification model using NLTK
Dimensionality Reduction 
  • Explain how to perform dimensionality reduction using topic models 
  • Demonstrate how to refine data using latent Dirichlet allocation (LDA) 
  • Extract information from a sample text dataset
Working with Time Series Data 
  • Explain why time series data is different from other data and how to account for it 
  • Create rolling means and plot time series data using the Pandas library 
  • Perform autocorrelation on time series data
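To make the idea of a rolling mean concrete, here is a minimal plain-Python sketch of what the calculation does. The helper name and the sample series are made up for the example. In Pandas the equivalent is `Series.rolling(window).mean()`, which by default leaves the first `window - 1` positions empty in the same way.

```python
def rolling_mean(values, window):
    """Mean of the last `window` values at each position.

    Positions without a full window yet are returned as None.
    """
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)
        else:
            chunk = values[i + 1 - window : i + 1]
            out.append(sum(chunk) / window)
    return out

# Made-up daily values, for illustration only.
daily_sales = [10, 12, 14, 13, 15, 20]
smoothed = rolling_mean(daily_sales, 3)
# smoothed == [None, None, 12.0, 13.0, 14.0, 16.0]
```

The smoothed series rises steadily even though the raw values dip on the fourth day, which is the kind of trend a rolling mean is meant to expose.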
Creating Models with Time Series Data 
  • Decompose time series data into trend and residual components 
  • Validate and cross-validate data from different data sets 
  • Use the ARIMA model to forecast and detect trends in time series data
The Value of Databases 
  • Describe the use cases for different types of databases 
  • Explain differences between relational databases and document-based databases 
  • Write simple select queries to pull data from a database and use within Pandas
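A simple select query of the kind described above might look like the following sketch. It uses Python's built-in `sqlite3` module with an in-memory database; the table, columns, and rows are hypothetical and exist only for this example.

```python
import sqlite3

# In-memory database with a made-up table, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (name TEXT, unit INTEGER, score REAL)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?, ?)",
    [("Ana", 2, 84.5), ("Ben", 2, 91.0), ("Cy", 1, 77.0)],
)

# Select two columns, filter on one, and sort the result.
query = "SELECT name, score FROM students WHERE unit = ? ORDER BY score DESC"
rows = conn.execute(query, (2,)).fetchall()
# rows == [("Ben", 91.0), ("Ana", 84.5)]

conn.close()
```

To work with the result in Pandas, the same query and an open connection can be passed to `pandas.read_sql_query`, which returns the rows as a DataFrame.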
Moving Forward with your Data Science Career 
  • Specify common models used within different industries 
  • Identify the use cases for common models 
  • Discuss next steps and additional resources for data science learning
Flexible Class Session 
  • Focus on a topic selected by the instructor/class in order to provide deeper insight into data science in the real world
Final Presentations 
  • Deliver a final presentation to peers, the instructor, and guest panelists, who will identify strengths and areas for improvement

Remote Learning

This course is available for remote learning to anyone with an internet-connected device that has a microphone (this includes most computers and tablets). Classes take place with a live instructor at the dates and times listed below.

Upon registration, the instructor will send additional information about how to log on and participate in the class.

School Notes:
For students enrolling in 12-week part-time and immersive classes, booking more than one class at the same time is not recommended.

Refund Policy
If you can't make it to a class/workshop, please email us at [email protected] at least 7 days before the scheduled event date. No refunds will be given after this timeframe.


Reviews of Classes at General Assembly (2,625)

School: General Assembly

General Assembly (GA) equips individuals with the in-demand skills needed to build a career in today’s high-growth tech sectors. Their award-winning technical training includes flexible delivery, industry-tested curriculum, and a career services program that produces a 99.2% job placement rate for...

CourseHorse Approved

This school has been carefully vetted by CourseHorse and is a verified DEN educator.

1 Top Choice

Data Science Immersive

This class is temporarily being offered remotely.

at General Assembly - Online (Remote), Denver, Colorado

This is a full-time course.

A Well-Rounded Technical Foundation: Get hands-on training with the essentials of data science: data mining, statistical modeling, machine learning, and the Python programming language. Apply advanced techniques such as recommender systems, neural networks, and computer vision models to power business forecasts and drive...

Monday Oct 11th, 7am - 3pm Mountain Time

  (60 sessions)
