Dec 06, 2025  
Workforce Development & Community Education 
    
Workforce Development & Community Education

Introduction to Data Science and Machine Learning

Location(s): Remote, SUNY Westchester Community College


Return to {$returnto_text} Return to: Areas of Study

Data science is an exciting discipline, which leverages Machine Learning and Artificial Intelligence to enable decision makers to turn raw data into understanding, insight, and actionable options. With the enormous volume and variety of data being created and collected daily, Data Science is one of today’s fastest growing and critically important fields for businesses, organizations, and government. Data Scientists are in demand by both industry and the public sector with robust job growth expected well into the next decade.

Target Audience:

Information Architects, Data Analysts, Statisticians, Developers, Business Intelligence professionals, Business Analysts, Big Data specialists, Coders, Web Developers, learners interested in Predictive Analytics and anyone looking to expand their skills and / or advance their career by learning these valuable and in demand knowledge areas.

This hands-on, project-based course serves as a foundation for building real-world applications with Machine Learning capabilities and as a starting point for a career as a well-rounded data practitioner. In addition to guided projects and practical exercises, students will complete a final capstone project that integrates key concepts from the course. Each student will present their capstone to the class, demonstrating their ability to design, implement, and communicate a machine learning solution to a real-world problem.  Students who successfully complete will receive a Certificate of Completion from SUNY Westchester and a shareable digital badge in recognition of their achievement. 

The course focuses on key foundational concepts related to Data Science and Machine Learning using the Python programming language. Students will learn the foundations of problem solving, statistical algorithms, and machine learning models using Jupyter Notebooks within the Anaconda, Visual Studio Code, and Google Colab programming environments.  Students will also learn the basics of SQL databases as data sources, as well as the Docker containerization platform for model deployment.  

Students will implement experiments to solve various business problems using the Python programming language and machine learning algorithms included in the scikit-learn and Tensorflow libraries.  These experiments are designed to satisfy predictive analytics requirements related to regression, classification, image recognition and natural language processing. 

Objective:

After taking this class, students are expected to:

  • Understand fundamentals of the Python programming language and create scripts that interact with data sets and machine learning models, 
  • Interact with data sets in various formats (including flat files and SQL databases) and create meaningful visualizations based on business requirements, 
  • Understand the basics of Python-based machine learning models and when to select the appropriate algorithms based on business requirements, 
  • Gain proficiency with the Jupyter Notebook, Visual Studio Code and the Google Colab programming environments 
  • Understand the foundations of Deep Learning and implement models using Deep Learning algorithms 
  • Illustrate the data science lifecycle 
  • Collect, clean, prepare and explore data 
  • Navigate the model selection process 
  • Evaluate and refine data models 
  • Deploy data models using techniques including Docker containers

Course Outline:

  • Class Introduction and Course Topics Review 
  • Overview with Data Sets, Python and Jupyter Notebooks 
  • Transform and Visualize Datasets (Including CSV, JSON and SQL queries) 
  • Implement Data Wrangling Techniques 
  • Introduction to Statistical Analysis 
  • Implement Regression, Classification and Clustering Models 
  • Understand Deep Learning Neural Networks 
  • Understand Model Loss, Optimizers and Learning Rates 
  • Understand the NLTK Libraries 
  • Implement Deep Learning Models for Regression, Image Recognition and Natural Language Processing 
  • Recognize Lifecycle Frameworks 
  • Identify Tools and Best Practices 
  • Demonstrate Exploratory Data Analysis 
  • Explore Mathematical Areas Including Linear Algebra and Calculus 
  • Tune Hyperparameters 
  • Prepare Data for Stakeholders 
  • Describe Deployment Methodologies 

Spring Course Offerings


Intro to Machine Learning and Data Science featuring Python, REMOTE
T/Th, Mar. 3-July 7, 6:00-9:00 pm, $2,200. #37767
Call 914-606-5685 for more info.CE-COMP 2239  

How to Register


Register over the phone using MC, Visa or Discover. Call 914-606-6830.

You will need the Class # when speaking with a representative.

Office hours for registration are Monday – Thursday 8:30 a.m. to 7:15 p.m.
Friday 8:30 a.m. to 4:30 p.m. (in summer, 9:00 a.m. – 12:00 noon)
Saturday 9:00 a.m. to 3:30 p.m. (in summer, closed some Saturdays)

For course questions, please contact:

Romina Ganopolsky, Program Administrator
Workforce Development
Call 914-606-5685 or email romina.ganopolsky@sunywcc.edu

Return to {$returnto_text} Return to: Areas of Study