DS1300: A Practical Introduction to Data Science#

Contact Information#

Instructor

Email

Phone

Office

Office Hours

Dr. Robert Kalescky

rkalescky@smu.edu

8-3070

Ford Hall 119

By Appointment

Dr. Eric Godat

egodat@smu.edu

8-4599

Ford Hall 119

By Appointment

Students will be encouraged to use the course Teams channel for communication with instructors and their fellow classmates. This will be especially important when communicating with group members while working on the semester project.

Class information and assignments will be posted in Canvas.

Dates and Times#

May Term, May 11 through 25 10:00 AM - 12:00 PM, 1:00 PM - 3:00 PM

Course Overview#

We live in a world driven by computers, algorithms, and data. This course seeks to equip students with the basic knowledge and skills to not only understand but to use the tools needed to model and make decisions about the world around them. Students will be introduced to basic programming in Python, high-performance computing using ManeFrame II, and data science. This course is not a programming course, though programming will be required to complete assignments. Prior knowledge of programming is neither expected nor required.

Learning Outcomes#

At the end of the course you will be able to:

  • Identify applications of data science in today’s world

  • Identify the privacy and ethical issues associated with the use of data

  • Clearly and concisely communicate data analyses via visualization and meaningful metrics

  • Have had exposure to computational thinking, programming, and high-performance computing

University Requirement Fulfillment#

UC Requirement (matriculation Fall 2016 to Spring 2020)#

This course meets a UC Breadth requirement for Technology and Mathematics

  • 1b Students will demonstrate an ability to analyze complex mathematical problems that arise in a particular discipline or area

  • 1c Students will demonstrate an understanding of how particular technologies work

CC Requirement (for students entering Fall 2020 or later)#

This course fulfills the CC Breadth requirement for Technological Advances and Society (TAS) by introducing students to the technology used in computational and data sciences, how this technology affects society by highlighting its use in different domains, and discussing issues like privacy and ownership of information.

Required Materials#

For this course, you will need a computer that has internet access. All programming will be done on through SMU’s OnDemand Portal so the specifications of your machine are minimal as long as you can connect to an internet browser.

The required reading for this class will be assigned out of:

Suggested readings will come from an assortment of sources including:

Course Structure#

This course will be lecture based with in-class activities designed to demonstrate and reinforce concepts. Given the accelerated nature of short semester courses, a given day will be divided into lecture, coding demonstrations, hands-on programming exercises, and collaborative work sessions. Students will be expected to complete homework assignments after class, though these assignments will follow closely from what is covered in class. Participation in class is expected and will be critical to student success.

This course is not a programming course, though programming will be required to complete assignments. Prior knowledge of programming is neither expected nor required. The instructors will equip students with the skills and tools needed to complete each assignment and be available to provide programming support should need arise.

The course also contains a group project that will span the length of the semester. Students will be divided into teams by the instructors. These groups will work together for the duration of the semester. Group work will be evaluated as a sum of individual efforts and, as such, each member is expected to contribute equally to the success of the project. Issues within teams should be brought to the instructors attention immediately so they can be resolved quickly.

Evaluation#

In-class coding assignments, homework sets, and a semester-long group project will be used to evaluate understanding and proficiency with the material.

Category

Percentage

In-class Participation

20%

Homework Problems

40%

Semester Project

40%

The semester project will culminate in a group presentation to the class during the time designated by the University Registrar for the final exam. This presentation will involve a question and answer session from peer students, attendees, and the instructors. Students will be graded on their presentation skills, content, ability to answer questions, and engagement in other student presentations.

Syllabus#

Each calendar date on the syllabus corresponds to 4 lecture hours, in line with the structure of a May Term semester.

  • Day 1

    • High-performance Computing

    • Using ManeFrame II

    • Using GitHub

  • Day 2

    • Data Science Overview

    • Using Jupyter

    • Python Programming I

  • Day 3

    • Ethics and Bias in Data

    • Python Programming II

    • Working with Data

  • Day 4

    • Exploring and Working with Data

    • Story Telling with Data

  • Day 5

    • Data and Justice and Medicine

    • Project Work Day

  • Day 6

    • Intermediate Project Presentations

    • Using Dask

  • Day 7

    • Data and Cars and Crime

    • Data Storage

    • Building Models and Making Predictions

  • Day 8

    • Visualizing Various Types of Data

    • Using Geopandas

  • Day 9

    • Data and Art

    • Text Mining

    • Customizing Analysis Functions

    • Metric Building

  • Day 10

    • Project Work Day

  • Day 11

    • Project Presentations

Semester-Long Project#

Each student will be assigned a semester-long project that will require the student to apply the methods and tools discussed in the class to a unique dataset. The goal of the project is to reinforce core data science concepts and techniques through application.

University Policies#

  • Disability Accommodations: Students needing academic accommodations for a disability must first register with Disability Accommodations & Success Strategies (DASS). Students can call 214- 768-1470 or visit: http://www.smu.edu/Provost/ALEC/DASS to begin the process. Once registered, students should then schedule an appointment with the professor as early in the semester as possible, present a DASS Accommodation Letter, and make appropriate arrangements. Please note that accommodations are not retroactive and require advance notice to implement.

  • Religious Observance: Religiously observant students wishing to be absent on holidays that require missing class should notify their professors in writing at the beginning of the semester, and should discuss with them, in advance, acceptable ways of making up any work missed because of the absence. (See University Policy No. 1.9.)

  • Excused Absences for University Extracurricular Activities: Students participating in an officially sanctioned, scheduled University extracurricular activity should be given the opportunity to make up class assignments or other graded assignments missed as a result of their participation. It is the responsibility of the student to make arrangements with the instructor prior to any missed scheduled examination or other missed assignment for making up the work. (University Undergraduate Catalogue)

  • Student Learning Outcomes: Please include in your syllabi all student learning outcomes, both those specific to your course, as well as those that satisfy major and general education requirements.

  • Final Exams: Final course examinations shall be given in all courses where they are appropriate, and some form of final assessment is essential. Final exams or final assessments must be administered as specified in the official examination schedule, and shall not be administered during the last week of classes or during the Reading Period. Please state clearly in the syllabus the date/time and form of the final exam or assessment.