Vous êtes sur la page 1sur 3

Universidad Francisco Marroqun

Facultad de Ciencias Econmicas


January-May 2016

Course: Data Science 1


Time: Tuesday, Thursday and Friday, 7:00AM-8:20AM
Classroom: A-305 on Tuesdays and Thursdays; A-304 on Fridays
Professors: Lucas Rentschler
2413-3269 (office)
lrentschler@ufm.edu

And

Roger Alexander Sandoval Romn


4000-8013 (cell phone)
rogersandoval@ufm.edu

Course Materials:
Each student has their own preferred learning style. With this in mind we provide several
ways to study the material. These are not required, and you can use other media if you
prefer. All of the options below are free.

Textbook:
Peter Dalgaard, Introductory Statistics with R, 2nd Edition, ISBN: 978-0-387-79053-4
This book has been uploaded to MiU in PDF format.

Online courses:
EdX: Explore Statistics with R
Coursera: Statistics 1

Short videos:
Introduction to R
Basic statistics

All programming in this class will be done in R. R is a free software environment for
statistical computing and graphics. This description comes from www.r-project.org,
where you can download R.

Course Description:

This class has three objectives:


1. To learn basic probability theory:
a. Discrete and continuous probability distributions
b. Combinatorics
c. Random sampling
d. Summary statistics
i. Expected value
ii. Median
iii. Mode
iv. Standard deviation
v. Correlation
2. To learn the basics of programming:
a. Learn the syntax and basics of R
b. Data cleaning and management in R
c. Generating graphics to effectively summarize data in R
3. To learn these concepts in a Project Based Learning (PBL) environment. An
excellent definition of PBL, from bie.org:
a. Project Based Learning is a teaching method in which students gain
knowledge and skills by working for an extended period of time to
investigate and respond to an engaging and complex question, problem, or
challenge. In Gold Standard PBL, Essential Project Design Elements
include:
i. Key Knowledge, Understanding, and Success Skills - The project
is focused on student learning goals, including standards-based
content and skills such as critical thinking/problem solving,
collaboration, and self-management.
ii. Challenging Problem or Question - The project is framed by a
meaningful problem to solve or a question to answer, at the
appropriate level of challenge.
iii. Sustained Inquiry - Students engage in a rigorous, extended
process of asking questions, finding resources, and applying
information.
iv. Authenticity - The project features real-world context, tasks and
tools, quality standards, or impact or speaks to students personal
concerns, interests, and issues in their lives.
v. Student Voice & Choice - Students make some decisions about the
project, including how they work and what they create.
vi. Reflection - Students and teachers reflect on learning, the
effectiveness of their inquiry and project activities, the quality of
student work, obstacles and how to overcome them.
vii. Critique & Revision - Students give, receive, and use feedback to
improve their process and products.
viii. Public Product - Students make their project work public by
explaining, displaying and/or presenting it to people beyond the
classroom.

Course Structure:
There will be three projects in this class. For each project we will randomly match you
with a group of your classmates (note that this random matching takes place for every
project, your group will likely change between projects).

For each of these projects, your group will seek to answer a question using data analysis
performed in R. This may require that you acquire and clean data. Your group will
communicate your conclusions, with statistical support, via a medium of your choice.
We intentionally leave the requirements of the project somewhat vague so that you and
your group can decide how you would like to proceed.

Classroom Dynamic:

In a given class you will work with your group. You may choose the structure of your
group and how you want to organize your group. You can opt to choose a project leader
who then assigns various tasks. You can opt to proceed via a majority rules decision
structure. During a given class your group will identify areas that need attention. These
could be concepts that need to be mastered, data that must be collected or cleaned,
analysis that must be done, or writing that must be done. It is up to your group to then
decide how to address these things, and then to do it.

We will be present to advice you, and to provide feedback throughout, but we will not
show you how to accomplish a given task.

In order to enforce concepts, readings will be assigned to have discussions about them to
create the connection between the analytics and the business needs. These reading will
be in topics such as Social Networking Analysis (SNA), Big Data, Unstructured
Databases, etc.

Grading:

Your grade in this class is based entirely on the three projects. Each project will be
weighted equally. After a project is submitted, we will assign a grade to the project,
which we denote as . If there are people in your group, then the group will receive
points. The group will decide how to allocate these points among the group members.
The number of points the group allocates to you is your grade for the project.

Attendance:

Attendance is not used as part of the grading process of this class. However, students
who attend regularly will find it much easier to excel. As such, we strongly encourage
you to attend class. Please, as a courtesy to us and your fellow students, be punctual.

Vous aimerez peut-être aussi