Skip to main content

Data Resources: Gumberg Library

Getting Started

Being able to find data, analyze it, and then interpret the results is an extremely valuable skill. This guide will help you to: 

* find datasets that meet your information need

* understand the different types of files that often accompany data files such as setup files, readme files, codebooks, and data dictionaries 

* build skills in analyzing data through the use of statistical software (SPSS and SAS) 

* properly cite data from external sources 

What is Data? What is Big Data?

It's important to understand that data and statistics are two different things; statistics are the analyzed data.  For example raw data might include teacher salaries at a high school and a statistic might be the mean teach salary.     

Data: 

"information, especially facts or numbers, collected to be examined and considered and used to help decision-making, or information in an electronic form that can be stored and used by a computer" (Cambridge Dictionary

" factual information (such as measurements or statistics) used as a basis for reasoning, discussion, or calculation" (Merriam-Webster)

Big Data: 

In recent years, many people started using the phrase "big data" to refer to very large datasets that can be collected and analyzed thanks to modern computing advances. In the early 2000's analyst Doug Laney described big data in terms of three V's: Volume, Velocity and Variety.  The three V's are now a mainstream way of thinking about big data. -SAS Big Data Webpage   

SPSS Data for Workshops

Maureen Diana Sasso

Maureen Diana Sasso's picture
Maureen Diana Sasso
Contact:


sasso@duq.edu
412.396.5680
Website