IDEAS Tutorial
Begin Using IDEAS Now!
|
![]() |
|
The Roper Center provides this tool for your online assistance in analyzing certain datasets. By accessing this feature, you may do your own basic analyses without the aid of expensive statistical software packages. You will be able to analyze the opinions, attitudes and experiences of the respondents as well as the relationships between them.
What does it mean to analyze a dataset?Whenever a survey is conducted, question responses are recorded in numerical form. For instance, if a survey asks, “How often would you say you vote—always, nearly always, part of the time, or seldom?” the response “always” might be recorded as 1, “nearly always” as 2, and so forth. Once the interviewing is completed, the results are entered into a computer-readable collection of these coded responses called a dataset. CodebooksYou can get more detailed information about the survey by clicking on “View codebook in separate window.” The codebook contains the actual questionnaire, or instrument, giving the complete wording for each variable, the labels and codes for all response categories, and other pertinent information about the survey. A Quick Tutorial:Let’s say, for example, you want to analyze the dataset “CBS News Poll: Kennedy Assassination [May 1998],” which is located in Elections, Political Parties/Figures (Click on the survey located under Date Analysis Tool) of Topics at a Glance section of the Roper Center website. FrequenciesThe pull-down menus used to set up the analysis provide descriptive labels for all the questions in the survey. In the analysis of a poll, each question in the survey is referred to as a variable. The most basic form of dataset analysis produces a frequency distribution or topline for each variable. Toplines tell us what percentage of all the respondents gave each response to that particular question. In the Kennedy survey, let’s look at the results of the question: “Do you think one man--Lee Harvey Oswald--was responsible for the assassination of President (John) Kennedy or do you think there were others involved?” To run the frequency distribution using IDEAS:
Bivariate analysis –“Crosstabulations”A second level of dataset analysis available in IDEAS is bivariate analysis. In this case, two variables are crosstabulated against one another. Cross tabs can be run for any variables in the survey you’d like to correlate. Independent variables are placed in columns. Dependent variables are placed in rows. A typical two-way crosstab looks at how responses to a survey question differ by demographic group. For instance, to use IDEAS to find out how “gender” might have influenced opinions on the Kennedy assassination question do the following:
CSM, UC Berkeley To understand the color coding system, read below. Control variables – Three-way CrosstabsProvided a survey sample is large enough to yield a significant result, it is also possible in IDEAS to run a three-way cross tabulation by including a control variable in your analysis. Let’s say, for example, you want to find out how the results of the previous bivariate analysis differ by “Education.”
CSM, UC Berkeley Filter(s)Sometimes, a researcher might need to study a specific group of respondents in the survey. IDEAS’s filter function allows you to specify which respondents you wish to include in your analysis. The subsample, or subset, you select might be respondents in a particular demographic group, such as women or African Americans. A subsample could also consist of individuals who responded in a particular way to one of the questions in the survey. For instance, let’s say you’re interested in knowing more about the respondents in the Kennedy survey. You’re interested in finding out about the men in the survey, where they live, and their political ideology.
Your table should look like this:
CSM, UC Berkeley WeightingSurvey firms apply a technique called weighting to adjust the poll results to account for possible sample biases caused by specific groups of individuals not responding to a survey. The weighting mechanism uses known estimates of the total population provided by the US Census Bureau to adjust the final results. It's not uncommon to weight data by, for instance, age, gender, education, or race in order to achieve the correct demographic proportions. In IDEAS, you can look at and analyze both weighted and unweighted survey results by checking or unchecking the box labeled Weight Tables. The default setting is for weighted results. Color codingIn the analysis of survey data, the smaller a sample is, the less likely it is to be representative of the population as a whole; and, therefore, the less reliable the survey results will be based on that sample. A crosstabulation divides a survey sample, in effect, into a number of smaller samples, each of which occupies a cell in the table produced by the crosstab. These numbers can become quite low if the questions included in the crosstabs have a lot of response categories, or if the demographic variables include more than two or three groups. In IDEAS, color coding is used to indicate the relative reliability of the data appearing in each cell of a table. Descriptive statisticsSome users of IDEAS will wish to have more detailed information for their frequencies and crosstabs than the program’s default setting provides. To obtain a summary of descriptive statistics for an analysis, simply check off the Descriptive Statistics box before running the table. Examples of descriptive statistics include: Mean and Standard Deviation. Summary statistics provide the statistical significance of the relationship. Let’s find out the descriptive statistics for the relationship between “Gender” and “Think Oswald was responsible for JFK’s assassination”:
The table should look like this:
CSM, UC Berkeley | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||