Soc 254 (Fall 2001): Research Methods

Concepts

Dates of lectures

September 6
September 11

September 13

September 20

October 2

October 4

October 23

October 25

October 30

November 6

November 13

November 27

September 6

Scientific method:

The process of creating new knowledge using empirical observations and logical analysis according to the conventions of the scientific community. It includes at least 5 steps:

1.

Observe or have an interest in some aspect of the world.

2.

Invent a tentative description, called a hypothesis, that is consistent with what you have observed.

3.

Use the hypothesis to make predictions.

4.

Test those predictions by further observations and analysis and modify the hypothesis in the light of your results.

5.

Repeat steps 3 and 4 until there are no discrepancies between theory and experiment and/or observation.

Theory:

A set of logically consistent ideas about the relationships between things (concepts) that permits those ideas to be checked against observations through scientific research.

Hypothesis:

A conditional statement, relating to the relationship between variables, that can be subject to testing.

Concepts:

Formally developed ideas that a researcher may seek to study. The "building blocks" of theory.

Variables:

Concepts that have been transformed into measures in which differences can be established.

Indicators:

Observable phenomena that can be used to designate differences in variables.

Operationalization:

The process of figuring out how to measure concepts and test hypotheses using emprical observations.

Falsification:

The ability to disprove a testable proposition. In the scientific method, hypotheses are supported by failing to reject (or falsify) them based on empirical evidence.

Objectivity:

A state of knowledge about the empirical world that is independent of the knower's perceptions and biases.

Intersubjectivity:

A state of knowledge about the empirical world that is shared and agreed upon by several knowers.

Paradigm:

A set of presuppositions on which scientific activity is built; the body of theories, ideas, models, test cases, and values shared by a scientific community; and the specific scientific accomplishments that influence future scientific activity.

Deduction:

Reasoning that moves from general principles (theory) to particular instances (empirical observations).

Induction:

Reasoning that moves from from particular instances (empirical observations) to general principles.

Back to schedule

September 11

Explanation:

Scientific analysis that is causal; that is, it accounts for (or explains) change in one variable with changes in another variable or set of variables.

Description:

Scientific analysis that simply relates (or describes) how a variable changes, either by itself or when related to another variable.

Understanding:

Scientific analysis that is concerned with the process by which one variable relates to another, either as a reasonable account of the intervening mechanisms or (in ethnographic research) as a lived and meaningful experience for individuals and groups.

Independent variable (X):

The thing (phenomenon or event) that is hypothesized to bring about the effect of something else.

Dependent variable (Y):

The thing (phenomenon or event) that the researcher is trying to explain.

Intervening variable:

A third variable that is simultaneously independent (to the original dependent variable) and dependent (to the original independent variable).

Positive relationship:

A relationship between variables in which change in one variable brings about the same kind of change (i.e., "in the same direction") for another variable. When variables are positively related, an increase in one brings about an increase in the other, and a decrease in one brings about a decrease in the other.

Negative relationship:

A relationship between variables in which change in one variable brings about an inverse change (i.e., "in the opposite direction") for another variable. When variables are negatively related, an increase in one brings about a decrease in the other, and a decrease in one brings about an increase in the other.

Linear relationship:

A relationship between variables in which change in one variable brings about a constant amount of change for another variable.

Validity:

A measurement principle in which the variables you use actually demonstrate the concepts you choose to observe. If your measurements are valid, then you really are measuring what you think you are measuring.

Reliability:

A measurement principle in which the measurement procedures you use can generate the same measurements if they were to be repeated at a different time, on a comparable sample, or (in qualitative research) by a different researcher.

Error:

A measure of the amount of empirical observations that cannot be described by the hypothesized model. There are two sources of error:

Random error occurs because of the complexity of forces and diversity of people that make up the real world that researchers study. Ultimately, researchers are unable to eradicate the occurence of random error.

Systematic error occurs because of a flaw in the operationalization of a research design. Researchers must try to reduce the occurence of random error as much as possible.

Categorical variables:

Variables that depict attributes or categories of a concept that cannot be reduced to a number or numerical scale; they vary in kind. There are two kinds of categorical variables:

Nominal variables do not vary according to a specific order. The categories of nominal variables are simply names.

Ordinal variables vary according to a specific order, but the degrees of separation between their ranks cannot be numerically specified.

Numerical variables:

Variables that depict attributes or categories of a concept that can be reduced to a number or numerical scale; they vary in degree. There are two kinds of numerical variables:

Interval variables vary according to a specific order; the degrees of separation between their ranks can be numerically specified, but no true zero point orders their measured variation.

Ratio variables vary according to a specific order; the degrees of separation between their ranks can be numerically specified, and a true zero point orders their measured variation.

Controlling for variables:

A technique in explanatory analysis in which all possible determinants of a dependent variable are held constant, save one: the suspected causal independent variable.

Back to schedule

September 13

Mean:

Commonly known as "average," a measure of central tendency determined by adding up the quantities of each unit in a distribution and then dividing by the number of units.

Median:

A measure of central tendency that represents the midpoint in a distribution of ordered data.

Mode:

A measure of central tendency that represents the most frequent value in a distribution.

Correlation coefficient (r):

A measure that indicates the strength of the linear relationship between two numerical variables. As r tends to +1, the two variables are more strongly positively associated; as r tends to -1, the two variables are more strongly negatively associated.

Statistical significance:

An empirical standard of confidence that the measured relationship between variables is unlikely to have occurred by chance alone. Although there are different formulae and standards for determining statistical significance, most require that the probability of the measured relationship occuring because of chance be less than 5 percent to be statistically significant.

Crosstabulation:

A table format for reporting measurements of relationships between variables. The number of columns correspond to the number of categorical attributes in the independent variable (x); the number of rows correspond to the number of categorical attributes in the dependent variable (y); and each cell reports the proportion of the sample for each type of relationship between variables.

Causality:

A relationships between variables in which at least three criteria are empirically established:

1.

X and Y are correlated.

2.

X precedes Y in time.

3.

The X-Y relationship is nonspurious; no other competing variables account for the observed X-Y relationship.

Carter (2001: 19) offers a fourth and final criteria:

4.

The X-Y relationship is intuitively pleasing; it fits with our current understanding of how the world functions.

Spuriousness:

A relationship between independent and dependent variables in which the hypothesized relationship is in fact caused by the influence of a third variable that is independent and antecedent to the other two.

In crosstabulations, spuriousness is revealed when the direction and strength of the originally hypothesized relationship (i.e., the difference of percentages in cells of each row) diminishes or disappears to zero after a third variable is controlled for with the use of partial tables.

Multivariate model:

A hypothesis in which the effect of more than one independent variable is studied.

In crosstabulations, a multivariate model is revealed by changed but consistent relationships (i.e., strengthened in one, weakened in the other) in the originally hypothesized relationship after a third variable is controlled for with the use of partial tables.

Intervening variable:

A third variable that logically falls in a time sequence, and systematically explains the hypothesized relationship, between the independent and dependent variables.

In crosstabulations, an intervening variable is revealed when the direction and strength of the originally hypothesized relationship (i.e., the difference of percentages in cells of each row) diminishes or disappears to zero after a third variable is controlled for by constructing partial tables. Note that this is the same effect revealed by spuriousness; only logical analysis about the sequence of the third variable can determine whether it is an intervening variable or hides a spurious relationship.

Interaction effect:

The tendency for a third variable to interact with the independent variable, thereby altering the relationship between the independent and dependent variables. Thus, the originally hypothesized relationship between the independent and dependent variables will vary under different conditions of the third variable.

In crosstabulations, an interaction effect is revealed when the originally hypothesized relationship maintains itself in one partial table but diminishes or disappears to zero in the other after a third variable is controlled for.

Back to schedule

September 20

Survey research:

Quantitative social research in which one systematically asks many people the same questions, then records and analyzes their answers.

Questionnaire:

A measurement instrument that provides written instructions and questions which respondents self-administer in order to provide data for analysis.

Interview schedule:

A measurement instrument that provides instructions and questions which the researcher verbally administers to informants in order to gather data for analysis.

Secondary data analysis:

Research in which one does not gather data oneself but reexamines data previously gathered by someone else to test original hypotheses.

Closed-ended questions:

Survey research questions in which respondents choose from a fixed set of answers.

Open-ended questions:

Survey research questions which respondents answer in their own words.

Unit of analysis:

The kind of empirical case or unit that a researcher observes, measures, and analyzes in a study.

Response rate:

The percentage of respondents who complete a questionnaire or interview. Although researchers disagree about what constitutes an acceptable response rate, most consider anything below 50% to be poor and over 90% to be excellent.

Index:

A measure that sums or combines many separate measures of a variable.

Scale:

An index that measures the intensity, direction, level, or potency of a variable constructed along a continuum. Most scales measure ordinal variables.

Back to schedule

October 2

Regression analysis:

A form of statistical analysis that is especially useful in explaining multivariate models. A regression equation expresses the relationship between two or more variables algebraically, estimating the average change in a dependent variable given a change in the independent variable(s). In its simplest (linear) form, a regression equation is usually written:

Y = a + b₁X₁ + b₂X₂ + ... + b_kX_k + e

where...

Y is the dependent variable
a (alpha) is the constant (a.k.a. intercept)

b (beta) is the regression coefficient (a.k.a. slope)

X is the independent variable

e is the error term

Dummy variable:

An operational construct that represents a categorical variable as a two-category numerical variable for statistical analysis. E.g., for gender: female = 1, male = 0.

Back to schedule

October 4

Sample:

A group of subjects that are selected for study in order to make generalizations about a broader population.

Population:

The group of all subjects, either known or unknown, from which a sample is selected.

Elements:

The individual units, often individual persons, that comprise a sample.

Probability sampling methods:

Methods for drawing a sample in which the probability of selecting population elements is known. With probability sampling methods, the researcher uses random sampling so that the representativeness of the sample characteristics to the (known) population characteristics can be statistically calculated.

Nonprobability sampling methods:

Methods for drawing a sample in which the probability of selecting population elements is not known. With nonprobability sampling methods, the researcher must explicitly explain how the sample represents the population from which it was drawn.

Sampling frame:

The list from which the elements of the population are selected. Sampling methods are only as sound as the sampling frame operationalizes the population.

Availability sample:

A nonprobability sample in which elements are drawn based on their availability to the researcher. Also known as a convenience sample.

Snowball sample:

A nonprobability sample in which the researcher asks the initial elements, usually people, to refer other potential elements for inclusion in the sample. The process is repeated until the sample grows (or snowballs) to the size desired by the researcher.

Purposive sample:

A nonprobability sample in which the researcher selects elements for a specific purpose, usually because of the unique position of the elements.

Stratified random sample:

A probability sample that is organized to capture known group differences among the population. In the first stage, elements are sorted into separate groups (or strata) according to the selected group characteristics, e.g., men and women, different racial and ethnic groups. In the second stage, elements are randomly sampled from within the strata.

Back to schedule

October 23

Field research:

Research about social groups and behavior based on careful observation in a natural social environment. Field research can entail any research methods, not just participant observation.

Ethnography:

Literally, "people" + "writing." The description of a group, culture, or social practice as its members understand it.

Inductive research:

Research that begins with the observation stage. Typically, inductive researchers let their research questions emerge from their initial observations and analysis.

Participant observation:

A method of research involving participating and observing first hand in the social behavior and groups you are studying.

Reactivity:

A problem of validity that occurs when subjects are aware they are being studied and alter their behavior from its "natural" patterns.

Field notes:

Writings in which a field researcher records his or her personal observations (the indicators of participant observation). Field notes tend to include at least (1) thick descriptions, (2) running hypotheses, and (3) notes for further investigation.

Back to schedule

October 25

Triangulation:

Drawing together multiple types of evidence gathered from different sources using different methods of data collection.

Control group:

In an experiment, the comparison group who is not exposed to the experimental treatment (i.e., the independent variable).

Back to schedule

October 30

Covert research:

Research conducted without the knowledge or consent of those being studied or by misrepresenting the role of the researcher.

Open setting:

A setting for social behavior where the field researcher can enter without permission.

Closed setting:

A setting for social behavior where the field researcher must obtain permission from members before entering.

Back to schedule

November 6

Experiment:

A research method that seeks to isolate the effects of an independent variable on a dependent variable under strictly controlled conditions.

Treatment:

The independent variable that is administered to subjects in an experiment.

Experimental group:

In an experiment, the group who is exposed to the treatment (i.e., the independent variable).

Pretest:

In an experiment, the baseline measure that is compared with the measures taken after the treatment (i.e., posttest).

Posttest:

In an experiment, the measure of the effect of the experimental treatment that is often compared to a pretest.

Internal validity:

The extent to which an experiment has actually caused what it appears to cause.

External validity:

The extent to which an experiment can be generalized to other settings, other treatments, and other subjects.

Posttest distortion:

In an experiment, the degree to which the observed effect of the treatment is actually due to the effect of measuring the dependent variable twice (in pretest and posttest measures).

Hawthorne effect:

In an experiment, when subjects demonstrate the expected effect without being exposed to the treatment, solely because they are participating in the experiment.

Classical experiment:

The most fundament experimental design: the experimental and control group are given the pretest, an experimental group is given the treatment, and the experimental and control group are given the posttest.

Solomon four-group design:

An experimental design with two experimental and control group pairs, one of which is not exposed to pretest measures.

Posttest-only control group design:

An experimental design where neither the experimental nor the control group is exposed to pretest measures.

Back to schedule

November 13

Risk-benefit ratio:

A measure of a study's ethics: are the risks to subjects and science greater than the benefits to science and society?

Back to schedule

November 27

Content analysis:

An unobtrusive method for analyzing texts that involves four stages.

1. Variable construction:

deciding which characteristics or themes of the texts are to be analyzed and how they are to be observed

2. Sampling:

selecting the texts to be analyzed

3. Observation:

code (or measure) each text for the characteristics or themes

4. Analysis:

aggregate the measurements and make numerical descriptions of the texts.

Manifest content:

Elements of a text that are already observable and readily measurable without (much) interpretation by the content analyst.

Latent content:

Elements of a text that require interpretation by the content analyst before they can be observed and measured.

Intercoder reliability:

In content analysis, the need to confirm the accuracy of a coder's measurements by checking (some sample of) them against a second coder's measurements.

Back to schedule

These definitions have been developed by Leonard Nevarez and in some cases have been informed by the following sources:

Babbie, Earl. 1992. The Practice of Social Research. 6th ed. Belmont, CA: Wadsworth Pub
Baker, Therese L. 1994. Doing Social Research. 2nd ed. New York: McGraw-Hill.

Berg, Bruce L. 2001. Qualitative Research Methods for the Social Sciences. 4th ed. Boston: Allyn and Bacon.

Carter, Gregg Lee. 2001. Doing Sociology with Student CHIP. 3rd ed. Boston: Allyn and Bacon.

Neuman, W. Lawrence. 2000. Social Research Methods. 4th ed. Boston: Allyn and Bacon.

Schutt, Russell K. 1996. Investigating the Social World. Thousand Oaks, CA: Pine Forge.