Dates of lectures
September 6
Theory: A set of logically consistent ideas about the relationships between things (concepts) that permits those ideas to be checked against observations through scientific research.
Hypothesis: A conditional statement, relating to the relationship between variables, that can be subject to testing.
Concepts: Formally developed ideas that a researcher may seek to study. The "building blocks" of theory.
Variables: Concepts that have been transformed into measures in which differences can be established.
Indicators: Observable phenomena that can be used to designate differences in variables.
Operationalization: The process of figuring out how to measure concepts and test hypotheses using emprical observations.
Falsification: The ability to disprove a testable proposition. In the scientific method, hypotheses are supported by failing to reject (or falsify) them based on empirical evidence.
Objectivity: A state of knowledge about the empirical world that is independent of the knower's perceptions and biases.
Intersubjectivity: A state of knowledge about the empirical world that is shared and agreed upon by several knowers.
Paradigm: A set of presuppositions on which scientific activity is built; the body of theories, ideas, models, test cases, and values shared by a scientific community; and the specific scientific accomplishments that influence future scientific activity.
Deduction: Reasoning that moves from general principles (theory) to particular instances (empirical observations).
Induction: Reasoning that moves from from particular instances (empirical observations) to general principles.
Explanation: Scientific analysis that is causal; that is, it accounts for (or explains) change in one variable with changes in another variable or set of variables.
Description: Scientific analysis that simply relates (or describes) how a variable changes, either by itself or when related to another variable.
Understanding: Scientific analysis that is concerned with the process by which one variable relates to another, either as a reasonable account of the intervening mechanisms or (in ethnographic research) as a lived and meaningful experience for individuals and groups.
Independent variable (X): The thing (phenomenon or event) that is hypothesized to bring about the effect of something else.
Dependent variable (Y): The thing (phenomenon or event) that the researcher is trying to explain.
Intervening variable: A third variable that is simultaneously independent (to the original dependent variable) and dependent (to the original independent variable).
Positive relationship: A relationship between variables in which change in one variable brings about the same kind of change (i.e., "in the same direction") for another variable. When variables are positively related, an increase in one brings about an increase in the other, and a decrease in one brings about a decrease in the other.
Negative relationship: A relationship between variables in which change in one variable brings about an inverse change (i.e., "in the opposite direction") for another variable. When variables are negatively related, an increase in one brings about a decrease in the other, and a decrease in one brings about an increase in the other.
Linear relationship: A relationship between variables in which change in one variable brings about a constant amount of change for another variable.
Validity: A measurement principle in which the variables you use actually demonstrate the concepts you choose to observe. If your measurements are valid, then you really are measuring what you think you are measuring.
Reliability: A measurement principle in which the measurement procedures you use can generate the same measurements if they were to be repeated at a different time, on a comparable sample, or (in qualitative research) by a different researcher.
Error: A measure of the amount of empirical observations that cannot be described by the hypothesized model. There are two sources of error:
Random error occurs because of the complexity of forces and diversity of people that make up the real world that researchers study. Ultimately, researchers are unable to eradicate the occurence of random error.
Systematic error occurs because of a flaw in the operationalization of a research design. Researchers must try to reduce the occurence of random error as much as possible.
Categorical variables: Variables that depict attributes or categories of a concept that cannot be reduced to a number or numerical scale; they vary in kind. There are two kinds of categorical variables:
Nominal variables do not vary according to a specific order. The categories of nominal variables are simply names.
Ordinal variables vary according to a specific order, but the degrees of separation between their ranks cannot be numerically specified.
Numerical variables: Variables that depict attributes or categories of a concept that can be reduced to a number or numerical scale; they vary in degree. There are two kinds of numerical variables:
Interval variables vary according to a specific order; the degrees of separation between their ranks can be numerically specified, but no true zero point orders their measured variation.
Ratio variables vary according to a specific order; the degrees of separation between their ranks can be numerically specified, and a true zero point orders their measured variation.
Controlling for variables: A technique in explanatory analysis in which all possible determinants of a dependent variable are held constant, save one: the suspected causal independent variable.
Commonly known as "average,"
a measure of central tendency determined by adding up the
quantities of each unit in a distribution and then dividing
by the number of units.
A measure of central
tendency that represents the midpoint in a distribution of
ordered data.
A measure of central
tendency that represents the most frequent value in a
distribution.
A measure that indicates the
strength of the linear relationship between two numerical
variables. As r tends to +1, the two variables are
more strongly positively associated; as r tends to
-1, the two variables are more strongly negatively
associated.
An empirical standard of
confidence that the measured relationship between variables
is unlikely to have occurred by chance alone. Although there
are different formulae and standards for determining
statistical significance, most require that the probability
of the measured relationship occuring because of chance be
less than 5 percent to be statistically
significant.
A table format for reporting
measurements of relationships between variables. The number
of columns correspond to the number of categorical
attributes in the independent variable (x); the number of
rows correspond to the number of categorical attributes in
the dependent variable (y); and each cell reports the
proportion of the sample for each type of relationship
between variables.
A relationships between
variables in which at least three criteria are empirically
established: 1. X and Y are
correlated. 2. X precedes Y in
time. 3. The X-Y
relationship is nonspurious; no other competing
variables account for the observed X-Y
relationship. Carter (2001: 19)
offers a fourth and final criteria: 4. The X-Y
relationship is intuitively pleasing; it fits with
our current understanding of how the world
functions.
A relationship between
independent and dependent variables in which the
hypothesized relationship is in fact caused by the influence
of a third variable that is independent and antecedent to
the other two.
In crosstabulations, spuriousness is revealed when the
direction and strength of the originally hypothesized
relationship (i.e., the difference of percentages in cells
of each row) diminishes or disappears to zero after a
third variable is controlled for with the use of partial
tables.
A hypothesis in which the
effect of more than one independent variable is studied.
In crosstabulations, a multivariate model is revealed by
changed but consistent relationships (i.e., strengthened in
one, weakened in the other) in the originally hypothesized
relationship after a third variable is controlled for
with the use of partial tables.
A third variable that
logically falls in a time sequence, and systematically
explains the hypothesized relationship, between the
independent and dependent variables.
In crosstabulations, an intervening variable is revealed
when the direction and strength of the originally
hypothesized relationship (i.e., the difference of
percentages in cells of each row) diminishes or disappears
to zero after a third variable is controlled for by
constructing partial tables. Note that this is the same
effect revealed by spuriousness; only logical analysis about
the sequence of the third variable can determine whether it
is an intervening variable or hides a spurious
relationship.
The tendency for a third
variable to interact with the independent variable, thereby
altering the relationship between the independent and
dependent variables. Thus, the originally hypothesized
relationship between the independent and dependent variables
will vary under different conditions of the third
variable.
In crosstabulations, an interaction effect is revealed
when the originally hypothesized relationship maintains
itself in one partial table but diminishes or disappears to
zero in the other after a third variable is controlled
for.
Quantitative social research
in which one systematically asks many people the same
questions, then records and analyzes their answers.
A measurement instrument
that provides written instructions and questions which
respondents self-administer in order to provide data for
analysis.
A measurement instrument
that provides instructions and questions which the
researcher verbally administers to informants in order to
gather data for analysis.
Research in which one does
not gather data oneself but reexamines data previously
gathered by someone else to test original
hypotheses.
Survey research questions in
which respondents choose from a fixed set of
answers.
Survey research questions
which respondents answer in their own words.
The kind of empirical case
or unit that a researcher observes, measures, and analyzes
in a study.
The percentage of
respondents who complete a questionnaire or interview.
Although researchers disagree about what constitutes an
acceptable response rate, most consider anything below 50%
to be poor and over 90% to be excellent.
A measure that sums or
combines many separate measures of a variable.
An index that measures the
intensity, direction, level, or potency of a variable
constructed along a continuum. Most scales measure ordinal
variables.
A form of statistical
analysis that is especially useful in explaining
multivariate models. A regression equation expresses the
relationship between two or more variables algebraically,
estimating the average change in a dependent variable given
a change in the independent variable(s). In its simplest
(linear) form, a regression equation is usually written: a (alpha) is the constant
(a.k.a. intercept) b (beta) is the
regression coefficient (a.k.a. slope) X is the independent
variable e is the error
term
Y = a + b1X1 +
b2X2 + ... +
bkXk + e
where...Y is the dependent
variable
An operational construct
that represents a categorical variable as a
two-category numerical variable for statistical
analysis. E.g., for gender: female = 1, male = 0.
A group of subjects that are
selected for study in order to make generalizations about a
broader population.
The group of all subjects,
either known or unknown, from which a sample is
selected.
The individual units, often
individual persons, that comprise a sample.
Methods for drawing a sample
in which the probability of selecting population elements is
known. With probability sampling methods, the researcher
uses random sampling so that the representativeness of the
sample characteristics to the (known) population
characteristics can be statistically calculated.
Methods for drawing a sample
in which the probability of selecting population elements is
not known. With nonprobability sampling methods, the
researcher must explicitly explain how the sample represents
the population from which it was drawn.
The list from which the
elements of the population are selected. Sampling methods
are only as sound as the sampling frame operationalizes the
population.
A nonprobability sample in
which elements are drawn based on their availability to the
researcher. Also known as a convenience sample.
A nonprobability sample in
which the researcher asks the initial elements, usually
people, to refer other potential elements for inclusion in
the sample. The process is repeated until the sample grows
(or snowballs) to the size desired by the
researcher.
A nonprobability sample in
which the researcher selects elements for a specific
purpose, usually because of the unique position of the
elements.
A probability sample that is
organized to capture known group differences among the
population. In the first stage, elements are sorted into
separate groups (or strata) according to the selected group
characteristics, e.g., men and women, different racial and
ethnic groups. In the second stage, elements are randomly
sampled from within the strata.
Research about social groups
and behavior based on careful observation in a natural
social environment. Field research can entail any research
methods, not just participant observation.
Literally, "people" +
"writing." The description of a group, culture, or social
practice as its members understand it.
Research that begins with
the observation stage. Typically, inductive researchers let
their research questions emerge from their initial
observations and analysis.
A method of research
involving participating and observing first hand in the
social behavior and groups you are studying.
A problem of validity that
occurs when subjects are aware they are being studied and
alter their behavior from its "natural" patterns.
Writings in which a field
researcher records his or her personal observations (the
indicators of participant observation). Field notes tend to
include at least (1) thick descriptions, (2) running
hypotheses, and (3) notes for further
investigation.
Drawing together multiple
types of evidence gathered from different sources using
different methods of data collection.
In an experiment, the
comparison group who is not exposed to the experimental
treatment (i.e., the independent variable).
Research conducted without
the knowledge or consent of those being studied or by
misrepresenting the role of the researcher.
A setting for social
behavior where the field researcher can enter without
permission.
A setting for social
behavior where the field researcher must obtain permission
from members before entering.
A research method that seeks
to isolate the effects of an independent variable on a
dependent variable under strictly controlled
conditions.
The independent variable
that is administered to subjects in an
experiment.
In an experiment, the group
who is exposed to the treatment (i.e., the independent
variable).
In an experiment, the
baseline measure that is compared with the measures taken
after the treatment (i.e., posttest).
In an experiment, the
measure of the effect of the experimental treatment that is
often compared to a pretest.
The extent to which an
experiment has actually caused what it appears to
cause.
The extent to which an
experiment can be generalized to other settings, other
treatments, and other subjects.
In an experiment, the degree
to which the observed effect of the treatment is actually
due to the effect of measuring the dependent variable twice
(in pretest and posttest measures).
In an experiment, when
subjects demonstrate the expected effect without being
exposed to the treatment, solely because they are
participating in the experiment.
The most fundament
experimental design: the experimental and control group are
given the pretest, an experimental group is given the
treatment, and the experimental and control group are given
the posttest.
An experimental design with
two experimental and control group pairs, one of which is
not exposed to pretest measures.
An experimental design where
neither the experimental nor the control group is exposed to
pretest measures.
A measure of a study's
ethics: are the risks to subjects and science greater than
the benefits to science and society?
An unobtrusive method for
analyzing texts that involves four stages. 1. Variable
construction: deciding
which characteristics or themes of the texts
are to be analyzed and how they are to be
observed 2.
Sampling: selecting the texts
to be analyzed 3.
Observation: code (or
measure) each text for the characteristics or
themes 4.
Analysis: aggregate the
measurements and make numerical descriptions of the
texts.
Elements of a text that are
already observable and readily measurable without (much)
interpretation by the content analyst.
Elements of a text that
require interpretation by the content analyst before they
can be observed and measured.
In content analysis, the
need to confirm the accuracy of a coder's measurements by
checking (some sample of) them against a second coder's
measurements.
These definitions have been developed by Leonard Nevarez and in some cases have been informed by the following sources:
Babbie, Earl. 1992. The Practice of Social Research. 6th ed. Belmont, CA: Wadsworth PubBaker, Therese L. 1994. Doing Social Research. 2nd ed. New York: McGraw-Hill.
Berg, Bruce L. 2001. Qualitative Research Methods for the Social Sciences. 4th ed. Boston: Allyn and Bacon.
Carter, Gregg Lee. 2001. Doing Sociology with Student CHIP. 3rd ed. Boston: Allyn and Bacon.
Neuman, W. Lawrence. 2000. Social Research Methods. 4th ed. Boston: Allyn and Bacon.
Schutt, Russell K. 1996. Investigating the Social World. Thousand Oaks, CA: Pine Forge.