You can also download the pdf file by click here.
Studies included in the research synthesis were
identified through a three-step process. First, we conducted a thorough search
for all studies included in ERIC (1966-2002) through FirstSearch
with the following keywords: (distan* and education, distan* and learning, distan* and
teaching, distan* and instruction, online and
education, online and learning, online and teaching, online and instruction,
on-line and education, on-line and learning, on-line and teaching, on-line and
instruction, web-based and education, web-basedand
learning, web-based and teaching,
web-based and instruction, virtual and education, virtual and learning, virtual
and teaching, virtual and instruction). The search identified 8840 potentially
relevant articles. Citation information for all 8840 articles was then
transferred into EndNote (5.0) to build the first
database.
At the second stage, the database was further examined
based on the following criteria:
1. The article had to be published in journals. The
decision of including only journal articles was based on the concern of study
quality. Previous research reviews on distance education had pointed out the
low quality problem of most studies. We
believed that journal articles were of higher quality because of peer review
procedures. Only including journal articles may result in publication bias but
we believed that the risk was minimal as there had not been a dominant paradigm
for distance education over the years to cause a certain bias against or for
positive, negative, or non-significant findings.
2. The article must have complete reference information
(author, date, source, etc.)
3. The article had to include at least one evaluation
study of distance education. The specific outcome measured was not limited.
4. The article must have at least one comparison study
on distance education and face-to-face education. Studies in which students’
own pre-treatment scores served as controls for their post-treatment scores and
those in which one distance course was compared with another distance course
were excluded.
5. The article must have some empirical data. Articles
were not included if they merely describe a distance education course.
6. The article had to include enough statistical
information for computing an effect size. The specific information we were
looking for was means, standard deviation, sample size for both the distance
education group and the face-to-face group, or t value, F value and
degree of freedom (df).
At this stage,
1100 articles that either were not journal articles or didn’t have complete
reference information were removed from the database. Then the research team
read the abstracts of all 7740 articles, of which, 6365 articles were removed
because they didn’t meet criterion 3 or 4. Articles that could not be clearly
decided at this stage were kept in the database to be dealt with at a later
stage. A total of 1375 references were left after this selection.
At the third stage, the research team collected and read
all the 1375 articles and excluded those that didn’t have empirical data. As a
result, 421 articles were left after this elimination. The research team then
read and coded all 421 articles and found that only 49 articles contained
sufficient information for calculating the effect size. For fear of missing
some articles that may actually have the information due to the large number of
articles and complexity of the database, the research team examined all 421
articles one more time and identified 2 more articles that have complete
information to calculate the effect size. Ancestry search was conducted but no
extra articles met all the criteria were found. Finally, 51 journal articles
were included for the final analysis.
To identify those methodological and substantive
characteristics that may be responsible for significant variations in the
findings, a detailed analytical framework was developed through an iterative
process. The framework was developed based on our understanding of possible
sources of variation in the studies. The heterogeneity in outcomes across
studies can come from three sources: the publication, the study, and the
instruction. Figure 1 depicts the logic model underlying the analytical
framework.
Figure 1:
Logic Model of Distance Education Effectiveness
For
publication and study features, we started with Stock (1994)’s seven categories
for describing research reports: report identification, the setting of the
study, subjects, methodology, treatment characteristics, statistical outcomes
or effect sizes, and coding process. Instructional features refer to the
characteristics of the distance education program under study. It has been
argued that distance education should be considered as education at a distance (Shale,
1990):
In sum, distance education ought to be regarded as
education at a distance. All of what constitutes the
process of education when teacher and student are able
to meet face-to-face also constitutes the process of
education when teacher and student are physically
separated. (p. 334)
In other words, the
quality of distance education programs is influenced by the same set of factors
that affect the quality of face-to-face education. Schwab(Schwab,
1983) characterizes education
in terms of four common places of education: teacher, student, what is taught,
and milieux of teaching-learning. This
characterization is also applicable to the study of distance education. While
teacher, student, and what is taught remain pretty much the same as face-to-face
education, the milieux of teaching-learning is
different between distance and face-to-face education in that the milieux of teaching-learning of distance education is
mostly mediated through some kind of technology. Hence we describe the milieux of teaching-learning in terms of the format and
method of delivery. Finally, we used the grounded theory(Glaser,
1992; Glaser & Strauss, 1967) to guide the development of the framework. Following a typical
constant-comparison process, we started coding the studies with the initial
framework and then modified the framework when we came across new features
during the coding process. Table 1 summarizes the variables included in the
final framework. In the following paragraphs, we describe these variables and
reasons for their inclusion in the framework.
Table 1. Variables
Analyzed in this Study
|
Category |
Variable |
Scale |
Source |
|
Publication Features |
Publication Year |
|
Publication |
|
Instructor as Author |
Yes/No |
Publication |
|
|
Study Features |
Study Design |
Quasi-experimental or Experimental |
Researcher Coding |
|
Measurement |
standardized test,
Researcher-developed instruments, Instructor-developed instruments, and
Published instrument |
Publication |
|
|
|
Indicators of effectiveness |
Grades, student satisfaction,
faculty satisfaction, standardized tests |
Publication |
|
Results |
Sample size, mean, standard
deviation, t value, p value |
Publication |
|
|
Instructional Features: Instructor |
Instructor Involvement |
1 to 10 1=No involvement 10=Complete involvement |
Researcher Coding |
|
Status |
Professor, graduate student or
professional |
Publication |
|
|
Instructional Features: Learner |
Background |
High school diploma, or college
degree |
Publication |
|
Status |
Full time or part time |
Publication |
|
|
Instructional Features: Curriculum |
Content Area |
Subject taught |
Publication |
|
Degree |
Yes/no |
Publication |
|
|
Credit |
Yes/no |
Publication |
|
|
Course Time |
|
Publication |
|
|
Class Time |
|
Publication |
|
|
Instructional Features: Milieux |
Interaction type |
Asynchronous interaction, or
Synchronous interaction, or both, or Non-interactive involvement |
Publication |
|
Media involvement |
1 to 10 1=No involvement 10=Complete involvement |
Researcher Coding |
|
|
Setting |
K-12,Graduate, Undergraduate,
Military, etc. |
Publication |
Evidence of Effectiveness
There are different ways to measure the
effectiveness of distance education programs. Studies in distance education
thus differ in what they used as evidence of effectiveness and the reliability
and validity of the evidence used.
The variation in what was measured and the quality of the measurement
may explain the heterogeneity of outcomes.
Outcome
measures. Information about what has been used to assess the effectiveness of
distance programs was collected for each study. A study could use the one or
more of following measures: Grades, Quizzes, Independent/standardized Tests,
Student Satisfaction, Faculty Satisfaction, Dropout Rate, Student Evaluation of
Learning, Student Evaluation of Course, External
Evaluation and Cost Effectiveness. Grades usually are the final scores students
received for the class. Student Evaluation of Learning is students’ perception
of how much they learned from this course, which can be significantly different
from the grades they received. Cost Effectiveness is the ratio of costs and
students achievements.
Source
of Instrument. The source of instruments
used to measure effectiveness can affect the final outcomes in that instruments
from different sources may have different levels of reliability and validity.
We identified four sources of the most frequently used measures: commercial
testing agencies, the researcher (the author of the article), the instructor of
the course, and publishers of textbooks that include assessment items.
Study design. The design of a study is a
good indication of its quality and thus the quality of the results. We were
curious about whether certain type of design is associated with the study
results, thus we coded the studies into two categories: true experimental or
quasi-experimental. The
differentiating characteristic between the two designs is whether random
sampling method was used.
Study Results. To calculate effect
sizes, the results of each study were recorded in the database. The study
results include means, standard deviations, t values, F values,
and r values, depending on what is reported in the primary study.
Factors
Affecting Effectiveness
Factors that may affect the effectiveness of
a distance education program can be categorized into two groups: publication
features and instructional features. Two publication features were identified
as possible factors affecting effectiveness: publication year and instructor as
author.
As mentioned before we used Schwab’s four
common places of education to guide the identification of instructional
features that can potentially affect the effectiveness of distance education
programs: the teacher, the student, the curriculum, and the milieux.
In each of the four common places are a number of potential factors that
contribute the outcomes of learning.
Teacher: Instructor Involvement. The
extent to which the instructor of a distance education course is involved in
the actual delivery of the content and available for interactions with students
during and outside the class sessions is termed ‘instructor involvement.” The level of instructor involvement is
perhaps one of the most defining differences between traditional face-to-face
education and distance education. In face-to-face education, the instructor
generally delivers the content live and interacts with students both in and
outside class meetings whereas in distance education programs the level of
instructor involvement varies a great deal, from one extreme where the content
is pre-programmed and delivered through some technology means without the
actual involvement of an instructor to another where the instructor actually
delivers the content live and is available for interactions with students in
very much the same fashion as face-to-face education. Interactions between the
teacher and students have been found to affect the quality of student
experiences and learning outcomes in distance education (The Institute for
Higher Education Policy, 2000). How content is delivered should also have an
effect on student learning experiences and outcomes. This is also a hot topic
for distance education programs because one of the appeals of distance
education is the potential to increase efficiency by reducing the demand of
actual involvement of faculty. Having one faculty to teach thousands of (or
even more) students with the help of broadcasting, recording, and computing
technologies has been a dream for many advocates of distance education.
However, if higher-level of instructor involvement becomes a requirement for
effective quality programs, the efficiency dream may
never be realized. On the other hand, if the level of instructor involvement is
found to be irrelevant to student outcomes, it would be unnecessary to assign
an instructor to only a small group of students or be actually involved in the
teaching. A pre-programmed video or computer program can accomplish as much. To
assess the value of instructor involvement, we hence included this factor. We
coded instructor involvement on a scale from 1 to 10, with 1 indicating no
human instructor involvement (e.g., computer-based training) and 10 full
involvement of a human instructor (e.g., two-way interactive TV courses).
Teacher: Status of the Instructor.
Another way distance education programs have sought to increase efficiency is
to employ non-regular faculty, who normally costs less than regular faculty.
Thus we are interested in finding out whether instructor status influence
student learning in distance education programs.
Teacher: Training for Teaching Distance Courses. It has been argued that instructors of distance education programs
should be trained first because distance education is a different teaching
environment from face-to-face classrooms. Again, the training has cost implications
for programs. To test whether training affects student learning, we collected
information about teacher training from each study, if that information is
available.
Student: Education Level. We collected
information about student’s educational attainment level before attending the
distance course to examine whether, and if so which level, certain types of
students are more prepared to take distance education courses.
What is Being Taught: Content Area. Some content may be more suited for distance education while others
may be better taught in face-to-face course. Interested in finding if this
assumption is true and if so, what content area is better suited for distance
education, we collected information about the content area of each study. A
course can be teaching one of the following subject areas: Social Science,
Mathematics, Science, Medical Science, Literacy, Humanities, Business, Law,
Engineering, Computer Science, Teacher Education, and Skills. Skills here represent any professional
training that doesn’t fall into other categories. We coded medical sciences,
business education, and teacher education separately because they have been one
of the most commonly taught content areas in distance education.
The Milieux: Instructional Level. Distance education programs have been traditionally intended for
adults but recently it has expanded to include younger audiences. As a related
factor to student characteristics and content, the instructional level of
distance education may be associated with its effectiveness. We grouped the
distance education programs in each study into 10 levels: K-2(Lower
elementary), 3-5(upper elementary), 6-9(middle school), 10-12(high school),
Associate Degree(community college), Undergraduate Level(4 year college),
Graduate Level, Professional Development, and Military Training. We also
collected data about whether the course is for credit or not and whether it is
for degree granting program or not.
The Milieux: Interaction Type. Interaction type characterizes how instructors and students
interact in the distance learning process. There are four types of interaction:
asynchronous, in which a time lag exists between the interactions of the
instructor and students in that students may ask a question via email, to which
the instructor may respond two days later; synchronous, where the
potential exists for instructors and students to interact at the same time; non-interactive,
where there is no interaction between instructors and students at all; and both
synchronous and asynchronous interaction, where the instructor can
interact with students both synchronously and asynchronously.
The Milieux: Media Involvement. Distance education programs also vary in the level of technology is
used. Some programs employ a “mixed-model,” in which part of the instruction is
conducted face-to-face whereas some others are completed delivered via
technology. Proponents of the “mixed-model” suggest that some face-to-face
contact is necessary or desirable to maintain student motivation and thus
higher quality of education. We are interested in testing this hypothesis. Thus
we coded each study’s level of media involvement, which is defined as the
extent a certain instructional delivery system has been mediated by
technologies, i.e., how frequently technology is used in a program. Media
Involvement is coded on a scale from 1 to 10, where 1 indicating no technology was
used while 10 indicating that the instruction was delivered completely with
technology.