
Forecasting Turmoil in Indonesia:
An Application of Hidden Markov Models (2004)
Joe Bond, Vladimir Petroff, Sean O'Brien, Doug Bond
Abstract
In this paper we illustrate an application of
hidden Markov models to the problem of short term forecasting
of conflict escalation/de-escalation using Indonesia as an example.
Utilizing Integrated Data for Events Analysis (IDEA),
the Baum-Welch estimation procedure and the Kullback-Leibler
Distance measure trained, estimated, and evaluated the relevancy of the models.
Employing a MATHEMATICA©-based simulator,
we generate the most likely monthly scenarios as well as
the symbol distributions for Indonesia for the year 1999.
We conclude with a discussion of some unexpected results
derived from the fifty two generated forecasts.
Link to file of
the paper

Integrated Data for Events
Analysis (IDEA): An Event Form Typology for Automated Events Data
Development (2003)
Doug Bond, Joe Bond, Churl Oh, J. Craig Jenkins, Charles Lewis Taylor
Abstract
This article outlines the basic parameters and current
status of the Integrated Data for
Event Analysis (IDEA) project. IDEA provides a comprehensive events
framework for the analysis of international interactions by supplementing
the event forms from all earlier projects with new event forms needed to
monitor contemporary trends in civil and interstate politics. It uses a
more flexible multi-leveled event and actor/target hierarchy that can be
expanded to incorporate new event forms and actors/targets, and adds
dimensions that can be employed to construct indicators for early warning
and assessing conflict escalation. IDEA is currently being used in the
automated coding of news reports (Reuters Business Briefs) and, in
collaboration with other projects, in the analysis of field reports. We
summarize the conceptual framework being used in this data development
effort, its major variables, and its geographic and temporal
coverage.
Link to file of
the paper

The Integrated Data for Events
Analysis (IDEA) Website
(2002)
Abstract
The IDEA protocol and the VRA Knowledge Manager software system operate
together to automatically generate social, economic, enviornmental and
political events data and to display them in summary from in terms of
event counts and various scales. The IDEA conceptual framework, though
optimized for machine coding, offers a useful extension of a human events
coding tradtion that extends back nearly forty years. We have sought
throughout our development process to preserve backwards compatibility as
well as extensibility. Descriptions, examples and usage notes for each of
the roughly 200 political, social, economic and environmental events are
provided.
Link to the IDEA site

Casualty
Assessment and the IDEA Framework (2002)
Joe Bond (Harvard University), Doug Bond (Harvard University) and
Marianne Abbott (The Ohio State Universtiy)
Abstract
This memo reviews the recent and ongoing research activities of the IDEA project,
with a focus on the challenge of assessing the intensity of acute conflict in
general and the casualties of armed conflict in particular.
Link
to file of the paper

Operationalizing and Assessing TRACE: Tool for the Rapid Assessment of Complex Emergencies
Doug Bond, Joe Bond and Churl Oh
Abstract
“TRACE” (Tool for the Rapid Assessment of Complex Emergencies) is a rapid assessment tool for use by analysts seeking real-time data necessary to identify, track and evaluate crisis situations that may evolve into or have already become CHEs. Virtual Research Associates, Inc. (VRA) first outlined the TRACE approach in the final report of a study that was completed in December 1999. TRACE is designed for use in rapidly assessing a range of complex emergency situations, both natural and human-made in their origin.
Link
to file of the paper

An Automated Information
Extraction Tool For International Conflict Data with Performance as Good
as Human Coders: A Rare Events Evaluation Design
(2002)
Gary King
(Harvard University) and Will Lowe (Harvard University)
Abstract
Despite widespread recognition that aggregated summary statistics on
international conflict and cooperation miss most of the complex
interactions among nations, the vast majority of scholars continue to
employ annual, quarterly, or occasionally monthly observations. Daily
events data, coded from some of the huge volume of news stories produced
by journalists, have not been used much for the last two decades. We offer
some reason to change this practice, which we feel should lead to
considerably increased use of these data. We address advances in event
categorization schemes and software programs that automatically produce
data by "reading" news stories without human coders. We design a method
that makes it feasible for the first time to evaluate these programs when
they are applied in areas with the particular characteristics of
international conflict and cooperation data, namely even categories with
highly unequal prevalences, and where rare events (such as highly
conflictual actions) are of special interest. We use this rare events
design to evaluate one existing program, and find it to be as good as
trained human coders, but obviously far less expensive to use. For large
scale data collections, the program dominates human coding. Our new
evaluative method should be of use in international relations, as well as
more generally in the field of computational linguistics, for evaluating
other automated information extraction tools. We believe that the data
created by programs similar to the one we evaluated should see
dramatically increased use in international relations research. To
facilitate this process, we are releasing this article data on 4.3 million
international events, covering the entire world for the last decade.
Link to Adobe
.pdf file of the paper

Integrated Data for Events
Analysis (IDEA): An Event Form Typology for Automated Events Data
Development
(2001)
Doug Bond, Joe Bond, Churl Oh (Harvard
University and Virtual Research Associates, Inc.), J. Craig Jenkins (The
Ohio State University) and Charles Lewis Taylor (Virginia Polytechnic
Institute and State University)
This paper outlines the heritage, basic
parameters and current status of an automated approach to events data
development that evolved from the PANDA project at Harvard University. The
IDEA project has roots in the World Handbook for Political and Social
Indicators project as well as the Mass Protest in East Europe and KEDS
projects. The ten-year old PANDA project now uses a protocol dubbed IDEA,
for the Integrated Data for Events Analysis, which is used to guide and
inform the automated coding of news, field and other reports. The paper
includes a description of the variables used in the data development
effort, along with a description of the current geographic coverage,
availability and temporal scope of the data sets being produced. It then
reviews the IDEA protocol, focusing on the unit and level of analysis, its
reliability and its main assumptions and its implications for use. Also
included is a general discussion of the strengths and limitations of
automated coding for applied and basic research.
Link to Adobe .pdf of the paper (superceded by
Integrated Data for Events Analysis (IDEA): An Event Form Typology for Automated Events Data Development
(2003)

Conflict-Carrying Capacity,
Political Crisis, and Reconstruction: A Framework for the Early Warning of
Political System Vulnerability
(2001)
J. Craig Jenkins (The Ohio
State University) and Doug Bond (Harvard University)
Abstract
This study provides a
conceptual framework for the analysis of conflict-carrying capacity (or
CCC) defined as the ability of political systems to regulate intense
internal conflicts. CCC is indexed by the multiplicative interaction
between the proportions of civil contention, state repression, and
violence. Data from the PANDA project is used to illustrate the usefulness
of this CCC index in capturing system instability in an institutionalized
democracy (the United States), a bureacratic-authoritarian regime
(Mexico), an institutionalized Communist regime (China), and a peaceful
democratic transition (Poland). It provides early warning signals of civil
war (Algeria, Sri Lanka) and moves toward political stability (Peru).
Civil contention and state repression are not destablizing per se. Rather
it is the simultaneous combination of these with violent contention that
leads to internal political crises and, alternatively, to political
stabilization.
Link to Adobe .pdf of the paper (available soon)

Conflict-Cooperation for
Interstate and Intrastate Interactions: An Expansion of the Goldstein
Scale
(2001)
Charles Lewis Taylor (Virginia Polytechnic Institute and State
University), Joe Bond (Harvard University), Doug Bond (Harvard
University), J. Craig Jenkins (The Ohio State University) and Zeynep
Benderlioglu Kuzucu (The Ohio State University)
This paper presents the current state of our project to expand and to
replicate Goldstein's Conflict-Cooperation Scale for WEIS Event Data using
a technique and focusing on intrastate as well as interstate interactions.
We report here the results of a pilot study conducted with the assistance
of a small expert panel. Our intention is to take advantage of a larger
and more diverse set of judges representing the governmental, national
security, commercial and academic communities while controlling for major
demographic characteristics. In our Integrated Data for Event Analysis
(IDEA), classic WEIS categories are supplemented with verb cues drawn from
the World Handbook and other interaction event taxonomies. Panelists are
asked to rank these event categories, with regard to
Contentiousness-accommodation, coercion-altruism, and physical violence
and to overall conflict-cooperation. No assumption is made with regard to
the three hypothesized dimensions within conflict and cooperation.
Interrelationships will be sought empirically. This expansion of the
Goldstein scale will be a useful tool in the analysis of conflict and
cooperation in international and intra-national political interactions.
Link
to Adobe .pdf of the paper

Charting the Korean
Peninsula: A 1990-2000 Assessment of North Korean Event Interactions
(2000)
Joe Bond, Doug Bond and Churl Oh (Harvard University)
This paper introduces the VRAâ Knowledge Manager, a real-time data
development and analysis software system designed to process large volumes
of text-based information. The system is designed to illuminate countries'
vulnerabilities to natural and social hazards by monitoring global news
wire service reports, assessing them and presenting up-to-the-minute
intelligence graphs, maps and tables. Following a short description of the
system, a decade of North Korean events data is analyzed using indicators
that tap dimensions of conflict carrying capacity, forceful action and
conflict and cooperation.
Link to Adobe .pdf of the paper

Mapping Mass Political
Conflict and Civil Society: Issues and Prospects for the Automated
Development of Event Data
(1997)
Doug Bond (Harvard University), J. Craig
Jenkins (The Ohio State University), Charles Lewis Taylor (Virginia
Polytechnic Instititute and State University) and Kurt Schock (Rutgers
University)
Abstract
Mass political conflict
is typically examined in terms of violence and in isolation from routine
civil interactions. We argue that mass conflict is multi-dimensional and
that violence should be treated as an outcome of conflict, as well as a
form of action. We define three dimensions of conflict--contentiousness,
coerciveness, and change goals--and indices of the civil society that are
central to mapping global trends in mass conflict. We then outline a
strategy for mapping mass conflict and civil interactions using the PANDA
protocol to generate highly reliable event data and then use these indices
to trace two democratic transitions (in Poland and South Korea), a
conflict crisis that was repressed (China) and a conflict escalation that
flared into a civil war (the former Yugoslavia). Automation has major
advantages over human coding in terms of transparency, integration with
existing event data series, real time availability and long-term
maintenance costs. It also opens up new ways of thinking about event data
and the assessment of reliability.
URL to Adobe .pdf of the paper (available
soon)

Profiles of International
"Hotspots" (1995)
Doug Bond and William B. Vogele (Harvard
Univeristy)
Abstract
Three profiles are
presented and discussed - conflict intensity, violence signals and
conflict carrying capacity. The examination of the level of sanctions
being wielded in the conflict and consideration of the context of the news
reports in terms of violence results in the conflict intensity profile. In
the violence signals profile we consider reports of threats, reported
demonstrations and reductions in relationships. The threats and
demonstrations signal the possibility of violence and/or mobilization and
decreased bonds and relationships weaken the social prohibitions against
violence. The third profile reveals the conflict carrying capacity of a
system. This profile is designed to illuminate the nature (violent or not)
and locus (within or outside the political system) of conflict and is
presented in two parts to highlight the distinct roles of the two
components. As a backdrop to these three profiles, we graphically present
the conflict context or world news attention, focusing on the volume of
Reuters reporting on the conflict situation over time. This conflict
context is offered to facilitate interpretation of the three profiles,
especially to alert us when the volume of reports over any year in a
particular country is adequate. We identify three countries or "hotspots"
deemed to be most at risk for eruption or escalation into violence:
Afghanistan, Algeria and Turkey. Conflict event data profiles were created
of these armed conflicts, tracing their evolution from January 1984
through April 1995. These event data were coded from more than a half
million worldwide news reports from Reuters. The criteria employed to
select these three countries most at risk include recent peak intensities
in their level of reported sanctions, report context and proportion of
reported threat; recent three year continuous rises in their reported
demonstrations or reductions in relationships; and recent conflict
carrying capacity scores continuously approaching or falling below, their
proportion of foreful action.
Link to Adobe .pdf of the paper (available
soon).

Recovering Events from Events Data (1995)
Karen Rothkin (MIT) and Doug Bond (Harvard
Univeristy)
Abstract
In this paper we draw upon a large (N=530,718) global event data set to
examine source coverage bias to model, in two equations, contextual (i.e.
country) differences while trying to predict violent crises. Given various
patterns of reporting patterns across countries, we draw upon three kinds
of variables--exogenous data on countries; the endogenous event report
stream, including theoretically derived indicators of impending crises;
and historically threaded events, such as elections, that are not usefully
approximated as discrete events. It should be emphasized that this paper
reports on an ongoing, theoretically informed yet largely inductive
modeling exercise. In other words, we do not test an antecedently derived
hypothesis; as the model is refined, we plan to conduct such tests, as
well as encourage independent evaluation. Toward this end, and in the
spirit of the September 1995 PS Symposium on "Verification/Replication," all of our
event data is publicly available.
Link
to Adobe .pdf of the paper.

© 1996-2008 Virtual Research Associates, Inc.
All rights reserved.