The Small Data Problem: Using Bayesian Networks in Endangered Species Policy Development


Bayesian networks are commonly used to address "big data" problems and can also model expert knowledge in the absence of any data. Between these extremes lies a broad class of small data problems, which I define as those where causal explanations are sought from observational datasets whose sample sizes are small relative to the number of dimensions. Many of these problems are central to ongoing, important policy debates, but machine learning techniques and standard statistical analyses are generally of little help with them. Using examples from endangered species policy development, I present an analysis workflow based on causal identification, model instantiation with informed priors, and Bayesian updating to generate models that blend existing knowledge and available data. Such models can play an important role in decision-making where policy alternatives cannot be tested experimentally or where datasets are constrained.
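
As a rough illustration of the "informed priors plus Bayesian updating" step, the sketch below updates one row of a conditional probability table with a Beta-Binomial model. It is not taken from the presentation: the node states, the expert probabilities, the prior weight, and the observation counts are all hypothetical values chosen for illustration, and a conjugate Beta prior is only one of several ways an informed prior could be encoded.

```python
def update_cpt_row(prior_prob, prior_weight, events, trials):
    """Posterior mean of P(event | parent state) under a Beta-Binomial model.

    prior_prob   -- expert estimate of the probability (informed prior mean)
    prior_weight -- how many "virtual observations" the expert opinion is worth
    events       -- number of observed cases in which the event occurred
    trials       -- total number of observed cases for this parent state
    """
    # Encode the expert estimate as Beta(alpha, beta) pseudo-counts.
    alpha = prior_prob * prior_weight
    beta = (1.0 - prior_prob) * prior_weight
    # Conjugate update: add observed counts to the pseudo-counts.
    post_alpha = alpha + events
    post_beta = beta + (trials - events)
    return post_alpha / (post_alpha + post_beta)


# Hypothetical two-node network: habitat disturbance -> population decline.
# Expert-elicited P(decline | disturbance state); illustrative values only.
expert_prior = {"high_disturbance": 0.8, "low_disturbance": 0.2}
prior_weight = 10  # assumption: expert opinion counts as 10 observations

# Hypothetical small dataset: (herds that declined, herds observed).
observations = {"high_disturbance": (3, 4), "low_disturbance": (1, 6)}

posterior = {
    state: update_cpt_row(expert_prior[state], prior_weight, *observations[state])
    for state in expert_prior
}

for state, prob in posterior.items():
    print(f"P(decline | {state}) = {prob:.3f}")
```

With so few observations, the posterior stays close to the expert estimate; a smaller `prior_weight` would let the data move it further. Choosing that weight is a modelling judgment rather than something the data can settle.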

Presenter Biography

Steve Wilson has over 25 years of experience working at technical and professional levels in strategic and operational planning for public and private-sector clients. He specializes in quantitative approaches to decision support and policy analysis. Steve holds a Ph.D. in wildlife ecology from the University of British Columbia in Vancouver.

