Causal Inference and Observational Data: Editorial Open Access
Abstract
Observational studies using causal inference frameworks can provide a feasible alternative to randomized
controlled trials. Advances in statistics and machine learning, together with access to big data, facilitate
unraveling complex causal relationships from observational data across healthcare, social sciences, and other
fields. However, challenges such as model evaluation and bias amplification remain.
© The Author(s) 2023. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use,
sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and
the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included
in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will
need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The
Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available
in this article, unless otherwise stated in a credit line to the data.
Olier et al. BMC Medical Research Methodology (2023) 23:227
utilization of instrumental variables, regression discontinuity designs, and quasi-experimental approaches
as methodological advancements further augment the understanding of complex social phenomena, policy
impacts, and economic relationships [4, 5].

Broadly speaking, causal inference attempts to build data-driven models that can predict the effect of
interventions on outcomes. Using observational data for causal inference is gaining momentum due to a
confluence of factors, such as the availability of larger, richer, and more complex data and advanced
techniques from statistics and ML. In general, two frameworks exist for causal inference in observational
studies, which are not necessarily mutually exclusive: the structural causal model (SCM) framework and the
potential outcome framework (POF). The SCM framework relies on deterministic, functional equations to
construct directed acyclic graphs (DAGs) with variables as nodes and links as causal relationships, and is
particularly useful in identifying unknown causal and confounding variables while estimating the actual
effect of a given treatment. On the other hand, the POF (also known as the counterfactual framework)
examines outcomes that would likely have been observed had the treatment differed, representing the
counterfactual or the missing outcome. Other frameworks such as instrumental variables, mediation analysis,
and Bayesian networks are also noteworthy in causal inference research [6].

In recent years, there has been growing interest in combining multiple frameworks and approaches to improve
causal inference. Integrating ideas from different frameworks can lead to more comprehensive and robust
causal analyses. Additionally, the use of machine learning techniques and the exploration of new
identification strategies hold promise for advancing causal inference research [7]. Analysis of
observational studies could benefit from the best of both worlds. ML [...] data and integrating them with
structured data, thereby enhancing the depth of insights and broadening the applicability of causal
inference from observational data.

However, causal inference with observational data is not free of challenges. For instance, causal inference
models are hard to evaluate. Even if a causal link is found, there is no clear mechanism to assess whether
the link is real. The performance of associative data-driven models can be assessed and compared easily
since large data repositories are publicly available and widely used. However, this is not the case for
causal inference, for which the lack of public benchmark data is one of the biggest problems encountered in
model development. There is also a lack of comparisons to non-causal methods in the literature [9]. Making
untestable assumptions is also inevitable, which could contribute to bias amplification and harm external
validity when compared to non-causal counterparts [10].

As the field continues to advance, interdisciplinary collaborations, methodological innovations, and the
integration of emerging technologies will continue to expand the frontiers of causal inference and its
applications in various domains. Nevertheless, these challenges must be addressed for swift adoption in
social and medical research.

Abbreviations
DAG directed acyclic graph
ML machine learning
POF potential outcome framework
RCT randomized controlled trial
SCM structural causal model

Authors’ contributions
IO conceived and drafted the Editorial. YZ, XL, VV revised the Editorial. All authors read and approved the
final manuscript.

Funding
No funding was obtained for this editorial.

Data Availability
Not applicable.
References
1. Hernán MA. Methods of Public Health Research — Strengthening Causal Inference from Observational Data. New England Journal of Medicine [Internet]. 2021 Oct 7 [cited 2023 May 23];385(15):1345–8. Available from: https://www.nejm.org/doi/full/10.1056/NEJMp2113319.
2. Hemkens LG, Ewald H, Naudet F, Ladanie A, Shaw JG, Sajeev G, et al. Interpretation of epidemiologic studies very often lacked adequate consideration of confounding. J Clin Epidemiol. 2018;93:94–102.
3. Sanchez P, Voisey JP, Xia T, Watson HI, O’Neil AQ, Tsaftaris SA. Causal machine learning for healthcare and precision medicine. R Soc Open Sci. 2022;9(8).
4. Rohlfing I, Zuber CI. Check Your Truth Conditions! Clarifying the Relationship between Theories of Causation and Social Science Methods for Causal Inference. Sociol Methods Res [Internet]. 2021 Nov 1 [cited 2023 May 23];50(4):1623–59. Available from: https://journals.sagepub.com/doi/10.1177/0049124119826156.
5. Varian HR. Causal inference in economics and marketing. Proceedings of the National Academy of Sciences [Internet]. 2016 Jul 5 [cited 2023 May 23];113(27):7310–5. Available from: https://www.pnas.org/doi/abs/10.1073/pnas.1510479113.
6. Shi J, Norgeot B. Learning Causal Effects from Observational Data in Healthcare: A Review and Summary. Front Med (Lausanne). 2022;9:864882.
7. Prosperi M, Guo Y, Sperrin M, Koopman JS, Min JS, He X, et al. Causal inference and counterfactual prediction in machine learning for actionable healthcare. Nat Mach Intell. 2020;2(7):369–75.
8. Luo Y, Peng J, Ma J. When causal inference meets deep learning. Nature Machine Intelligence [Internet]. 2020 Aug 12 [cited 2023 May 23];2(8):426–7. Available from: https://www.nature.com/articles/s42256-020-0218-x.
9. Kaddour J, Lynch A, Liu Q, Kusner MJ, Silva R. Causal Machine Learning: A Survey and Open Problems. arXiv:2206.15475 [Internet]. 2022 Jun 30 [cited 2023 May 23]; Available from: http://arxiv.org/abs/2206.15475.
10. Hammerton G, Munafò MR. Causal inference with observational data: the need for triangulation of evidence. Psychol Med [Internet]. 2021 Mar 1 [cited 2023 May 23];51(4):563–78. Available from: https://www.cambridge.org/core/journals/psychological-medicine/article/causal-inference-with-observational-data-the-need-for-triangulation-of-evidence/AF5F7918753DF50F26B1D49561F0DF83.

Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.