The elephant in the room: Predictive performance of PLS models

https://doi.org/10.1016/j.jbusres.2016.03.049Get rights and content

Abstract

Attempts to introduce predictive performance metrics into partial least squares (PLS) path modeling have been slow and fall short of demonstrating impact on either practice or scientific development in PLS. This study contributes to PLS development by offering a comprehensive framework that identifies different dimensions of prediction and their effect on predictive performance evaluation with PLS. This framework contextualizes prior efforts in PLS and prediction and highlights potential opportunities and challenges. A second contribution to PLS development lies in proposed procedures to generate and evaluate different types of predictions from PLS models. These procedures account for the best practices that the new framework identifies. An outline of the many powerful ways in which predictive PLS methodologies can strengthen theory-building research constitutes a third contribution to PLS development. The framework, procedures, and research guidelines hopefully form the basis for a more informed and unified development of the rigorous theoretical and practical applications of PLS.

Section snippets

Introduction and motivation

Business research has adopted PLS path modeling, known simply as PLSPM or just PLS, as a methodological tool that bridges exploration with explanation. The popularity of PLS stems partly from its ability to produce parameter estimates of complex models without many of the distributional and other constraints of traditional parametric methods. PLS achieves these estimates by using an iterative estimation procedure considered “predictive” in nature (Hair, Ringle, & Sarstedt, 2011). Yet, the PLS

Two key features of predictive performance in predictive analytics

As early as the 1950s, thinkers of the philosophy of science recognized the fundamental difference between explanation and prediction and the necessity of both (Dubin, 1969). Management research typically focuses on explanatory models. But with the recent popularity of Big Data and predictive analytics, some predictive modeling now appears in the management literature.

Before undertaking an examination of predictive performance in the context of PLS models, please note that in contemporary

Predictive performance of PLS models: A conceptual framework

Unlocking the full potential of PLS predictions requires understanding the full picture of what is possible with predictions in PLS, where current proposals fit into this picture, and which of these proposals supports the goal of predictive performance evaluation of a PLS model.

A procedure for evaluating predictive performance of a PLS model

The lack of established techniques and tools to generate and evaluate predictions from PLS has hindered progress on creating predictive methodology for PLS. This study proposes a set of procedures for prediction and evaluation that together create a framework for evaluating the predictive performance of PLS models. The framework is tailored to the desirable properties of predictive performance evaluation highlighted earlier: out-of-sample, item-level, and operative. Specifically, the procedure

Enhancing PLS models using theory-informed prediction

As demonstrated, PLS models can be used not only to quantify and test a causal theory but also to generate point predictions and prediction intervals for manifest items both in-sample and out-of-sample. These predictions are based on a causal model, unlike predictions typically generated by machine learning algorithms. The ability of PLS to generate predictions and evaluate their performance gives PLS a self-diagnostic capability that permits use of predicted outcomes and metrics to evaluate

Conclusions and future directions

Being able to generate empirical case-wise out-of-sample predictions from a model and to evaluate the predictive power of explanatory models is vital to theory building and evaluation. Prediction using PLS models differs from machine learning algorithms and even from regression models in that PLS allows a researcher to incorporate a theoretically justified causal structure into the prediction process. Although regression models are useful for prediction, “[they are] most effective when dealing

Acknowledgments

The authors thank Kwok-Kee Wei and Loretta Fung for their feedback and suggestions. Thanks also to the organizers and participants of the 2nd International Symposium on Partial Least Squares Path Modeling (Seville, 2015) for discussions and feedback on prediction and PLS. Special thanks (in alphabetical order) to Jean-Michel Becker, Gabriel Cepeda, Wynne Chin, Nicholas Danks, Theo Dijkstra, Jorg Henseler, Christian Ringle, Ed Rigdon, José L. Roldán, Mikko Rönkkö, and Marko Sarstedt for sharing

References (39)

  • W.W. Chin

    The partial least squares approach to structural equation modelling

  • W.W. Chin

    Bootstrap cross-validation indices for PLS path model assessment

  • W.W. Chin

    On partial least squares variance-based SEM (VBSEM) vs. covariance-based SEM (CBSEM) for confirmatory analysis: It's all about the components and variance explained

  • G.A. Churchill

    A paradigm for developing better measures for marketing constructs

    Journal of Marketing Research

    (1979)
  • J. Dana et al.

    The superiority of simple alternatives to regression for social science predictions

    Journal of Educational and Behavioral Statistics

    (2004)
  • A. Diamantopoulos et al.

    Guidelines for choosing between multi-item and single-item scales in construct measurement: a predictive validity perspective

    Journal of the Academy of Marketing Science

    (2012)
  • T. Dijkstra

    Latent variables and indices: Herman Wold’s basic design and Partial Least Squares

  • T.K. Dijkstra et al.

    Consistent partial least squares path modeling

    MIS Quarterly

    (2015)
  • R. Dubin

    Theory building: A practical guide to the construction and testing of theoretical models

    (1969)
  • Cited by (796)

    View all citing articles on Scopus
    View full text