sconjoint: Structural Deep Learning for Conjoint Experiments

Welcome

sconjoint is an R package implementing the structural deep learning estimator for forced-choice conjoint experiments developed by Acharya, Hainmueller, and Xu (2026) (paper). The estimator embeds a deep neural network inside a random-utility logit so that each respondent’s preference vector $\hat{\boldsymbol\beta}(\mathbf Z_i) \in \mathbb{R}^p$ varies smoothly and flexibly with her observed covariates $\mathbf Z$. Double / debiased machine learning [DML; Chernozhukov et al. (2018)] inference provides honest respondent-clustered standard errors on all population-level quantities.

The key advance over the standard AMCE framework (Hainmueller, Hopkins, and Yamamoto 2014) is that $\hat{\boldsymbol\beta}(\mathbf Z_i)$ gives the joint distribution of preferences across attributes for each respondent, enabling structural quantities — marginal rates of substitution, counterfactual choice probabilities, preference polarization, willingness to pay — that require the full preference vector rather than one-attribute-at-a-time marginal effects.

This book is the primary user documentation for the package. It walks from installation through four complete worked examples, a simulation sanity check, a reference catalog of the structural estimands, and visualization options.

library(sconjoint)
data(package = "sconjoint")$results[, c("Item", "Title")]

     Item     
[1,] "br2017" 
[2,] "bs2013" 
[3,] "gs2020" 
[4,] "simdata"
[5,] "sw2022" 
     Title                                                               
[1,] "Ballard-Rosa, Martin & Scheve (2017) tax-plan conjoint"            
[2,] "Bechtel-Scheve (2013) climate-treaty conjoint"                     
[3,] "Graham-Svolik (2020) candidate-choice conjoint on democratic norms"
[4,] "Simulated conjoint with known ground truth"                        
[5,] "Saha-Weeks (2022) candidate-choice conjoint"

Bundled datasets

The package ships four conjoint datasets from published replication materials, covering democratic norms, tax preferences, candidate choice, and climate treaties.

These are the paper’s analysis datasets

The bundled gs2020, br2017, and sw2022 datasets are the full analysis frames used in Acharya, Hainmueller, and Xu (2026), carrying the paper’s complete respondent-moderator set $\mathbf Z$ (22, 23, and 19 covariates). The worked examples fit the paper’s production configuration — K = 10 cross-fitting folds, the default 1,000 Adam epochs, and the per-application Stage-2 prior — so the structural quantities they report reproduce the manuscript’s empirical results. These fits are heavier than a quick demo, so each example chunk is cached and the book reuses the saved fit on re-render. (bs2013 is a lightweight teaching example, not one of the paper’s applications.)

`gs2020` — Graham and Svolik (2020): democratic norms

1,605 respondents $\times$ $$13 matchups $\times$ 2 profiles = 41,314 rows. Thirty attribute contrasts (the diff_ columns: party, two policy dimensions, the 16-level democracy code, candidate sex, race, and profession) and the paper’s 22 respondent moderators ($\mathbf Z$): ideology, party, Trump approval, demographics, authoritarianism, political knowledge, issue ideal points, and six direct democracy-attitude items. The paper’s main specification holds out the six direct items and fits on the remaining 16. Raw ideo7, pid7, and the survey weight are carried as convenience columns for subgroup and weighting analyses.

`br2017` — Ballard-Rosa, Martin, and Scheve (2017): tax preferences

2,000 respondents $\times$ 8 tasks $\times$ 2 profiles = 32,000 rows. Seven numeric attributes (six bracket rates plus a revenue-impact indicator) and the paper’s 23 respondent moderators ($\mathbf Z$): demographics, partisanship, ideology, and economic attitudes and beliefs. Bracket rates are rebuilt from the source file’s coded variables and value labels; the distributed file’s derived rate column stores the 45% level of the $175–375k bracket as 5, which affected bundled copies before 0.2.0.9004 (see ?br2017). Raw seven-point resp_pid7 is carried as a convenience column for the by-party analyses.

`sw2022` — Saha and Weeks (2022): candidate choice

1,191 respondents $\times$ 3 tasks $\times$ 2 profiles = 7,146 rows. Thirteen candidate-attribute dummies (gender, prior run, talent, agenda, children) and the paper’s full set of 19 respondent moderators ($\mathbf Z$): gender, age, income, education, party, region, employment status, ideology, vote choice, and a gender-attitudes scale. A convenience factor resp_party (Democrat / Independent / Republican) is carried for subgroup labelling and is not part of $\mathbf Z$.

`bs2013` — Bechtel and Scheve (2013): climate treaties

2,500 respondents $\times$ 4 tasks $\times$ 2 profiles = 20,000 rows. Five categorical treaty attributes (burden distribution, emissions target, monitoring body, participation, and sanctions) plus a numeric monthly cost_usd attribute, and three respondent moderators (resp_female, resp_age, and 0–10 resp_ideo). The numeric cost supports dollar-scale willingness-to-pay analysis. This is a lightweight teaching example, not one of the paper’s applications.

Organization

The user guide is structured into the following chapters:

Chapter 1 Get Started
Installation instructions for torch and sconjoint.
Chapter 2 Simulated Example
Full workflow on simulated data with known ground truth, verifying each step against the true DGP.
Chapter 3 Example: Democratic Norms
Graham and Svolik (2020) democratic-norms conjoint, emphasizing heterogeneity testing, partisan subgroup analysis, and preference clustering.
Chapter 4 Example: Tax Preferences
Ballard-Rosa, Martin, and Scheve (2017) tax-plan conjoint, showcasing all-numeric (continuous) attributes and partisan tax-schedule preferences.
Chapter 5 Example: Candidate Choice
Full walk-through of the Saha and Weeks (2022) candidate-choice conjoint.
Chapter 6 Example: Climate Treaties
Bechtel and Scheve (2013) climate-treaty conjoint (tutorial example), showcasing willingness-to-pay analysis with a numeric cost attribute.
Chapter 7 Estimands
Reference catalog of the structural estimands — which quantity to use, what it targets, and how its inference is computed.
Chapter 8 Plot Options
Customizing plots with dummies, labels, groups, and ggplot2.
Chapter 9 Advanced options
Reference for scfit() arguments and defaults: Stage-2 options, reproducibility, training hyperparameters, and parallel execution.

Authors

Avidit Acharya
Jens Hainmueller
Yiqing Xu
StatsClaw (Agentic System for Statistical Software Development)

How to Cite

Acharya, Avidit, Jens Hainmueller, and Yiqing Xu. 2026. sconjoint: Structural Deep-Learning Estimation for Conjoint Experiments — User Manual (v0.2.0). https://github.com/xuyiqing/sconjoint

@manual{sconjoint2026,
  title = {sconjoint: Structural Deep-Learning Estimation for Conjoint Experiments --- User Manual},
  author = {Acharya, Avidit and Hainmueller, Jens and Xu, Yiqing},
  year = {2026},
  note = {R package version 0.2.0},
  url = {https://github.com/xuyiqing/sconjoint}
}

Report Bugs

Please report any bugs by submitting an issue on GitHub or emailing Yiqing Xu (yiqingxu [at] stanford.edu). Please include a minimally reproducible example and your sessionInfo() output.

sconjoint:

Abramson, Scott F., Korhan Koçak, and Asya Magazinnik. 2022. “What Do We Learn about Voter Preferences from Conjoint Experiments?” American Journal of Political Science 66 (4): 1008–20.

Acharya, Avidit, Jens Hainmueller, and Yiqing Xu. 2026. “Learning Preferences from Conjoint Data: A Hybrid Structural Deep Learning Approach.”

Athey, Susan, Julie Tibshirani, and Stefan Wager. 2019. “Generalized Random Forests.” Annals of Statistics 47 (2): 1148–78.

Ballard-Rosa, Cameron, Lucy Martin, and Kenneth Scheve. 2017. “The Structure of American Income Tax Policy Preferences.” Journal of Politics 79 (1): 1–16.

Bansak, Kirk, Jens Hainmueller, and Dominik Hangartner. 2016. “How Economic, Humanitarian, and Religious Concerns Shape European Attitudes Toward Asylum Seekers.” Science 354 (6309): 217–22.

———. 2023. “Europeans’ Support for Refugees of Varying Background Is Stable over Time.” Nature 620: 849–54.

Bansak, Kirk, Jens Hainmueller, Daniel J. Hopkins, and Teppei Yamamoto. 2023. “Using Conjoint Experiments to Analyze Election Outcomes: The Essential Role of the Average Marginal Component Effect.” Political Analysis 31 (4): 500–518.

Bechtel, Michael M., and Kenneth F. Scheve. 2013. “Mass Support for Global Climate Agreements Depends on Institutional Design.” Proceedings of the National Academy of Sciences 110 (34): 13763–68.

Berry, Steven, James Levinsohn, and Ariel Pakes. 1995. “Automobile Prices in Market Equilibrium.” Econometrica 63 (4): 841–90.

Chernozhukov, Victor, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey, and James Robins. 2018. “Double/Debiased Machine Learning for Treatment and Structural Parameters.” The Econometrics Journal 21 (1): C1–68.

Chernozhukov, Victor, Whitney K. Newey, and Rahul Singh. 2022. “Automatic Debiased Machine Learning of Causal and Structural Effects.” Econometrica 90 (3): 967–1027.

Chipman, Hugh A., Edward I. George, and Robert E. McCulloch. 2010. “BART: Bayesian Additive Regression Trees.” Annals of Applied Statistics 4 (1): 266–98.

Cuesta, Brandon de la, Naoki Egami, and Kosuke Imai. 2022. “Improving the External Validity of Conjoint Analysis: The Essential Role of Profile Distribution.” Political Analysis 30 (1): 19–45.

Egami, Naoki, and Kosuke Imai. 2019. “Causal Interaction in Factorial Experiments: Application to Conjoint Analysis.” Journal of the American Statistical Association 114 (526): 529–40.

Farrell, Max H., Tengyuan Liang, and Sanjog Misra. 2021. “Deep Neural Networks for Estimation and Inference.” Econometrica 89 (1): 181–213.

———. 2025. “Deep Learning for Individual Heterogeneity: An Automatic Inference Framework.”

Goplerud, Max, Kosuke Imai, and Nicole E. Pashley. 2025. “Estimating Heterogeneous Causal Effects of High-Dimensional Treatments: Application to Conjoint Analysis.” Annals of Applied Statistics 19 (2): 866–88.

Graham, Matthew H., and Milan W. Svolik. 2020. “Democracy in America? Partisanship, Polarization, and the Robustness of Support for Democracy in the United States.” American Political Science Review 114 (2): 392–409.

Green, Paul E., and V. Srinivasan. 1990. “Conjoint Analysis in Marketing: New Developments with Implications for Research and Practice.” Journal of Marketing 54 (4): 3–19.

Hainmueller, Jens, Daniel J. Hopkins, and Teppei Yamamoto. 2014. “Causal Inference in Conjoint Analysis: Understanding Multidimensional Choices via Stated Preference Experiments.” Political Analysis 22 (1): 1–30.

Ham, Dae Woong, Kosuke Imai, and Lucas Janson. 2024. “Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis.” Political Analysis 32 (3): 329–44.

Hetzenecker, Stephan, and Maximilian Osterhaus. 2024. “Deep Learning for the Estimation of Heterogeneous Parameters in Discrete Choice Models.”

Kamakura, Wagner A., and Gary J. Russell. 1989. “A Probabilistic Choice Model for Market Segmentation and Elasticity Structure.” Journal of Marketing Research 26 (4): 379–90.

Leeper, Thomas J., Sara B. Hobolt, and James Tilley. 2020. “Measuring Subgroup Preferences in Conjoint Experiments.” Political Analysis 28 (2): 207–21.

McFadden, Daniel. 1974. “Conditional Logit Analysis of Qualitative Choice Behavior.” In Frontiers in Econometrics, edited by Paul Zarembka, 105–42. Academic Press.

———. 1981. “Econometric Models of Probabilistic Choice.” In Structural Analysis of Discrete Data with Econometric Applications, edited by Charles F. Manski and Daniel McFadden, 198–272. MIT Press.

McFadden, Daniel, and Kenneth Train. 2000. “Mixed MNL Models for Discrete Response.” Journal of Applied Econometrics 15 (5): 447–70.

Revelt, David, and Kenneth Train. 1998. “Mixed Logit with Repeated Choices: Households’ Choices of Appliance Efficiency Level.” Review of Economics and Statistics 80 (4): 647–57.

Rho, Sungmin, and Michael Tomz. 2017. “Why Don’t Trade Preferences Reflect Economic Self-Interest?” International Organization 71 (S1): S85–108.

Robinson, Thomas S., and Raymond Duch. 2024. “How to Detect Heterogeneity in Conjoint Experiments.” Journal of Politics 86 (2): 412–27.

Saha, Sparsha, and Ana Catalano Weeks. 2022. “Ambitious Women: Gender and Voter Perceptions of Candidate Ambition.” Political Behavior 44: 779–805.

Train, Kenneth E. 2009. Discrete Choice Methods with Simulation. 2nd ed. Cambridge University Press.

Zhirkov, Kirill. 2022. “Estimating and Using Individual Marginal Component Effects from Conjoint Experiments.” Political Analysis 30 (2): 236–49.

Welcome

Bundled datasets

gs2020 — Graham and Svolik (2020): democratic norms

br2017 — Ballard-Rosa, Martin, and Scheve (2017): tax preferences

sw2022 — Saha and Weeks (2022): candidate choice

bs2013 — Bechtel and Scheve (2013): climate treaties