• Home
  • Study Details
By physician referral or invitation only

Understanding Target Variable Selection in Machine Learning Model Development

Predictive modeling has recently attracted a lot of attention from organizations trying to leverage AI and big data to improve their work processes such as decision-making. However, real-world problems are rarely well-formulated machine learning problems. Practitioners have to supply a well-defined predictive target to operationalize a predictive model. In such cases, they often resort to using observed variables to approximate the actual construct of interest. For example, people have used high sales numbers as a proxy for a good employee. Proxy label selection is a recurring challenge when predictive ML is applied to real-world problems. The purpose of this interview study is to understand how ML practitioners select proxy labels, evaluate proxy labels, and iterate through the different tasks involved.

Age & Gender

  • 18 years ~ 99 years
  • Male, Female, Gender Inclusive

Contact the Team

Location

Thank you for your interest, but this study is recruiting by invitation only.

United States (Nationwide)

Additional Study Information

Principal Investigator

Yue Wang
School of Information and Library Science

Study Type

Behavioral or Social
Observational

Study Topics

Healthy Volunteer or General Population

IRB Number

24-0416

Research for Me logo

Copyright © 2013-2022 The NC TraCS Institute, the integrated home of the NIH Clinical and Translational Science Awards (CTSA) Program at UNC-CH.  This website is made possible by CTSA Grant UL1TR002489 and the National Center for Advancing Translational Sciences.

Questions?

  • This email address is being protected from spambots. You need JavaScript enabled to view it.
logo for the North Carolina Translational and Clinical Sciences Institute
logo for UNC Health
logo for UNC School of Medicine
logo for UNC Research