Yuxin Chen

Yuxin Chen
  • Associate Professor of Statistics and Data Science
  • Associate Professor of Electrical and Systems Engineering (secondary appointment)

Contact Information

  • office Address:

    313 Academic Research Building
    265 South 37th Street
    Philadelphia, PA 19104

Research Interests: statistics, optimization, reinforcement learning, machine learning theory, information theory, statistical machine learning

Links: Personal Website, Google Scholar

Overview

For more information, please visit my Personal Website.

Openings

I’m looking for highly motivated postdocs and Ph. D. students with strong mathematical background and interest in machine learning theory, statistics, and optimization.

Education

Ph.D. in Electrical Engineering, Stanford University, 2015 (Advisor: Andrea J. Goldsmith)
M.S. in Statistics, Stanford University, 2013
M.S. in Electrical and Computer Engineering, University of Texas at Austin, 2010
B.E. in Electrical Engineering / Microelectronics, Tsinghua University, 2008

Academic Positions Held

Associate Professor of Statistics and Data Science,
Associate Professor of Electrical and Systems Engineering (secondary appointment),
The Wharton School, University of Pennsylvania, 2022-present

Assistant Professor of Electrical and Computer Engineering,
Associated Faculty Member of Computer Science and of Applied and Computational Mathematics,
Princeton University,  2017-2021

Postdoctoral Researcher, Department of Statistics
Stanford University, 2015-2017 (Advisor: Emmanuel J. Candès)

Continue Reading

Research

Teaching

Current Courses (Spring 2025)

  • OIDD4770 - Intro To Python Data Sci

    The goal of this course is to introduce the Python programming language within the context of the closely related areas of statistics and data science. Students will develop a solid grasp of Python programming basics, as they are exposed to the entire data science workflow, starting from interacting with SQL databases to query and retrieve data, through data wrangling, reshaping, summarizing, analyzing and ultimately reporting their results. Competency in Python is a critical skill for students interested in data science. Prerequisites: No prior programming experience is expected, but statistics, through the level of multiple regression is required. This requirement may be fulfilled with Undergraduate courses such as Stat 1020, Stat 1120.

    OIDD4770401 ( Syllabus )

    OIDD4770402 ( Syllabus )

  • OIDD4810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra (Math 3120, 3130, 3140 or equivalent), basic calculus (Math 2400 or equivalent), basic probability (STAT 4300 or equivalent), and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

    OIDD4810401 ( Syllabus )

  • OIDD5810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra, basic calculus, basic probability, and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

    OIDD5810401 ( Syllabus )

  • STAT4770 - Intro To Python Data Sci

    The goal of this course is to introduce the Python programming language within the context of the closely related areas of statistics and data science. Students will develop a solid grasp of Python programming basics, as they are exposed to the entire data science workflow, starting from interacting with SQL databases to query and retrieve data, through data wrangling, reshaping, summarizing, analyzing and ultimately reporting their results. Competency in Python is a critical skill for students interested in data science. Prerequisites: No prior programming experience is expected, but statistics, through the level of multiple regression is required. This requirement may be fulfilled with Undergraduate courses such as Stat 1020, Stat 1120.

    STAT4770401 ( Syllabus )

    STAT4770402 ( Syllabus )

  • STAT4810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra (Math 3120, 3130, 3140 or equivalent), basic calculus (Math 2400 or equivalent), basic probability (STAT 4300 or equivalent), and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

    STAT4810401 ( Syllabus )

  • STAT5810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra, basic calculus, basic probability, and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

    STAT5810401 ( Syllabus )

All Courses

  • AMCS5999 - Independent Study

    Independent Study allows students to pursue academic interests not available in regularly offered courses. Students must consult with their academic advisor to formulate a project directly related to the student’s research interests. All independent study courses are subject to the approval of the AMCS Graduate Group Chair.

  • AMCS9999 - Ind Study & Research

    Study under the direction of a faculty member.

  • OIDD4770 - Intro To Python Data Sci

    The goal of this course is to introduce the Python programming language within the context of the closely related areas of statistics and data science. Students will develop a solid grasp of Python programming basics, as they are exposed to the entire data science workflow, starting from interacting with SQL databases to query and retrieve data, through data wrangling, reshaping, summarizing, analyzing and ultimately reporting their results. Competency in Python is a critical skill for students interested in data science. Prerequisites: No prior programming experience is expected, but statistics, through the level of multiple regression is required. This requirement may be fulfilled with Undergraduate courses such as Stat 1020, Stat 1120.

  • OIDD4810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra (Math 3120, 3130, 3140 or equivalent), basic calculus (Math 2400 or equivalent), basic probability (STAT 4300 or equivalent), and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

  • OIDD5810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra, basic calculus, basic probability, and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

  • STAT4770 - Intro To Python Data Sci

    The goal of this course is to introduce the Python programming language within the context of the closely related areas of statistics and data science. Students will develop a solid grasp of Python programming basics, as they are exposed to the entire data science workflow, starting from interacting with SQL databases to query and retrieve data, through data wrangling, reshaping, summarizing, analyzing and ultimately reporting their results. Competency in Python is a critical skill for students interested in data science. Prerequisites: No prior programming experience is expected, but statistics, through the level of multiple regression is required. This requirement may be fulfilled with Undergraduate courses such as Stat 1020, Stat 1120.

  • STAT4810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra (Math 3120, 3130, 3140 or equivalent), basic calculus (Math 2400 or equivalent), basic probability (STAT 4300 or equivalent), and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

  • STAT5810 - Conv Optim Stat Data Sci

    Convex optimization has become a real pillar of modern data science and has transformed algorithm designs. A wide spectrum of problems in statistics, machine learning, and engineering can be formulated as optimization tasks that exhibit favorable convexity properties, which admit standardized and efficient solutions. This course aims to introduce the elements of convex optimization, concentrating on modeling aspects and algorithms that are useful in data science applications. Topics include convex sets, convex functions, linear and quadratic programs, semidefinite programming, optimality conditions and duality theory. We will visit important applications in statistics and machine learning to demonstrate the wide applicability of convex optimization. We will also cover effective optimization algorithms like gradient descent and Newton's method. Prerequisites: Basic linear algebra, basic calculus, basic probability, and knowledge of a programming language like MATLAB or Python to conduct simulation exercises.

  • STAT9910 - Sem in Adv Appl of Stat

    This seminar will be taken by doctoral candidates after the completion of most of their coursework. Topics vary from year to year and are chosen from advance probability, statistical inference, robust methods, and decision theory with principal emphasis on applications.

  • STAT9999 - Independent Study

    Written permission of instructor and the department course coordinator required to enroll.

Awards and Honors

  • SIAM Activity Group on Imaging Science Best Paper Prize, 2024
  • Alfred P. Sloan Research Fellowship, 2022
  • Google Research Scholar Award, 2022
  • Princeton SEAS Junior Faculty Award, 2021
  • Princeton Graduate Mentoring Award, 2020
  • ICCM Best Paper Award (Gold Medal), 2020
  • ARO Young Investigator Program Award, 2020
  • Finalist for the Best Paper Prize for Young Researchers in Continuous Optimization, 2019
  • AFOSR Young Investigator Program Award, 2019

Activity

Latest Research

Changxiao Cai, H Vincent Poor, Yuxin Chen (2022), Uncertainty Quantification for Nonconvex Tensor Completion: Confidence Intervals, Heteroscedasticity and Optimality, accepted to IEEE Transactions on Information Theory.
All Research