Worldwide Thesis Database & PhD tips

Set-valued Data: Regression, Design and Outliers - PhDData

Access database of worldwide thesis

Set-valued Data: Regression, Design and Outliers

The thesis was published by Li, Qiyu, in September 2022, University of Bern.

Abstract:

The focus of this dissertation is to study setโ€valued data from three aspects, namely regression, optimal design and outlier identification. This dissertation consists of three peerโ€reviewed published articles, each of them addressing one aspect. Their titles and abstracts are listed below:

1. Local regression smoothers with setโ€valued outcome data:
This paper proposes a method to conduct local linear regression smoothing in the presence of setโ€valued outcome data. The proposed estimator is shown to be consistent, and its mean squared error and asymptotic distribution are derived. A method to build error tubes around the estimator is provided, and a small Monte Carlo exercise is conducted to confirm the good finite sample properties of the estimator. The usefulness of the method is illustrated on a novel dataset from a clinical trial to assess the effect of certain genesโ€ expressions on different lung cancer treatments outcomes.

2. Optimal design for multivariate multiple linear regression with setโ€identified response:
We consider the partially identified regression model with setโ€identified responses, where the estimator is the set of the least square estimators obtained for all possible choices of points sampled from setโ€identified observations. We address the issue of determining the optimal design for this case and show that, for objective functions mimicking those for several classical optimal designs, their setโ€identified analogues coincide with the optimal designs for pointโ€identified realโ€valued responses.

3. Depth and outliers for samples of sets and random sets distributions:
We suggest several constructions suitable to define the depth of setโ€valued observations with respect to a sample of convex sets or with respect to the distribution of a random closed convex set. With the concept of a depth, it is possible to determine if a given convex set should be regarded an outlier with respect to a sample of convex closed sets. Some of our constructions are motivated by the known concepts of halfโ€space depth and band depth for functionโ€valued data. A novel construction derives the depth from a family of nonโ€linear expectations of random sets. Furthermore, we address the role of positions of sets for evaluation of their depth. Two case studies concern interval regression for Greek wine data and detection of outliers in a sample of particles.

The full thesis can be downloaded at :
http://boristheses.unibe.ch/2738/1/21li_q.pdf

Read the last PhD tips

2022
September

How can I tell my Ph.D. supervisor I published a paper about my thesis without telling them or listing them as authors?
2022
October

Do European Ph.D. programs are soo different than American Ph.D. programs to treat students?
2022
October

Is writing so many academic articles a waste of time when most of it isn’t read by anyone?
2022
October

Is it guaranteed when you receive a Ph.D., a lucrative job will be waiting for you?
2022
September

Can a Ph.D. supervisor fire you?