Content area

Abstract

Model selection is an important problem in statistics. In this thesis, we develop two methods for solving problems of model selection. First we address the problem of variable screening for computer simulation experiments. Initially, a multi-stage strategy is developed which incorporates a state-of-the-art technique in each stage. By making the best use of the property of each technique, in combination they can achieve a sophisticated goal that can not be achieved by any single method. We combine an extension of the BART sum of trees model with adaptive sampling techniques and sensitivity analysis to select variables in a highly precise manner. Secondly, we introduce a graphical tool for choosing the number of nodes for a neural network. The idea here is to fit the neural network with a range of numbers of nodes at first, and then generate a jump plot using a transformation of the mean square errors of the resulting residuals. A theorem is proven to show that the jump plot will select several candidate numbers of nodes among which one is the true number of nodes. Then a single node only test, which has been theoretically justified, will be used to rule out erroneous candidates. The method has a sound theoretical background, yields good results on simulated datasets, and shows wide applicability to datasets from real research. The final project is a case study in air traffic control. Air traffic control relies upon accurate prediction of flight trajectories to prevent airspace conflicts. We develop a two-stage statistical approach to model how changes in the estimated weight affect the predictions of flight for a particular computer model of commercial flights. Our approach allows for differential amounts of nonlinearity in the fit.

Details

1010268
Subject
Classification
Title
Topics in model selection: Variable selection for computer experiments and choosing the number of nodes for neural networks
Number of pages
121
Degree date
2011
School code
0036
Source
DAI-B 73/06, Dissertation Abstracts International
ISBN
978-1-267-20183-6
University/institution
University of California, Santa Cruz
University location
United States -- California
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
3497932
ProQuest document ID
926573547
Document URL
https://www.proquest.com/dissertations-theses/topics-model-selection-variable-computer/docview/926573547/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic