A Fast Multivariate Nearest Neighbour Imputation Algorithm
Solomon, Norman, Oatley, Giles and McGarry, Kenneth (2007) A Fast Multivariate Nearest Neighbour Imputation Algorithm. Lecture Notes in Engineering and Computer Science, 2166 (1). pp. 940-948. ISSN 2078-0958
Item Type: | Article |
---|
Abstract
Imputation of missing data is important in many
areas, such as reducing non-response bias in surveys and
maintaining medical documentation. Nearest neighbour (NN)
imputation algorithms replace the missing values within any
particular observation by taking copies of the corresponding
known values from the most similar observation found in the
dataset. However, when NN algorithms are executed against large
multivariate datasets the poor
performance (program execution
speed) of these algorithms can present major practical problems.
We argue that these problems
have not been sufficiently
addressed, and we present a fast NN imputation algorithm that
can employ any method for meas
uring the similarity between
observations. The algorithm has b
een designed for the imputation
of missing values in large multivar
iate datasets that contain many
different missingness patterns with large proportions of missing
data. The ideas underpinning th
e algorithm are explained in
detail, and experiments are described which show that the
algorithm delivers very good perf
ormance when it is used for
imputation in both segmented and non-segmented datasets
containing several million rows
|
PDF
WCE2007_pp940-947.pdf - Published Version Download (298kB) |
More Information
Depositing User: Kenneth McGarry |
Identifiers
Item ID: 5284 |
ISSN: 2078-0958 |
URI: http://sure.sunderland.ac.uk/id/eprint/5284 | Official URL: http://www.iaeng.org/LNECS/ |
Users with ORCIDS
Catalogue record
Date Deposited: 12 Mar 2015 09:55 |
Last Modified: 18 Dec 2019 15:37 |
Author: | Kenneth McGarry |
Author: | Norman Solomon |
Author: | Giles Oatley |
University Divisions
Faculty of TechnologyFaculty of Technology > School of Computer Science
Subjects
Computing > Artificial IntelligenceComputing > Databases
Actions (login required)
View Item (Repository Staff Only) |