

Sunderland Repository records the research produced by the University of Sunderland including practice-based research and theses.

A Fast Multivariate Nearest Neighbour Imputation Algorithm

Solomon, Norman, Oatley, Giles and McGarry, Kenneth (2007) A Fast Multivariate Nearest Neighbour Imputation Algorithm. Lecture Notes in Engineering and Computer Science, 2166 (1). pp. 940-948. ISSN 2078-0958

Item Type: Article


Imputation of missing data is important in many areas, such as reducing non-response bias in surveys and maintaining medical documentation. Nearest neighbour (NN) imputation algorithms replace the missing values within any particular observation by taking copies of the corresponding known values from the most similar observation found in the dataset. However, when NN algorithms are executed against large multivariate datasets, the poor performance (program execution speed) of these algorithms can present major practical problems. We argue that these problems have not been sufficiently addressed, and we present a fast NN imputation algorithm that can employ any method for measuring the similarity between observations. The algorithm has been designed for the imputation of missing values in large multivariate datasets that contain many different missingness patterns with large proportions of missing data. The ideas underpinning the algorithm are explained in detail, and experiments are described which show that the algorithm delivers very good performance when it is used for imputation in both segmented and non-segmented datasets containing several million rows.
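
For readers unfamiliar with the technique, the basic NN imputation step described in the abstract can be sketched as follows. This is a naive brute-force baseline and not the fast algorithm presented in the paper: it compares every incomplete row against every other row, which is exactly the quadratic cost the abstract identifies as a practical problem. The names `nn_impute` and `euclidean_on_shared`, the use of `None` for missing values, and the choice of distance function are all assumptions made for illustration. The `distance` parameter reflects the abstract's point that any similarity measure can be plugged in.

```python
import math
from typing import Callable, List, Optional, Sequence

Row = Sequence[Optional[float]]  # None marks a missing value (assumed convention)


def euclidean_on_shared(a: Row, b: Row) -> float:
    """Root-mean-square difference over the columns known in both rows.

    Returns infinity when the two rows share no known columns.
    """
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return math.inf
    return math.sqrt(sum((x - y) ** 2 for x, y in pairs) / len(pairs))


def nn_impute(
    rows: Sequence[Row],
    distance: Callable[[Row, Row], float] = euclidean_on_shared,
) -> List[List[Optional[float]]]:
    """Fill each incomplete row from its most similar donor row.

    A donor must have known values in every column the recipient is missing.
    Rows with no eligible donor are returned unchanged.
    """
    imputed: List[List[Optional[float]]] = []
    for i, row in enumerate(rows):
        missing = [j for j, value in enumerate(row) if value is None]
        new_row = list(row)
        if missing:
            best_donor, best_distance = None, math.inf
            for k, donor in enumerate(rows):
                # Skip the row itself and donors that lack the needed values.
                if k == i or any(donor[j] is None for j in missing):
                    continue
                d = distance(row, donor)
                if d < best_distance:
                    best_donor, best_distance = donor, d
            if best_donor is not None:
                # Copy the donor's known values into the missing positions.
                for j in missing:
                    new_row[j] = best_donor[j]
        imputed.append(new_row)
    return imputed


data = [
    [1.0, 2.0, 3.0],
    [1.1, None, 3.1],
    [9.0, 8.0, 7.0],
    [None, 8.2, 7.1],
]
result = nn_impute(data)
print(result)
```

In this toy dataset the second row borrows its missing value from the first row, and the fourth row borrows from the third. Donors are always taken from the original data, so an imputed value is never reused as a donor value.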

WCE2007_pp940-947.pdf - Published Version


More Information

Depositing User: Kenneth McGarry


Item ID: 5284
ISSN: 2078-0958


Catalogue record

Date Deposited: 12 Mar 2015 09:55
Last Modified: 18 Dec 2019 15:37


Author: Kenneth McGarry
Author: Norman Solomon
Author: Giles Oatley

University Divisions

Faculty of Technology
Faculty of Technology > School of Computer Science


Computing > Artificial Intelligence
Computing > Databases
