Learning Noise in Web Data Prior to Elimination
Onyancha, Julius, Plekhanova, Valentina and Nelson, David (2018) Learning Noise in Web Data Prior to Elimination. In: Transactions on Engineering Technologies: 25th World Congress on Engineering (WCE 2017). Springer. ISBN 9789811307461
Item Type: | Book Section |
---|
Abstract
This research work explores how noise in web data is currently addressed. We establish that current research works eliminate noise in web data mainly based on the structure and layout of web pages i.e. they consider noise as any data that does not form part of the main web page. However, not all data that form part of the main web page is of a user interest and not every data considered noise is actually noise to a given user. The ability to determine what is useful from noise data taking into account dynamic change of user interests has not been fully addressed by current research works. We aim to justify a claim that it is important to learn noise prior to elimination, not only to decrease levels of noise but also reduce the loss of useful information otherwise eliminated as noise. This is because if the process of eliminating noise in web data is not user-driven, the interestingness of web data available to a user will not reflect their interests given the time of the request.
More Information
Depositing User: Barry Hall |
Identifiers
Item ID: 8833 |
ISBN: 9789811307461 |
URI: http://sure.sunderland.ac.uk/id/eprint/8833 | Official URL: https://link.springer.com/chapter/10.1007/978-981-... |
Users with ORCIDS
Catalogue record
Date Deposited: 23 Feb 2018 11:43 |
Last Modified: 03 Jun 2020 15:23 |
Author: | Julius Onyancha |
Author: | Valentina Plekhanova |
Author: | David Nelson |
University Divisions
Faculty of TechnologyFaculty of Technology > School of Computer Science
Subjects
Computing > Network ComputingActions (login required)
View Item (Repository Staff Only) |