Tangible interaction learning model to enhance learning activity processes among children with dyslexia

Missing data is a widespread data quality issue across various domains. A common challenge is the occurrence of missing data during the data input process. Numerous studies have proposed methods to impute missing values for data across multiple fields. However, certain domains present unique chal...

Bibliographic Details
Main Author:	Jamalai@Jamali, Siti Nurliana
Format:	Thesis
Language:	English
Published:	2024
Subjects:	Dyslexic children - Education; Malay language - Study and teaching (Primary); Human-computer interaction
Online Access:	http://psasir.upm.edu.my/id/eprint/120146/1/120146.pdf

_version_	1846217896372469760
author	Jamalai@Jamali, Siti Nurliana
description	Missing data is a widespread data quality issue across many domains. A common challenge is that data go missing during the data input process. Numerous studies have proposed methods to impute missing values across multiple fields. However, certain domains present unique challenges because their attributes come from multiple scientific disciplines, such as biology, chemistry, and medicine, which complicates the imputation process. Current machine learning models struggle to handle both missing values and inaccuracies simultaneously, particularly with large datasets. These challenges are further compounded by the data type constraints that these algorithms impose. Furthermore, most current approaches focus on the imputation method alone and give too little attention to the cleansing and pre-processing phase, which can be crucial to how the imputation method works. In addition, software tools for applying missing data imputation approaches are limited. Hence, intelligent approaches are needed in data imputation to determine which set of independent variables is best for imputing missing values in dependent variables. Finding the optimal variables calls for a machine learning approach. In this research, an imputation approach named ImputeX is proposed, based on Extremely Randomized Trees (Extra Trees), an ensemble machine learning method. The method can impute both categorical and continuous features in large datasets. In addition, an application is presented that lets public users apply the proposed method through standard and autonomous data imputation. The proposed imputation method was compared with existing imputation methods including MissForest, K-NNI, HyperImpute, Multivariate Imputation by Chained Equations (MICE), Multiple Imputation with Denoising Autoencoders (MIDAS), and SoftImpute.
The results show that the proposed method improves execution time by 35% compared to recent imputation methods and increases accuracy by 0.5% at a 10% missing ratio, rising to a 15% improvement at a 90% missing ratio. The presented application also achieved the best performance compared to current software tools such as R packages, Statistical Package for the Social Sciences (SPSS), Stata, and Microsoft Excel. The significance of this research lies in developing an intelligent method that can deal with both missing values and accuracy in large datasets while minimizing the time consumed. By presenting an accurate and reliable imputation method, this research helps to improve data quality. Additionally, it contributes to data science by improving the data cleaning procedure, which is a step in the data preprocessing stage.
format	Thesis
id	oai:psasir.upm.edu.my:120146
institution	Universiti Putra Malaysia
language	English
publishDate	2024
record_format	eprints
spelling	oai:psasir.upm.edu.my:120146 2025-10-09T08:37:03Z http://psasir.upm.edu.my/id/eprint/120146/ Tangible interaction learning model to enhance learning activity processes among children with dyslexia Jamalai@Jamali, Siti Nurliana 2024-05 Thesis NonPeerReviewed text en http://psasir.upm.edu.my/id/eprint/120146/1/120146.pdf Jamalai@Jamali, Siti Nurliana (2024) Tangible interaction learning model to enhance learning activity processes among children with dyslexia. Doctoral thesis, Universiti Putra Malaysia. http://ethesis.upm.edu.my/id/eprint/18500 Dyslexic children - Education; Malay language - Study and teaching (Primary); Human-computer interaction
spellingShingle	Dyslexic children - Education Malay language - Study and teaching (Primary) Human-computer interaction Jamalai@Jamali, Siti Nurliana Tangible interaction learning model to enhance learning activity processes among children with dyslexia
title	Tangible interaction learning model to enhance learning activity processes among children with dyslexia
topic	Dyslexic children - Education; Malay language - Study and teaching (Primary); Human-computer interaction
url	http://psasir.upm.edu.my/id/eprint/120146/1/120146.pdf
url-record	http://psasir.upm.edu.my/id/eprint/120146/ http://ethesis.upm.edu.my/id/eprint/18500

Similar Items