International Journal of
Computer Engineering Research

  • Abbreviation: Int. J. Comput. Eng. Res.
  • Language: English
  • ISSN: 2141-6494
  • DOI: 10.5897/IJCER
  • Start Year: 2010
  • Published Articles: 33

Full Length Research Paper

Lessons learned and perspectives on constrained data collection and preparation for a predictive machine learning model applied to transportation industry in a non-digitalised environment

Simon Isaac KABEYA MWEPU
  • Simon Isaac KABEYA MWEPU
  • Higher Institute of Statistics of Lubumbashi, Democratic Republic of Congo.
  • Google Scholar
Patrick MUKALA
  • Patrick MUKALA
  • University of Wollongong in Dubai, United Arab Emirates.
  • Google Scholar


  •  Received: 28 October 2023
  •  Accepted: 15 January 2024
  •  Published: 31 March 2024

References

Abaker AA, Saeed FA (2021). A comparative analysis of machine learning algorithms to build a predictive model for detecting diabetes complications. Informatica 45:1.
Crossref

 

Adams LC (2023). Application of Data Mining and Machine Learning on Occupational Health and Safety Struck-by Incidents on South African Construction Sites: A CRISP-DM approach.

 

Agrawal M, Khan AU, Shukla PK (2019). Stock price prediction using technical indicators: a predictive model using optimal deep learning. Learning 6(2):7.
Crossref

 

Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, Santamaría J, Fadhel MA, Al-Amidie M, Farhan L (2021). Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data 8:1-74.
Crossref

 

Arnal Segura M, Bini G, Fernandez Orth D, Samaras E, Kassis M, Aisopos F, Rambla De Argila J, Paliouras G, Garrard P, Giambartolomei C, Tartaglia GG (2022). Machine learning methods applied to genotyping data capture interactions between single nucleotide variants in late onset Alzheimer's disease. Alzheimer's and Dementia: Diagnosis, Assessment and Disease Monitoring 14(1):e12300.
Crossref

 

Arumugam K, Swathi Y, Sanchez DT, Mustafa M, Phoemchalard C, Phasinam K, Okoronkwo E (2022). Towards applicability of machine learning techniques in agriculture and energy sector. Materials Today: Proceedings 51:2260-2263.
Crossref

 

Ayele WY (2021). Adapting CRISP-DM for Idea Mining: A Data Mining Process for Generating Ideas Using a Textual Dataset. arXiv preprint arXiv:2105.00574.
Crossref

 

Beer D (2019). The social power of algorithms. The Social Power of Algorithms (pp. 1-13). Routledge.
Crossref

 

Beheshti A, Ghodratnama S, Elahi M, Farhood H (2022). Social Data Analytics. CRC Press.
Crossref

 

Boehm KM, Khosravi P, Vanguri R, Gao J, Shah SP (2022). Harnessing multimodal data integration to advance precision oncology. Nature Reviews Cancer 22(2):114-126.
Crossref

 

Cai L, Zhu Y (2015). The challenges of data quality and data quality assessment in the big data era. Data Science Journal 14:2-2.
Crossref

 

Chekroud AM, Bondar J, Delgadillo J, Doherty G, Wasil A, Fokkema M, Cohen Z, Belgrave D, DeRubeis R, Iniesta R, Dwyer D (2021). The promise of machine learning in predicting treatment outcomes in psychiatry. World Psychiatry 20(2):154-170.
Crossref

 

Chen YM, Chen PC, Lin WC, Hung KC, Chen YC, Hung CF, Wang LJ, Wu CN, Hsu CW, Kao HY (2023). Predicting new-onset post-stroke depression from real-world data using machine learning algorithm. Frontiers in Psychiatry 14:1195586.
Crossref

 

Choi SW, Lee EB, Kim JH (2021). The engineering machine-learning automation platform (emap): A big-data-driven ai tool for contractors' sustainable management solutions for plant projects. Sustainability 13(18):10384.
Crossref

 

Guo Q, Zheng Y, Shi J, Wang J, Li G, Li C, Fromson JA, Xu Y, Liu X, Xu H, Zhang T (2020). Immediate psychological distress in quarantined patients with COVID-19 and its association with peripheral inflammation: a mixed-method study. Brain, Behavior and Immunity 88:17-27.
Crossref

 

Hasanzadeh S, Esmaeili B, Dodd MD (2017). Impact of construction workers' hazard identification skills on their visual attention. Journal of Construction Engineering and Management 143(10):04017070.
Crossref

 

Heinrichs B, Eickhoff SB (2020). Your evidence? Machine learning algorithms for medical diagnosis and prediction. Human Brain Mapping 41(6) :1435-1444.
Crossref

 

Johnstone SJ, Parrish L, Jiang H, Zhang DW, Williams V, Li S (2021). Aiding diagnosis of childhood attention-deficit/hyperactivity disorder of the inattentive presentation: Discriminant function analysis of multi-domain measures including EEG. Biological Psychology 161:108080.
Crossref

 

Kabeya M (2003). Use of artificial intelligence in the detection of geometric defects of a railway track. The annals of the Higher Institute of Statistics of Lubumbashi (7):67-96.

 

Kabeya M (2021). Prediction of train derailments based on an implementation by the Machine Learning technics in railway operations. Master dissertation. Lubumbashi University.

 

Kamalov F, Gurrib I, Rajab K (2021). Financial forecasting with machine learning: price vs return. Kamalov, F., Gurrib, I. & Rajab, K.(2021). Financial Forecasting with Machine Learning: Price Vs Return. Journal of Computer Science 17(3):251-264.
Crossref

 

Kasongo N (2003). Procedure for hierarchical selection of discriminate variables by the technics of step-by-step discriminant analysis in the case of several classes. Case of Male internal Medicine at Sendwe Hospital in Lubumbashi. The annals of the Higher Institute of Statistics of Lubumbashi (7):97-126

 

Kolyshkina I, Simoff S (2021). Interpretability of machine learning solutions in public healthcare: The CRISP-ML approach. Frontiers in Big Data 4:660206.
Crossref

 

Kowsher M, Turaba MY, Sajed T, Rahman MM (2019). Prognosis and treatment prediction of type-2 diabetes using deep neural network and machine learning classifiers. In 2019 22nd International Conference on Computer and Information Technology (ICCIT) (pp. 1-6). IEEE.
Crossref

 

Lantz B (2019). Machine learning with R: expert techniques for predictive modeling. Packt Publishing Limited.

 

Madera C (2018). The evolution of information systems and architectures under the influence of big data: Data Lakes, (Doctoral dissertation. University of Montpellier), France.

 

Meng T, Guo X, Lian W, Deng K, Gao L, Wang Z, Huang J, Wang X, Long X, Xing B (2020). Identifying facial features and predicting patients of acromegaly using three-dimensional imaging techniques and machine learning. Frontiers in Endocrinology 11:492.
Crossref

 

Menkel-Meadow C (2017). Portia in a different voice: speculations on a women's lawyering process. In Gender and Justice (pp. 341-365). Routledge.
Crossref

 

Merriam SB, Grenier RS (Eds.) (2019). Qualitative research in practice: Examples for discussion and analysis. John Wiley & Sons.

 

Nordin N, Zainol Z, Mohd Noor MH, Lai Fong C (2021). A comparative study of machine learning techniques for suicide attempts predictive model. Health Informatics Journal 27(1):1460458221989395.
Crossref

 

Rees DG (2018). Essential statistics. Chapman and Hall/CRC.
Crossref

 

Rehman A, Naz S, Razzak I (2022). Leveraging big data analytics in healthcare enhancement: trends, challenges and opportunities. Multimedia Systems 28(4):1339-1371.
Crossref

 

Ribeiro R, Pilastri A, Moura C, Rodrigues F, Rocha R, Cortez P (2020). Predicting the tear strength of woven fabrics via automated machine learning: an application of the CRISP-DM methodology.
Crossref

 

Russell B, Potter M (2022). Introduction to mathematical philosophy. Routledge.
Crossref

 

Sarr JM, Brochier T, Brehmer P, Perrot Y, Bah A, Sarré A, Jeyid MA, Sidibeh M, El Ayoubi S (2021). Complex data labeling with deep learning methods: Lessons from fisheries acoustics. ISA Transactions 109-113-125.
Crossref

 

Schröer C, Kruse F, Gómez JM (2021). A systematic literature review on applying CRISP-DM process model. Procedia Computer Science 181:526-534.
Crossref

 

Siebert J, Joeckel L, Heidrich J, Trendowicz A, Nakamichi K, Ohashi K, Namba I, Yamamoto R, Aoyama M (2021). Towards CRISP-ML (Q): a machine learning process model with quality assurance methodology. Machine learning and knowledge extraction 3(2):392-413.

 

Szepesvári C (2022). Algorithms for reinforcement learning. Springer Nature.

 

Toye F, Seers K, Allcock N, Briggs M, Carr E, Barker K (2014). Meta-ethnography 25 years on: challenges and insights for synthesising a large number of qualitative studies. BMC Medical Research Methodology 14(1):1-14.
Crossref

 

Vollmer S, Mateen BA, Bohner G, Király FJ, Ghani R, Jonsson P, Cumbers S, Jonas A, McAllister KS, Myles P, Grainger D (2020). Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness. BMJ, 368.
Crossref

 

Wang R, Pan Z, Chen Y, Tan Z, Zhang J (2021). Influent Quality and Quantity Prediction in Wastewater Treatment Plant: Model Construction and Evaluation. Polish Journal of Environmental Studies 30(5).
Crossref

 

Woldeamanuel MM, Kim T, Cho S, Kim HK (2023). Estimation of concrete strength using thermography integrated with deep-learning-based image segmentation: Case studies and economic analysis. Expert Systems with Applications 213:119249.
Crossref