Integrating Remote Photoplethysmography and Machine Learning on Multimodal Dataset for Noninvasive Heart Rate Monitoring
Abstract
:1. Introduction
2. Methodology
2.1. Hypothesis
2.2. Remote Photoplethysmography
Algorithm 1: Plane orthogonal to skin |
Assumption: Video duration 15 s; Frame rate 30 fps; total frame 450 frames |
Input: Video contains frame |
Output: rPPG signal |
1: = zeros (l, N), l = 48 (30 fps camera) |
2: for do: |
3: |
4: if then |
5: |
6: |
7: # The first row of |
8: # The second row of |
9: |
10: |
11: end if |
12: end for |
13: return |
2.3. Signal Filtering
2.4. Fast Fourier Transform
2.5. Support Vector Machine
2.6. Random Forest Regression
Algorithm 2: Random forest regression |
Input: training dataset, n: number of trees; d: tree depth; m: number of features at each node |
Output: prediction of Random Forest model |
1: for i = 1, 2, …, n do: |
2: Bootstrap sampling of the dataset Di |
3: Build decision tree: |
4: At each node, randomly sample m features |
5: Find the best split using the m subset |
6: Split the node and repeat until stopping criteria (max depth or minimum samples) are met. |
7: Save the decision tree Ti |
8: end for |
9: For each tree Ti, get the predicted value from all trees |
10: return Average of the predictions from each tree |
2.7. Data Collection
2.8. Region of Interest Selection
2.9. Feature Selection
- Maximum and minimum values of the signal: They are the basic features of a signal representing the maximum and minimum value of signal amplitude, respectively. From those values, a new feature called peak to peak, which is the difference between the maximum and minimum values of the signal, can be generated. Given that represents the signal amplitude at time frame , the maximum, minimum, and peak-to-peak values of the signal are represented as follows:
- Skewness and Kurtosis: Skewness is a measure of the asymmetry of the signal’s amplitude distribution around the mean, while kurtosis is the existence of outliers in the signal’s distribution. Given that represents the signal amplitude at time frame , and are the mean and standard deviation of the signal, respectively, skewness and kurtosis of the signal are represented as follows:
- Entropy: It measures the complexity or randomness in the signal based on the probability distribution of its values. It is often used to assess the predictability of a signal. Given that represents the signal amplitude at time frame , to calculate entropy, the signal needs to be discretized over a period of 30, matching the fps assumption used in this study. This process results in where . Each covers a specific range that can take. The entropy can be calculated as follows:
- Zero crossings: Zero crossings are a count of how many times the signal crosses the horizontal axis—that is, changes from positive to negative or vice versa. This feature also gives some insight into the frequency content of the signal. Given that represents the signal amplitude at time frame , zero crossings can be formulated as follows:
- Spectral centroid and spectral bandwidth: The spectral centroid is that point, considering the signal in the frequency domain, where the signal can be said to balance domain while spectral bandwidth is a measure of the spread or width of the frequency spectrum around the spectral centroid. They provide a measure of the amount the energy that is concentrated or dispersed across frequencies. Given that represents the frequency domain of the signal obtained by implementing into DFT in Formula (2), spectral centroid and spectral bandwidth can be formulated as follows:
- Dominant frequency and total power: Dominant frequency is the frequency at which, in the Fourier spectrum of the signal, its magnitude is the largest, whereas total power is a notion of the overall energy, computed by summing over squared amplitudes. Given that is the representation of time-domain signal , dominant frequency and total power can be calculated as follows:
3. Experiments
3.1. DeepFace for Gender and Age Prediction
3.2. Experiment with Random Forest and Support Vector Regression for Multimodal Heart Rate Calculation
- Hyperparameters for SVR: Kernel {Linear, RBF}, and ;
- Hyperparameters for Random Forest Regression: number of trees , max depth , and min samples per split .
3.3. Performance Results
3.4. Comparison with State-of-the-Art Methods
4. Discussion
5. Limitations and Future Works
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Yehezky, H.; Bustamam, A.; Hermawan, H. Enhancing ECG Signal Data through Denoising Features with Transformer Generative Adversarial Networks for Model Classication 1D-CNN. Res. Sq. 2023. [Google Scholar] [CrossRef]
- Atmadja, H.Y.N.; Bustamam, A. Hermawan Generated Tabular Data with Multi-GANs for Arrhythmia Classification Based on DNN Models. In Proceedings of the 2022 International Conference of Science and Information Technology in Smart Administration, ICSINTESA 2022, IEEE, Denpasar, Indonesia, 10–12 November 2022; pp. 69–74. [Google Scholar]
- Shyam, A.; Ravichandran, V.; Preejith, S.P.; Joseph, J.; Sivaprakasam, M. PPGnet: Deep Network for Device Independent Heart Rate Estimationfrom Photoplethysmogram. In Proceedings of the 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Berlin, Germany, 23–27 July 2019. [Google Scholar]
- Schiweck, C.; Piette, D.; Berckmans, D.; Claes, S.; Vrieze, E. Heart Rate and High Frequency Heart Rate Variability during Stress as Biomarker for Clinical Depression. A Systematic Review. Psychol. Med. 2019, 49, 200–211. [Google Scholar] [CrossRef] [PubMed]
- Qiao, D.; Ayesha, A.H.; Zulkernine, F.; Jaffar, N.; Masroor, R. ReViSe: Remote Vital Signs Measurement Using Smartphone Camera. IEEE Access 2022, 10, 131656–131670. [Google Scholar] [CrossRef]
- Hertzman, A.B. Observations on the Finger Volume Pulse Recorded Photoelectrically. Am. J. Physiol. 1937, 119, 334–335. [Google Scholar]
- Soldatos, J. Big Data and Artificial Intelligence in Digital Finance; Kyriazis, D., Ed.; Springer International Publishing: Cham, Switzerland, 2022. [Google Scholar]
- Upadhyay, B.R.; Baral, A.B.; Torlak, M. Vital Sign Detection via Angular and Range Measurements with MmWave MIMO Radars: Algorithms and Trials. IEEE Access 2022, 10, 106017–106032. [Google Scholar] [CrossRef]
- Pnevmatikakis, A.; Kanavos, S.; Perikleous, A.; Kyriazakos, S. Risk Assessment for Personalized Health Insurance Products. In Big Data and Artificial Intelligence in Digital Finance; Springer: Cham, Switzerland, 2022; pp. 281–294. [Google Scholar] [CrossRef]
- Verkruysse, W.; Svaasand, L.O.; Stuart Nelson, J. Remote Plethysmographic Imaging Using Ambient Light. Opt. Express 2008, 16, 2878–2886. [Google Scholar] [CrossRef]
- Poh, M.-Z.; Mcduff, D.J.; Picard, R.W. Non-Contact, Automated Cardiac Pulse Measurements Using Video Imaging and Blind Source Separation. References and Links. Opt. Express 2010, 18, 10762–10774. [Google Scholar] [CrossRef]
- De Haan, G.; Jeanne, V. Robust Pulse Rate from Chrominance-Based RPPG. IEEE Trans. Biomed. Eng. 2013, 60, 2878–2886. [Google Scholar] [CrossRef]
- Wang, W.; Den Brinker, A.C.; Stuijk, S.; De Haan, G. Algorithmic Principles of Remote PPG. IEEE Trans. Biomed. Eng. 2017, 64, 1479–1491. [Google Scholar] [CrossRef]
- Huang, X.; Lv, J. Resilient Remote Heart Rate Measurement from Partially Masked Faces. In Proceedings of the International Joint Conference on Neural Networks, IEEE, Gold Coast, Australia, 18–23 June 2023. [Google Scholar]
- Niu, X.; Zhao, X.; Han, H.; Das, A.; Dantcheva, A.; Shan, S.; Chen, X. Robust Remote Heart Rate Estimation from Face Utilizing-Temporal Attention. In Proceedings of the 2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019), Lille, France, 14–18 May 2019. [Google Scholar]
- Kodithuwakkuge, J.; Sekhar, V.; Lamahewa, T.A. Blind Symbol Rate Estimation Using Convolutional Neural Networks for PSK Modulated Signals. In Proceedings of the 2021 15th International Conference on Signal Processing and Communication Systems, ICSPCS 2021—Proceedings, IEEE, Sydney, Australia, 13–15 December 2021. [Google Scholar]
- Reiss, A.; Indlekofer, I.; Schmidt, P.; Van Laerhoven, K. Deep PPG: Large-Scale Heart Rate Estimation with Convolutional Neural Networks. Sensors 2019, 19, 3079. [Google Scholar] [CrossRef]
- Kim, K.B.; Baek, H.J. Photoplethysmography in Wearable Devices: A Comprehensive Review of Technological Advances, Current Challenges, and Future Directions. Electronics 2023, 12, 2923. [Google Scholar] [CrossRef]
- Quer, G.; Gouda, P.; Galarnyk, M.; Topol, E.J.; Steinhubl, S.R. Inter- and Intraindividual Variability in Daily Resting Heart Rate and Its Associations with Age, Sex, Sleep, BMI, and Time of Year: Retrospective, Longitudinal Cohort Study of 92,457 Adults. PLoS ONE 2020, 15, e0227709. [Google Scholar] [CrossRef]
- Ehrenwald, M.; Wasserman, A.; Shenhar-Tsarfaty, S.; Zeltser, D.; Friedensohn, L.; Shapira, I.; Berliner, S.; Rogowski, O. Exercise Capacity and Body Mass Index—Important Predictors of Change in Resting Heart Rate. BMC Cardiovasc. Disord. 2019, 19, 307. [Google Scholar] [CrossRef]
- Gonzales, T.I.; Jeon, J.Y.; Lindsay, T.; Westgate, K.; Perez-Pozuelo, I.; Hollidge, S.; Wijndaele, K.; Rennie, K.; Forouhi, N.; Griffin, S.; et al. Resting Heart Rate Is a Population-Level Biomarker of Cardiorespiratory Fitness: The Fenland Study. PLoS ONE 2023, 18, e0285272. [Google Scholar] [CrossRef] [PubMed]
- Wu, T.; Schmitt, H.J.; Blazek, V. Photoplethysmography Imaging: A New Noninvasive and Noncontact Method for Mapping of the Dermal Perfusion Changes. Opt. Tech. Instrum. Meas. Blood Compos. Struct. Dyn. 2000, 4163, 62. [Google Scholar] [CrossRef]
- Stricker, R.; Müller, S.; Gross, H.-M. Non-Contact Video-Based Pulse Rate Measurement on a Mobile Service Robot. In Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication, Edinburgh, UK, 25–29 August 2014. [Google Scholar]
- De Haan, G.; Van Leest, A. Improved Motion Robustness of Remote-PPG by Using the Blood Volume Pulse Signature. Physiol. Meas. 2014, 35, 1913. [Google Scholar] [CrossRef] [PubMed]
- Kim, J.K.; Mok Ahn, J. Design of an Optimal Digital IIR Filter for Heart Rate Variability by Photoplethysmogram. Int. J. Eng. Res. Technol. 2018, 11, 2009–2021. [Google Scholar]
- Tarvainen, M.P.; Ranta-aho, P.O.; Karjalainen, P.A. An Advanced Detrending Method with Application to HRV Analysis. IEEE Trans. Biomed. Eng. 2002, 49, 172–175. [Google Scholar] [CrossRef]
- Steiglitz, K. Digital Signal Processing; Courier Dover Publications: Mineola, NY, USA, 2020. [Google Scholar]
- Guler, S.; Golparvar, A.; Ozturk, O.; Dogan, H.; Kaya Yapici, M. Optimal Digital Filter Selection for Remote Photoplethysmography (RPPG) Signal Conditioning. Biomed. Phys. Eng. Express 2023, 9, 027001. [Google Scholar] [CrossRef]
- Burden, R.L. Numerical Analysis; Brooks/Cole: Pacific Grove, CA, USA, 2011. [Google Scholar]
- Bustamam, A.; Sarwinda, D.; Abdillah, B.; Kaloka, T.Q. Detecting Lesion Characteristics of Diabetic Retinopathy Using Machine Learning and Computer Vision. J. Electron. Syst. 2020, 10, 23–33. [Google Scholar] [CrossRef]
- Octaria, E.A.; Siswantining, T.; Bustamam, A.; Sarwinda, D. Kernel PCA and SVM-RFE Based Feature Selection for Classification of Dengue Microarray Dataset. AIP Conf. Proc. 2020, 2264, 030004. [Google Scholar]
- Açıkkar, M.; Altunkol, Y. A Novel Hybrid PSO- and GS-Based Hyperparameter Optimization Algorithm for Support Vector Regression. Neural Comput. Appl. 2023, 35, 19961–19977. [Google Scholar] [CrossRef]
- Bustamam, A.; Musti, M.I.S.; Hartomo, S.; Aprilia, S.; Tampubolon, P.P.; Lestari, D. Performance of Rotation Forest Ensemble Classifier and Feature Extractor in Predicting Protein Interactions Using Amino Acid Sequences. BMC Genom. 2019, 20, 950. [Google Scholar] [CrossRef]
- Testas, A. Hyperparameter Tuning with Scikit-Learn and PySpark. In Distributed Machine Learning with PySpark; Apress: Berkeley, CA, USA, 2023; pp. 417–440. [Google Scholar] [CrossRef]
- Kim, D.; Lee, K.; Sohn, C.B. Assessment of ROI Selection for Facial Video-Based RPPG. Sensors 2021, 21, 7923. [Google Scholar] [CrossRef]
- Sarwinda, D.; Siswantining, T.; Bustamam, A. Classification of Diabetic Retinopathy Stages Using Histogram of Oriented Gradients and Shallow Learning. In Proceedings of the 2018 International Conference on Computer, Control, Informatics and its Applications (IC3INA), IEEE, Tangerang, Indonesia, 1–2 November 2018. [Google Scholar]
- Oppenheim, A.V.; Willsky, A.S.; Nawab, S.H.; Ding, J.J. Signals and Systems; Prentice Hall: New York, NY, USA, 1997; Volume 2. [Google Scholar]
- Singh, A.K.; Krishnan, S. ECG Signal Feature Extraction Trends in Methods and Applications. BioMed. Eng. OnLine 2023, 22, 22. [Google Scholar] [CrossRef] [PubMed]
- Krishnan, S.; Athavale, Y. Trends in Biomedical Signal Feature Extraction. Biomed. Signal Process. Control 2018, 43, 41–63. [Google Scholar] [CrossRef]
- Anguita, D.; Ridella, S.; Rivieccio, F. K-Fold Generalization Capability Assessment for Support Vector Classifiers. In Proceedings of the 2005 IEEE International Joint Conference on Neural Networks, Montreal, QC, Canada, 31 July–4 August 2005; Volume 2, pp. 855–858. [Google Scholar] [CrossRef]
- Farahmand, M.; Desa, M.I.; Nilashi, M. A Combined Data Envelopment Analysis and Support Vector Regression for Efficiency Evaluation of Large Decision Making Units. Int. J. Eng. Technol. 2014, 6, 2310–2321. [Google Scholar]
Index | Method | Video | Physical | MAE | RMSE | MAPE | Hyperparameter Pairs |
---|---|---|---|---|---|---|---|
1 | RF | ✓ | ✗ | ||||
2 | RF | ✗ | ✓ | % | |||
3 | RF | ✓ | ✓ | 3.057 | |||
4 | SVM | ✓ | ✗ | 7.601 | 18.624 | % | |
5 | SVM | ✗ | ✓ | 3.251 | 12.797 | % | |
6 | SVM | ✓ | ✓ | 7.496 | 18.787 | % |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Buyung, R.A.; Bustamam, A.; Ramazhan, M.R.S. Integrating Remote Photoplethysmography and Machine Learning on Multimodal Dataset for Noninvasive Heart Rate Monitoring. Sensors 2024, 24, 7537. https://doi.org/10.3390/s24237537
Buyung RA, Bustamam A, Ramazhan MRS. Integrating Remote Photoplethysmography and Machine Learning on Multimodal Dataset for Noninvasive Heart Rate Monitoring. Sensors. 2024; 24(23):7537. https://doi.org/10.3390/s24237537
Chicago/Turabian StyleBuyung, Rinaldi Anwar, Alhadi Bustamam, and Muhammad Remzy Syah Ramazhan. 2024. "Integrating Remote Photoplethysmography and Machine Learning on Multimodal Dataset for Noninvasive Heart Rate Monitoring" Sensors 24, no. 23: 7537. https://doi.org/10.3390/s24237537
APA StyleBuyung, R. A., Bustamam, A., & Ramazhan, M. R. S. (2024). Integrating Remote Photoplethysmography and Machine Learning on Multimodal Dataset for Noninvasive Heart Rate Monitoring. Sensors, 24(23), 7537. https://doi.org/10.3390/s24237537