Information

  • Author Services

Initiatives

You are accessing a machine-readable page. In order to be human-readable, please install an RSS reader.

All articles published by MDPI are made immediately available worldwide under an open access license. No special permission is required to reuse all or part of the article published by MDPI, including figures and tables. For articles published under an open access Creative Common CC BY license, any part of the article may be reused without permission provided that the original article is clearly cited. For more information, please refer to https://www.mdpi.com/openaccess .

Feature papers represent the most advanced research with significant potential for high impact in the field. A Feature Paper should be a substantial original Article that involves several techniques or approaches, provides an outlook for future research directions and describes possible research applications.

Feature papers are submitted upon individual invitation or recommendation by the scientific editors and must receive positive feedback from the reviewers.

Editor’s Choice articles are based on recommendations by the scientific editors of MDPI journals from around the world. Editors select a small number of articles recently published in the journal that they believe will be particularly interesting to readers, or important in the respective research area. The aim is to provide a snapshot of some of the most exciting work published in the various research areas of the journal.

Original Submission Date Received: .

  • Active Journals
  • Find a Journal
  • Proceedings Series
  • For Authors
  • For Reviewers
  • For Editors
  • For Librarians
  • For Publishers
  • For Societies
  • For Conference Organizers
  • Open Access Policy
  • Institutional Open Access Program
  • Special Issues Guidelines
  • Editorial Process
  • Research and Publication Ethics
  • Article Processing Charges
  • Testimonials
  • Preprints.org
  • SciProfiles
  • Encyclopedia

BDCC-logo

Article Menu

big data thesis paper

  • Subscribe SciFeed
  • Recommended Articles
  • Google Scholar
  • on Google Scholar
  • Table of Contents

Find support for a specific problem in the support section of our website.

Please let us know what you think of our products and services.

Visit our dedicated information section to learn more about MDPI.

JSmol Viewer

Explore big data analytics applications and opportunities: a review.

big data thesis paper

1. Introduction

2. literature review, 2.1. big data and analytics, 2.2. characteristics of big data, 2.3. the types of data analytics.

  • Descriptive
  • Prescriptive
  • Business Intelligent, which consist of both Descriptive analytics and Diagnostic analytics
  • Advanced Analytics, which consider the higher data analytics types in maturity level, namely predictive and prescriptive analytics [ 55 , 56 , 59 ].

3. Big Data Analytics Opportunities and Applications

4. big data applications pre the covid-19 pandemic, 4.1. big data in healthcare, 4.2. big data in education, 4.3. big data in transportation.

  • First source of data which is the primary source is the direct physical sensing. Represented, in road-side static sensors such as LiDAR, microwave Radars, and sensors that measure speed, noise, and traffic flow known as acoustic sensors [ 100 ]. Other examples are the use of mobile phone technologies such as GPS, GSM, and Bluetooth [ 97 ].
  • The second source of data is the social media sources “human & social Sensing” highlighted in the use of motorists to the smartphone-compatible platforms [ 101 ]. For instance Instagram, twitter and others [ 97 ].
  • The third category of data source is urban sensing which is generated by transportation operators. In this category data captured can analyze urban mobility in terms of congestion and traffic flows [ 102 ]. This can be performed via credit cards and smart cards scanned through urban sensors from public transit, retail scanners and digital toll systems [ 97 ].

4.4. Big Data in Banking

5. big data applications peri the covid-19 pandemic, 5.1. big data in healthcare, 5.2. big data in education, 5.3. big data in transportation, 5.4. big data in banking, 5.5. big data analytics across industry, 6. conclusions, author contributions, institutional review board statement, data availability statement, conflicts of interest.

  • Alhomdy, S.; Thabit, F.; Abdulrazzak, F.A.H.; Haldorai, A.; Jagtap, S. The role of cloud computing technology: A savior to fight the lockdown in COVID-19 crisis, the benefits, characteristics and applications. Int. J. Intell. Netw. 2021 , 2 , 166–174. [ Google Scholar ] [ CrossRef ]
  • Alsunaidi, S.J.; Almuhaideb, A.M.; Ibrahim, N.M.; Shaikh, F.S.; Alqudaihi, K.S.; Alhaidari, F.A.; Khan, I.U.; Aslam, N.; Alshahrani, M.S. Applications of Big Data Analytics to Control COVID-19 Pandemic. Sensors 2021 , 21 , 2282. [ Google Scholar ] [ CrossRef ] [ PubMed ]
  • Rothberg, H.N.; Erickson, G.S. Big data systems: Knowledge transfer or intelligence insights? J. Knowl. Manag. 2017 , 21 , 92–112. [ Google Scholar ] [ CrossRef ]
  • Adrian, C.; Adrian, C.; Abdullah, R.; Atan, R.; Jusoh, Y.Y. Conceptual Model Development of Big Data Analytics Implementation Assessment Effect on Article in Press Conceptual Model Development of Big Data Analytics Implementation Assessment Effect on Decision-Making. Int. J. Interact. Multimed. Artif. Intell. 2018 , 5 , 101–106. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Gupta, M.; George, J.F. Toward the development of a big data analytics capability. Inf. Manag. 2016 , 53 , 1049–1064. [ Google Scholar ] [ CrossRef ]
  • Kalema, B.M.; Mokgadi, M. Developing countries organizations’ readiness for Big Data analytics. Probl. Perspect. Manag. 2017 , 15 , 260–270. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • De Bruin, T.; Kulkarni, U.; Rosemann, M.; Freeze, R. Understanding the Main Phases of Developing a Maturity Assessment Model ; ACIS: Sydney, Australia, 2005. [ Google Scholar ]
  • Rialti, R.; Marzi, G.; Ciappei, C.; Busso, D. Big data and dynamic capabilities: A bibliometric analysis and systematic literature review. Manag. Decis. 2019 , 57 , 2052–2068. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Al-sai, Z.A.; Abdullah, R.; Husin, M.H.; Syed-mohamad, S.M. A Preliminary Systematic Literature Review On Critical Success Factors Categories For Big Data. In Proceedings of the AiIC2019, Toyama, Japan, 7–11 July 2019. [ Google Scholar ]
  • Al-Sai, Z.A.; Abdullah, R.; Husin, M.H. Big Data Impacts and Challenges: A Review. In Proceedings of the 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology, Amman, Jordan, 9–11 April 2019; pp. 150–155. [ Google Scholar ] [ CrossRef ]
  • Al-Sai, Z.A.; Abdullah, R.; Husin, M.H. A review on big data maturity models. In Proceedings of the 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology, Amman, Jordan, 9–11 April 2019; pp. 156–161. [ Google Scholar ]
  • Zulkarnain, N.; Meyliana, M.; Prabowo, H.; Nizar Hidayanto, A. The critical success factors for big data adoption in government. Int. J. Mech. Eng. Technol. 2019 , 10 , 864–875. [ Google Scholar ] [ CrossRef ]
  • Al-Sai, Z.A.; Abualigah, L.M. Big Data and E-government: A review. In Proceedings of the 2017 8th International Conference on Information Technology, Amman, Jordan, 17–18 May 2017; pp. 580–587. [ Google Scholar ] [ CrossRef ]
  • Malik, P. Governing Big Data: Principles and practices. IBM J. Res. Dev. 2013 , 57 , 1:1–1:13. [ Google Scholar ] [ CrossRef ]
  • Big Data Analytics and Fi Nancial Reporting Quality: Qualitative Evidence from Canada. Available online: https://doi.org/10.1108/JFRA-12-2021-0489 (accessed on 1 January 2021).
  • Measuring Your Big Data Maturity. Available online: https://michaelskenny.com/points-of-view/measuring-your-big-data-maturity/ (accessed on 1 January 2021).
  • Davenport, T.; Dyché, J. Big Data in Big Companies. Baylor Bus. Rev. 2013 , 32 , 20–21. [ Google Scholar ]
  • Ward, J.S.; Barker, A. Undefined By Data: A Survey of Big Data Definitions. arXiv 2013 , arXiv:1309.5821. [ Google Scholar ]
  • Bertot, J.C.; Choi, H. Big data and e-government: Issues, policies, and recommendations. In Proceedings of the 14th Annual International Conference on Digital Government Research, New York, NY, USA, 17 June 2013; pp. 1–10. [ Google Scholar ]
  • Braun, H. Evaluation of Big Data Maturity Models—A Bench- Marking Study To Support Big Data Maturity Assessment In Organizations ; Tampere University of Technology: Tampere, Finland, 2015. [ Google Scholar ]
  • Manyika, J.; Chui, M.; Brown, B.; Bughin, J.; Dobbs, R.; Roxburgh, C.; Byers, A.H. Big Data: The Next Frontier for Innovation, Competition, and Productivity ; McKinsey Global Institute: Chicago, IL, USA, 2011. [ Google Scholar ]
  • Henke, N.; Bughin, J.; Chui, M.; Manyika, J.; Saleh, T.; Wiseman, B.; Sethupathy, G. The Age of Analytics: Competing in a Data-Driven World ; McKinsey Global Institute: Chicago, IL, USA, 2016; Volume 12, pp. 904–920. [ Google Scholar ] [ CrossRef ]
  • Chen, M.; Mao, S.; Liu, Y. Big data: A survey. Mob. Netw. Appl. 2014 , 19 , 171–209. [ Google Scholar ] [ CrossRef ]
  • Romijn, B.-J. Big Data in the Public Sector: Uncertainties and Readiness in the Dutch Public Executive Sector. Inf. Syst. Frontiers. 2017 , 19 , 267–283. [ Google Scholar ]
  • Sun, S.; Cegielski, C.G.; Jia, L.; Hall, D.J. Understanding the Factors Affecting the Organizational Adoption of Big Data. J. Comput. Inf. Syst. 2016 , 58 , 193–203. [ Google Scholar ] [ CrossRef ]
  • Zainal, N.Z.; Hussin, H.; Nazri, M.N.M. Big Data Initiatives by Governments —Issues and Challenges: A Review. In Proceedings of the 2016 6th International Conference on Information and Communication Technology for The Muslim World (ICT4M), Jakarta, Indonesia, 22–24 November 2016; pp. 304–309. [ Google Scholar ] [ CrossRef ]
  • Esteves, J.; Curto, J. A risk and benefits behavioral model to assess intentions to adopt big data. J. Intell. Stud. Bus. 2013 , 3 , 37–46. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Kaka, E.S. E-Government Adoption and Framework for Big Data Analytics. In Proceedings of the Second Covenant University Conferenced on E-Governance in Nigeria (CUCEN 2015), Covenant University Canaanland, Ota Ogun State, Nigeria, 10–12 June 2015; pp. 1–28. [ Google Scholar ]
  • Batko, K.; Ślęzak, A. The use of Big Data Analytics in healthcare. J. Big Data 2022 , 9 , 3. [ Google Scholar ] [ CrossRef ]
  • Al Nuaimi, E.; Al Neyadi, H.; Mohamed, N.; Al-Jaroodi, J. Applications of big data to smart cities. J. Internet Serv. Appl. 2015 , 6 , 25. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Lutfi, A.; Alrawad, M.; Alsyouf, A.; Almaiah, M.A.; Al-Khasawneh, A.; Al-Khasawneh, A.L.; Alshira’h, A.F.; Alshirah, M.H.; Saad, M.; Ibrahim, N. Drivers and impact of big data analytic adoption in the retail industry: A quantitative investigation applying structural equation modeling. J. Retail. Consum. Serv. 2023 , 70 , 103129. [ Google Scholar ] [ CrossRef ]
  • Big Data Analytics. Available online: https://www.ibm.com/analytics/big-data-analytics (accessed on 1 January 2021).
  • Brock, V.; Khan, H.U. Big data analytics: Does organizational factor matters impact technology acceptance? J. Big Data 2017 , 4 , 21. [ Google Scholar ] [ CrossRef ]
  • Chen, J.; Chen, Y.; Du, X.; Li, C.; Lu, J.; Zhao, S.; Zhou, X. Big Data Challenge: A Data Management Perspective. Front. Comput. Sci. 2013 , 7 , 157–164. [ Google Scholar ] [ CrossRef ]
  • Hood-Clark, S.F. Influences On The Use And Behavioral Intention To Use Big Data. Ph.D. Thesis, Capella University ProQuest Dissertations Publishing, Minneapolis, MN, USA, 2016. [ Google Scholar ]
  • Gandomi, A.; Haider, M. Beyond the hype: Big data concepts, methods, and analytics. Int. J. Inf. Manag. 2015 , 35 , 137–144. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Munné, R. Big data in the public sector. In New Horizons for a Data-Driven Economy ; Springer: Cham, Switzerland, 2016; pp. 195–208. [ Google Scholar ]
  • Comuzzi, M.; Patel, A. How organisations leverage Big Data: A maturity model. Ind. Manag. Data Syst. 2016 , 116 , 1468–1492. [ Google Scholar ] [ CrossRef ]
  • Laney, D. META Delta. Appl. Deliv. Strateg. 2001 , 949 , 4. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Kanan, T.; Mughaid, A.; Al-Shalabi, R.; Al-Ayyoub, M.; Elbe, M.; Sadaqa, O. Business intelligence using deep learning techniques for social media contents. Cluster Comput. 2022 , 1–12. [ Google Scholar ] [ CrossRef ]
  • Singh, S.; Singh, N. Big Data analytics. In Proceedings of the 2012 International Conference on Communication, Information & Computing Technology, Mumbai, India, 19–20 October 2012; pp. 1–4. [ Google Scholar ] [ CrossRef ]
  • Where Big Data Projects Fail. Available online: https://www.forbes.com/sites/bernardmarr/2015/03/17/where-big-data-projects-fail/?sh=22d01455239f (accessed on 1 January 2021).
  • Cato, P.; Golzer, P.; Demmelhuber, W. An investigation into the implementation factors affecting the success of big data systems. In Proceedings of the 2015 11th International Conference on Innovations in Information Technology, Dubai, United Arab Emirates, 1–3 November 2015; pp. 134–139. [ Google Scholar ] [ CrossRef ]
  • Motau, M.; Kalema, B.M. Big Data Analytics Readiness: A South African Public Sector Perspective. In Proceedings of the 2016 IEEE International Conference on Emerging Technologies and Innovative Business Practices for the Transformation of Societies, Balaclava, Mauritius, 3–6 August 2016. [ Google Scholar ]
  • Big Security for Big Data: Addressing Security Challenges for the Big Data Infrastructure. Available online: https://doi.org/10.1007/978-3-319-06811-4 (accessed on 1 January 2021).
  • Soon, K.W.K.; Lee, C.A.; Boursier, P. A study of the determinants affecting adoption of big data using integrated Technology Acceptance Model (TAM) and diffusion of innovation (DOI) in Malaysia. Int. J. Appl. Bus. Econ. Res. 2016 , 14 , 17–47. [ Google Scholar ]
  • Kaisler, S.; Armour, F.; Espinosa, J.A.; Money, W. Big Data: Issues and Challenges Moving Forward. In Proceedings of the 2013 46th Hawaii International Conference on System Sciences, Wailea, HI, USA, 7–10 January 2013; pp. 995–1004. [ Google Scholar ] [ CrossRef ]
  • Khan, N.; Yaqoob, I.; Hashem, I.A.; Inayat, Z.; Ali, W.M.; Shiraz, M.; Gani, A.; Member, S. Big Data: Survey, Technologies, Opportunities, and Challenges. Sci. World J. 2014 , 2014 , 712826. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Sagiroglu, S.; Sinanc, D. Big data: A review. In Proceedings of the 2013 International Conference on Collaboration Technologies and Systems, San Diego, CA, USA, 20–24 May 2013; pp. 42–47. [ Google Scholar ] [ CrossRef ]
  • Saxena, S. Integrating Open and Big Data via e-Oman: Prospects and issues—Read. Contemp. Arab Aff. 2016 , 9 , 607–621. [ Google Scholar ] [ CrossRef ]
  • Mohanty, S.K.; Jagadeesh, M.; Srivatsa, H.K. Big Data Imperatives: Enterprise Big Data Warehouse, BI Implementations and Analytics. Available online: https://www.ptonline.com/articles/how-to-get-better-mfi-results (accessed on 1 January 2021).
  • Harnessing the Potential of Big Data in Post-Pandemic Southeast Asia. Available online: https://www.adb.org/publications/potential-big-data-post-pandemic-southeast-asia (accessed on 1 January 2021).
  • Long, C.K.; Agrawal, R.; Trung, H.Q.; Van Pham, H. A big data framework for E-Government in Industry 4.0. Open Comput. Sci. 2021 , 11 , 461–479. [ Google Scholar ] [ CrossRef ]
  • Sivarajah, U.; Kamal, M.M.; Irani, Z.; Weerakkody, V. Critical analysis of Big Data challenges and analytical methods. J. Bus. Res. 2017 , 70 , 263–286. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Sheng, J.; Amankwah-Amoah, J.; Khan, Z.; Wang, X. COVID-19 Pandemic in the New Era of Big Data Analytics: Methodological Innovations and Future Research Directions. Br. J. Manag. 2021 , 32 , 1164–1183. [ Google Scholar ] [ CrossRef ]
  • Delen, D.; Zolbanin, H.M. The analytics paradigm in business research. J. Bus. Res. 2018 , 90 , 186–195. [ Google Scholar ] [ CrossRef ]
  • Rustagi, V.; Bajaj, M.; Tanvi; Singh, P.; Aggarwal, R.; AlAjmi, M.F.; Hussain, A.; Hassan, M.I.; Singh, A.; Singh, I.K. Analyzing the Effect of Vaccination Over COVID Cases and Deaths in Asian Countries Using Machine Learning Models. Front. Cell. Infect. Microbiol. 2022 , 11 , 806265. [ Google Scholar ] [ CrossRef ] [ PubMed ]
  • Lepenioti, K.; Bousdekis, A.; Apostolou, D.; Mentzas, G. Prescriptive analytics: Literature review and research challenges. Int. J. Inf. Manag. 2020 , 50 , 57–70. [ Google Scholar ] [ CrossRef ]
  • Delen, D.; Ram, S. Research challenges and opportunities in business analytics. J. Bus. Anal. 2018 , 1 , 2–12. [ Google Scholar ] [ CrossRef ]
  • Sekli, G.F.M.; De La Vega, I. Adoption of big data analytics and its impact on organizational performance in higher education mediated by knowledge management. J. Open Innov. Technol. Mark. Complex. 2021 , 7 , 221. [ Google Scholar ] [ CrossRef ]
  • Nageshwaran, G.; Harris, R.C.; El Guerche-Seblain, C. Review of the role of big data and digital technologies in controlling COVID-19 in Asia: Public health interest vs. privacy. Digit. Health 2021 , 7 , 20552076211002953. [ Google Scholar ] [ CrossRef ]
  • Henke, N.; Puri, A.; Saleh, T. Accelerating analytics to navigate COVID-19 and the next normal. McKinsey Anal. 2020 , 9. [ Google Scholar ]
  • Sözen, M.E.; Sarlyer, G.; Ataman, M.G. Big data analytics and COVID-19: Investigating the relationship between government policies and cases in Poland, Turkey and South Korea. Health Policy Plan. 2022 , 37 , 100–111. [ Google Scholar ] [ CrossRef ]
  • Nwanga, M.E.; Onwuka, E.N.; Aibinu, A.M.; Ubadike, O.C. Impact of Big Data Analytics to Nigerian mobile phone industry. In Proceedings of the 2015 International Conference on Industrial Engineering and Operations Management, Dubai, United Arab Emirates, 3–5 March 2015; pp. 1314–1319. [ Google Scholar ] [ CrossRef ]
  • Mathrani, S.; Lai, X. Big data analytic framework for organizational leverage. Appl. Sci. 2021 , 11 , 2340. [ Google Scholar ] [ CrossRef ]
  • Akter, S.; Wamba, S.F. Big data analytics in E-commerce: A systematic review and agenda for future research. Electron. Mark. 2016 , 26 , 173–194. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Naik, K.; Joshi, A. Role of Big Data in various sectors. In Proceedings of the 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam, India, 10–11 February 2017; pp. 117–122. [ Google Scholar ] [ CrossRef ]
  • Chen, Y.; Alspaugh, S.; Katz, R. Interactive analytical processing in big data systems: A crossindustry study of mapreduce workloads. Proc. VLDB Endow. 2012 , 5 , 1802–1813. [ Google Scholar ] [ CrossRef ]
  • Chakraborty, S.; Saha, A.K.; Ezugwu, A.E.; Agushaka, J.O.; Zitar, R.A.; Abualigah, L. Differential Evolution and Its Applications in Image Processing Problems: A Comprehensive Review. Arch. Comput. Methods Eng. 2022 , 1–56. [ Google Scholar ] [ CrossRef ]
  • Yousefinaghani, S.; Dara, R.; Mubareka, S.; Papadopoulos, A.; Sharif, S. An analysis of COVID-19 vaccine sentiments and opinions on Twitter. Int. J. Infect. Dis. 2021 , 108 , 256–262. [ Google Scholar ] [ CrossRef ] [ PubMed ]
  • Mellado, B.; Wu, J.; Kong, J.D.; Bragazzi, N.L.; Asgary, A.; Kawonga, M.; Choma, N.; Hayasi, K.; Lieberman, B.; Mathaha, T.; et al. Leveraging artificial intelligence and big data to optimize covid-19 clinical public health and vaccination roll-out strategies in Africa. Int. J. Environ. Res. Public Health 2021 , 18 , 7890. [ Google Scholar ] [ CrossRef ]
  • E-Government Survey 2020—Digital Government in the Decade of Action for Sustainable Development: With Addendum on COVID-19. Available online: https://www.scienceopen.com/book?vid=bc74a872-5582-485a-aafe-260bd9a415bd. (accessed on 1 January 2021).
  • REal-Time Data Monitoring for Shared, Adaptive, Multi-Domain and Personalised Prediction and Decision Making for Long-Term Pulmonary Care Ecosystems (RE-SAMPLE). Available online: https://clinicaltrials.gov/ct2/show/NCT04955080 (accessed on 1 January 2021).
  • Gherheș, V.; Stoian, C.E.; Fărcașiu, M.A.; Stanici, M. E-learning vs. Face-to-face learning: Analyzing students’ preferences and behaviors. Sustainability 2021 , 13 , 4381. [ Google Scholar ] [ CrossRef ]
  • OECD. An Assessment of the Impact of COVID-19 on Job and Skills Demand Using Online Job Vacancy Data ; OECD: Paris, France, 2021; pp. 1–19. [ Google Scholar ]
  • Innovations From The Nation. Available online: https://www.mbrcgi.gov.ae/en/enrich/innovations-from-the-nation (accessed on 1 January 2021).
  • Haleem, A.; Javaid, M.; Khan, I.H.; Vaishya, R. Significant Applications of Big Data in COVID-19 Pandemic. Indian J. Orthop. 2020 , 54 , 526–528. [ Google Scholar ] [ CrossRef ] [ PubMed ]
  • Perspective, A.W. Winning in the Next Normal with Insights: How Banks & FIs Are Preparing. 2022. Available online: https://www.wns.com/%0Aperspectives/blogs/blogdetail/932/%0Awinning-in-the-next-normal-with-insights-how-banks--fis-are-preparing%0A (accessed on 1 January 2021).
  • Big Data Analytics in Healthcare. Available online: https://www.hindawi.com/journals/jhe/si/971905/ (accessed on 1 January 2021).
  • Strickland, N.H. PACS (picture archiving and communication systems): Filmless radiology. Arch. Dis. Child. 2000 , 83 , 82–86. [ Google Scholar ] [ CrossRef ] [ PubMed ]
  • Dash, S.; Shakyawar, S.K.; Sharma, M.; Kaushik, S. Big data in healthcare: Management, analysis and future prospects. J. Big Data 2019 , 6 , 54. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Dayrit, M.M.; Lagrada, L.P.; Picazo, O.F.; Pons, M.C.; Villaverde, M.C. The Philippines Health System Review. Health Syst. Transit. 2018 , 8 , 2. [ Google Scholar ]
  • Wu, J.; Wang, J.; Nicholas, S.; Maitland, E.; Fan, Q. Application of big data technology for COVID-19 prevention and control in China: Lessons and recommendations. J. Med. Internet Res. 2020 , 22 , e21980. [ Google Scholar ] [ CrossRef ]
  • Berendt, B.; Littlejohn, A.; Kern, P.; Mitros, P.; Shacklock; Blakemore, M. Big Data for Monitoring Educational Systems ; Publications Office of the European Union: Luxembourg, 2017. [ Google Scholar ]
  • Dishon, G. New data, old tensions: Big data, personalized learning, and the challenges of progressive education. Theory Res. Educ. 2017 , 15 , 272–289. [ Google Scholar ] [ CrossRef ]
  • Ruiz-Palmero, J.; Colomo-Magaña, E.; Ríos-Ariza, J.M.; Gómez-García, M. Big data in education: Perception of training advisors on its use in the educational system. Soc. Sci. 2020 , 9 , 53. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Fischer, C.; Pardos, Z.A.; Baker, R.S.; Williams, J.J.; Smyth, P.; Yu, R.; Slater, S.; Baker, R.; Warschauer, M. Mining Big Data in Education: Affordances and Challenges. Rev. Res. Educ. 2020 , 44 , 130–160. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Ochoa, X.; Worsley, M. Editorial: Augmenting Learning Analytics with Multimodal Sensory Data. J. Learn. Anal. 2016 , 3 , 213–219. [ Google Scholar ] [ CrossRef ]
  • Crossley, S.; Ocumpaugh, J.; Labrum, M.; Bradfield, F.; Dascalu, M.; Baker, R.S. Modeling math identity and math success through sentiment analysis and linguistic features. In Proceedings of the International Conference on Educational Data Mining (EDM), Raleigh, NC, USA, 16–20 July 2018. [ Google Scholar ]
  • Chaturapruek, S.; Dee, T.S.; Johari, R.; Kizilcec, R.F.; Stevens, M.L. How a data-driven course planning tool affects college students’ GPA: Evidence from two field experiments. In Proceedings of the 5th Annual ACM Conference on Learning at Scale, London, UK, 26–28 June 2018. [ Google Scholar ] [ CrossRef ]
  • Lukosch, H.K.; Bekebrede, G.; Kurapati, S.; Lukosch, S.G. A Scientific Foundation of Simulation Games for the Analysis and Design of Complex Systems. Simul. Gaming 2018 , 49 , 279–314. [ Google Scholar ] [ CrossRef ]
  • Teaching, E.; Educational, L.T.; Mining, D.; Sin, K.; Muthu, L.; Prakash, B.R.; Hanumanthappa, M.; Kavitha, V. Enhancing Teaching and Learning Through Educational Data Mining and Learning Analytics: An Issue Brief. ICTACT J. Soft Comput. 2015 , 5 , 1035–1049. [ Google Scholar ]
  • Anaya, A.R.; Boticario, J.G. A data mining approach to reveal representative collaboration indicators in open collaboration frameworks. In Proceedings of the International Conference on Educational Data Mining (EDM), Cordoba, Spain, 1–3 July 2009; pp. 210–219. [ Google Scholar ]
  • Wang, X.; Guo, B.; Shen, Y. Predicting the At-Risk Online Students Based on the Click Data Distribution Characteristics. Sci. Program. 2022 , 2022 , 9938260. [ Google Scholar ] [ CrossRef ]
  • Zhu, Z.T.; Yu, M.H.; Riezebos, P. A research framework of smart education. Smart Learn. Environ. 2016 , 3 , 4. [ Google Scholar ] [ CrossRef ] [ Green Version ]
  • Torre-Bastida, A.I.; Del Ser, J.; Laña, I.; Ilardia, M.; Bilbao, M.N.; Campos-Cordobés, S. Big Data for transportation and mobility: Recent advances, trends and challenges. IET Intell. Transp. Syst. 2018 , 12 , 742–755. [ Google Scholar ] [ CrossRef ]
  • Selod, H.; Soumahoro, S. Big Data in Transportation: An Economics Perspective ; World Bank: Washington, DC, USA, 2020. [ Google Scholar ]
  • Hou, Y.; Chen, J.; Wen, S. The effect of the dataset on evaluating urban traffic prediction. Alex. Eng. J. 2021 , 60 , 597–613. [ Google Scholar ] [ CrossRef ]
  • A Research Agenda for Transport Policy. Available online: https://www.e-elgar.com/shop/gbp/a-research-agenda-for-transport-policy-9781788970198.html (accessed on 1 January 2021).
  • New Traffic Data Sources—An Overview. Available online: https://www.bitre.gov.au/sites/default/files/2019-12/NewDataSources-BackgroundPaper-April%202014.pdf. (accessed on 1 January 2021).
  • Mining the Datasphere: Big Data, Technologies, and Transportation Disaster Management. Available online: https://sbenrc.com.au/app/uploads/2021/03/SBEnrc-Project-1.45-Milestone-1-Congestion-v2.pdf (accessed on 1 January 2021).
  • Blazquez, D.; Domenech, J. Big Data sources and methods for social and economic analyses. Technol. Forecast. Soc. Chang. 2018 , 130 , 99–113. [ Google Scholar ] [ CrossRef ]
  • Nobanee, H.; Dilshad, M.N.; Al Dhanhani, M.; Al Neyadi, M.; Al Qubaisi, S.; Al Shamsi, S. Big Data Applications the Banking Sector: A Bibliometric Analysis Approach. SAGE Open 2021 , 11 , 4. [ Google Scholar ] [ CrossRef ]
  • Gasser, L.Z.U.; Gassmann, O.; Hens, T.; Leifer, L.; Puschmann, T. Digital Banking 2025. 2017. Available online: http://www.dv.co.th/blog-th/digital-banking-trend/ (accessed on 1 January 2021).
  • Bhasin, M.L. Combatting Bank Frauds by Integration of Technology: Experience of a Developing Country. Br. J. Res. 2016 , 3 , 64–92. [ Google Scholar ]
  • Vives, X. Digital Disruption in Banking and Its Impact on Competition ; OECD: Paris, France, 2020; pp. 1–50. [ Google Scholar ]
  • Alexandru, A.G.; Radu, I.M.; Bizon, M.-L. Big Data in Healthcare—Opportunities and Challenges. Inform. Econ. 2018 , 22 , 43–54. [ Google Scholar ] [ CrossRef ]
  • Bourdeaux, M. Reimagining the Role of Technology in Education. Relig. Communist Lands 2017 , 9 , 2–3. [ Google Scholar ] [ CrossRef ]
  • Modeling the Big Data Challenges in Context of Smart Cities—An Integrated Fuzzy ISM-DEMATEL Approach. Available online: https://doi.org/10.1108/IJBPA-02-2021-0027 (accessed on 1 January 2021).
  • World Bank. Data Big in Action for Government ; World Bank: Washington, DC, USA, 2017; p. 18. [ Google Scholar ]
  • Use of Technology in the Ebola Response in West Africa. Available online: https://pdf.usaid.gov/pdf_docs/PA00K99H.pdf (accessed on 1 January 2021).
  • Oussous, A.; Benjelloun, F.Z.; Ait Lahcen, A.; Belfkih, S. Big Data technologies: A survey. J. King Saud Univ.—Comput. Inf. Sci. 2018 , 30 , 431–448. [ Google Scholar ] [ CrossRef ]
  • ENOVA Annual Report 2013. Available online: https://www.enova.no/upload_images/5649E609BFEA4B2A890777583DD1654D.pdf (accessed on 1 January 2021).
  • Data Analytics Tools in Higher Education. Available online: https://ceur-ws.org/Vol-3061/ERIS_2021-art10(sh).pdf (accessed on 1 January 2021).
  • Learning Analytics in Higher Education: A Review of UK and International Practice Full Report. Available online: https://www.jisc.ac.uk/sites/default/files/learning-analytics-in-he-v2_0.pdf%0Ahttps://www.jisc.ac.uk/reports/learning-analytics-in-higher-education (accessed on 1 January 2021).
  • Alsrehin, N.O.; Klaib, A.F.; Magableh, A. Intelligent Transportation and Control Systems Using Data Mining and Machine Learning Techniques: A Comprehensive Study. IEEE Access 2019 , 7 , 49830–49857. [ Google Scholar ] [ CrossRef ]
  • Big Data in Banking for Marketers—How to Derive Value from Big Data. Available online: https://dataanalytics.report/Resources/Whitepapers/be309286-97b2-497d-9c6d-af9de5f620e1_bank-2020---big-data---whitepaper.pdf (accessed on 1 January 2021).
  • Corsi, A.; de Souza, F.F.; Pagani, R.N.; Kovaleski, J.L. Big data analytics as a tool for fighting pandemics: A systematic review of literature. J. Ambient Intell. Humaniz. Comput. 2021 , 12 , 9163–9180. [ Google Scholar ] [ CrossRef ]
  • Gigauri, I. Effects of COVID-19 on Human Resource Management from the Perspective of Digitalization and Work-life-balance. Int. J. Innov. Technol. Econ. 2020 , 31 , 1–8. [ Google Scholar ] [ CrossRef ]
  • OCED. The Territorial Impact of COVID-19: Managing The Crisis across Levels of Government. 2020. Available online: https://www.oecd.org/coronavirus/policy-responses/the-territorial-impact-of-covid-19-managing-the-crisis-across-levels-of-government-d3e314e1/ (accessed on 1 January 2021).
  • Big Data, Big Outcomes: How Analytics Can Transform Public Services and Improve Citizens’ Lives. Available online: https://assets.ey.com/content/dam/ey-sites/ey-com/en_gl/topics/future-of-government/ey-future-of-gov-digital-analytics-report.pdf?download#:~:text=By%20combining%20data%20from%20a,%2C%20well%2Dbeing%20and%20safety (accessed on 1 January 2021).
  • E-Government Survey 2020. Available online: https://publicadministration.un.org/egovkb/en-us/Reports/UN-E-Government-Survey-2020 (accessed on 1 January 2021).
  • Mihalis, K. Ten technologies to fight coronavirus. Eur. Parliam. Res. Serv. 2020 , 1–20. [ Google Scholar ]
  • Central Bank of the Russian Federation. Using Big Data in The Financial Sector and Risk to Financial Stability ; Central Bank of the Russian Federation: Moscow, Russia, 2021. [ Google Scholar ]
  • Asli Demirgüç-Kunt, J.H.; Klapper, L.; Singer, D.; Ansar, S. The Global Findex Database 2017: Measuring Financial Inclusion and the Fintech Revolution ; World Bank: Washington, DC, USA, 2017; ISBN 9781464812590. [ Google Scholar ]
  • OECD. Artificial Intelligence, Machine Learning and Big Data in Finance: Opportunities, Challenges, and Implications for Policy Makers. 2021. Available online: https://www.oecd.org/finance/financial-markets/Artificial-intelligence-machine-learning-big-data-in-finance.pdf (accessed on 1 January 2021).

Click here to enlarge figure

FieldData Analytics TypeHow BDA Has Been UtilizedData Processing Models Used to Analyse Big DataReference
HealthcareDescriptive and Predictive Data Analytics Proactive actions and interventions based on predictive models to trigger any noncommunicable diseases.Predictive models based on search engines and social media data.
Smart phone applications tracking system to identify infection hot spots
[ , ]
Perspective Data AnalyticsVaccine distributionSentiment analysis to reduce community resistance towards the vaccine.[ , , ]
Diagnostic and Predictive Data Analytics Vaccine distributionMachine learning models to prioritize the citizens’ need and urgency to the vaccine
Diagnostic and Prescriptive Data Analytics Monitoring live and frequent data on the spread of the disease
Provide more personalized consultations by “virtual doctors”
Dashboards
AI Chabot
[ , ]
EducationDescriptive Data Analytics Enhance online educational platform experience Analyzing data captured from online educational platforms can ease educators remote leaning experience[ , ]
Diagnostic Data Analytics Bridge the gap of unemploymentAnalysis of data captured from job portals[ , ]
TransportationDescriptive and Prescriptive Data Analytics implementation of precautionary measures-Ensure social distancing in public transportationCapturing relevant data and use machine learning techniques to detect incompliance actions[ ]
Detect citizens’ commute route to store their travel history. Use both AI and Big data applications to capture, track and predict valuable insights about citizens movement within and across cities and countries [ ]
Banking Fraud Detection Use AI and ML techniques to describe and detect real-time abnormal activities and online transaction, and build ML models based on classification algorithims to predict any suspecious case.[ ]
Descriptive and Predictive Data AnalyticsRisk AssessmentUse both diagnositic and prescriptuve data analytics models to analyze real-time data and asses the creditworthiness to customers. Consequenlty developing the appropriate cutomer portfolio and tailor clients needs to their services. Cossequently boosting customers’ satisfaction, loayality and enhance banks botom line records. [ ]
FieldOpportunitiesDescriptionReference
HealthcareServe efficiently considering both value and costs to individual casesBDA have powerful ability to highlight the correlation and patterns between different variables rather than finding the casual inference between them and serve individual patients’ cases.[ , , , , ]
EducationImprove the learning process
Provide real time feedback and construct development plans
Construct a more personalized learning environment
Enrich the learning environment
Utilize BDA in marketing research purposes for institutions
BDA enables educational institutes and professionals to personalize the educational experience for students[ , , , , , , , ]
TransportationThe base for researchers, economists and regulators to analyze traffic flow, congestion and their social, economic and environmental impacts.
Apply a combination of new methods of analysis such as AI approaches, to pave the way for predicting and providing innovative solutions for the future in the field of transportation.
BDA predictive capabilities and the incorporation of economic insights can exceed the ability to understand and analyze the past and real time data, to predict the optimal legislations for traffic congestion issues in smart cities.[ , , , , , ]
BanksDetect fraud cases
Ease the merge and acquisition operations Optimize banking supply chain performance
Interpret clients’ behaviors.
Provide valued and satisfactory services to clients.
Analyze, predict, and visualize both external market conditions and internal clients’ trends and preferences
Increase market share and enhance profitability.
BDA supported the introduction of digital banking operations and virtual banking systems[ , , ]
Field in ChargeApplication NameDescriptionReference
HealthEbola Open Data InitiativeWest Africa-data has been utilized to develop an open-source global model for tracking the cases of Ebola cases in in 2014[ , , ]
HealthMapa platform used to visualize diseases trends and provides an early trigger on the proper response[ , ]
Proactive listening, mobile phone-based systemBrazil-to govern the issue of bribes in the health services, and handle any related issues and take an immediate and effective action against corruption.[ ]
EducationENOVAMexico, through the utilization of data and data analytics can analyze and predict students’ interactions. Consequently, boosts the educational strategies and enhances the used tools and techniques in the teaching-learning process.[ , ]
(PASS) Personalized Adaptive Study SuccessThe Open University Australia-Predicts course material, beside a more personalized studying environment. The predictive data analytics model is based on analyzing students, individual characteristics, beside other student related data captured from other systems.
The main goal of the application is to develop a more customized environment that ensures students involvement, engagement, and retention in an e-learning environment.
[ ]
TransportationOpenTraffic platformAn application to support in urban infrastructure decisions, based on data captured from both vehicles and smartphones, to analyze it and visualize it into both historic and real-time traffic situations.[ , ]
Seoul, South Africa-the application is used to support night bus drivers to ease their journey from origin to destination. This will occur through capturing data from tremendous number of calls and text data points, as well as private and corporate taxi data sources.[ ]
BanksAvaloq, Finnova, SAP, Sungard and TemenosOCBC is the largest bank in terms of market capitalization in Singapore. It operates in more than 15 countries globally. It is a success example of the utilization of BDA. For instance, the bank responded to customer actions, customers’ personalized events and their demographic profiles. Hence, OCBC Bank succeed in achieving higher customer engagement and increasing the level of customer satisfaction by 20% in comparison to a control group.
These core banking applications, such as Avaloq, Finnova, SAP, Sungard or Temenos for example, were designed to handle large amounts of transactions in back-office processes for basic financial products and services, such as bank accounts, deposits, etc.
[ , ]
FieldData Analytics TypeHow BDA Has Been UtilizedMethod/ModelReference
HealthcareDescriptive and Predictive Data Analytics ModelsProactive actions and interventions based on predictive models to trigger any noncommunicable diseases.Predictive models based on search engines and social media data.
Smart phone applications tracking system to identify infection hot spots
[ , ]
Perspective Data AnalyticsVaccine distributionSentiment analysis to reduce community resistance towards the vaccine.[ , ]
Diagnostic and Predictive Data Analytics ModelsVaccine distributionMachine learning models to prioritize the citizens’ need and urgency to the vaccine
Diagnostic and Prescriptive Data Analytics ModelsMonitoring live and frequent data on the spread of the disease
Provide more personalized consultations by “virtual doctors”
Dashboards
AI Chabot
[ , ]
EducationDescriptive Data Analytics ModelEnhance online educational platform experience Analyzing data captured from online educational platforms can ease educators remote leaning experience[ , ]
Diagnostic Data Analytics ModelBridge the gap of unemploymentAnalysis of data captured from job portals[ , ]
TransportationDescriptive and Prescriptive Data Analytics Modelsimplementation of precautionary measures-Ensure social distancing in public transportationCapturing relevant data and use machine learning techniques to detect incompliance actions[ ]
Descriptive and Predictive Data Analytics ModelsDetect citizens’ commute route to store their travel history. Use both AI and Big data applications to capture, track and predict valuable insights about citizens movement within and across cities and countries [ ]
Banking Fraud Detection Use AI and ML techniques to describe and detect real-time abnormal activities and online transaction, and build ML models based on classification algorithims to predict any suspecious case.[ ]
Risk AssessmentUse both diagnositic and prescriptuve data analytics models to analyze real-time data and asses the creditworthiness to customers. Consequenlty developing the appropriate cutomer portfolio and tailor clients needs to their services. Cossequently boosting customers’ satisfaction, loayality and enhance banks botom line records.[ ]
MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

Al-Sai, Z.A.; Husin, M.H.; Syed-Mohamad, S.M.; Abdin, R.M.S.; Damer, N.; Abualigah, L.; Gandomi, A.H. Explore Big Data Analytics Applications and Opportunities: A Review. Big Data Cogn. Comput. 2022 , 6 , 157. https://doi.org/10.3390/bdcc6040157

Al-Sai ZA, Husin MH, Syed-Mohamad SM, Abdin RMS, Damer N, Abualigah L, Gandomi AH. Explore Big Data Analytics Applications and Opportunities: A Review. Big Data and Cognitive Computing . 2022; 6(4):157. https://doi.org/10.3390/bdcc6040157

Al-Sai, Zaher Ali, Mohd Heikal Husin, Sharifah Mashita Syed-Mohamad, Rasha Moh’d Sadeq Abdin, Nour Damer, Laith Abualigah, and Amir H. Gandomi. 2022. "Explore Big Data Analytics Applications and Opportunities: A Review" Big Data and Cognitive Computing 6, no. 4: 157. https://doi.org/10.3390/bdcc6040157

Article Metrics

Article access statistics, further information, mdpi initiatives, follow mdpi.

MDPI

Subscribe to receive issue release notifications and newsletters from MDPI journals

  • How It Works
  • PhD thesis writing
  • Master thesis writing
  • Bachelor thesis writing
  • Dissertation writing service
  • Dissertation abstract writing
  • Thesis proposal writing
  • Thesis editing service
  • Thesis proofreading service
  • Thesis formatting service
  • Coursework writing service
  • Research paper writing service
  • Architecture thesis writing
  • Computer science thesis writing
  • Engineering thesis writing
  • History thesis writing
  • MBA thesis writing
  • Nursing dissertation writing
  • Psychology dissertation writing
  • Sociology thesis writing
  • Statistics dissertation writing
  • Buy dissertation online
  • Write my dissertation
  • Cheap thesis
  • Cheap dissertation
  • Custom dissertation
  • Dissertation help
  • Pay for thesis
  • Pay for dissertation
  • Senior thesis
  • Write my thesis

214 Best Big Data Research Topics for Your Thesis Paper

big data research topics

Finding an ideal big data research topic can take you a long time. Big data, IoT, and robotics have evolved. The future generations will be immersed in major technologies that will make work easier. Work that was done by 10 people will now be done by one person or a machine. This is amazing because, in as much as there will be job loss, more jobs will be created. It is a win-win for everyone.

Big data is a major topic that is being embraced globally. Data science and analytics are helping institutions, governments, and the private sector. We will share with you the best big data research topics.

On top of that, we can offer you the best writing tips to ensure you prosper well in your academics. As students in the university, you need to do proper research to get top grades. Hence, you can consult us if in need of research paper writing services.

Big Data Analytics Research Topics for your Research Project

Are you looking for an ideal big data analytics research topic? Once you choose a topic, consult your professor to evaluate whether it is a great topic. This will help you to get good grades.

  • Which are the best tools and software for big data processing?
  • Evaluate the security issues that face big data.
  • An analysis of large-scale data for social networks globally.
  • The influence of big data storage systems.
  • The best platforms for big data computing.
  • The relation between business intelligence and big data analytics.
  • The importance of semantics and visualization of big data.
  • Analysis of big data technologies for businesses.
  • The common methods used for machine learning in big data.
  • The difference between self-turning and symmetrical spectral clustering.
  • The importance of information-based clustering.
  • Evaluate the hierarchical clustering and density-based clustering application.
  • How is data mining used to analyze transaction data?
  • The major importance of dependency modeling.
  • The influence of probabilistic classification in data mining.

Interesting Big Data Analytics Topics

Who said big data had to be boring? Here are some interesting big data analytics topics that you can try. They are based on how some phenomena are done to make the world a better place.

  • Discuss the privacy issues in big data.
  • Evaluate the storage systems of scalable in big data.
  • The best big data processing software and tools.
  • Data mining tools and techniques are popularly used.
  • Evaluate the scalable architectures for parallel data processing.
  • The major natural language processing methods.
  • Which are the best big data tools and deployment platforms?
  • The best algorithms for data visualization.
  • Analyze the anomaly detection in cloud servers
  • The scrutiny normally done for the recruitment of big data job profiles.
  • The malicious user detection in big data collection.
  • Learning long-term dependencies via the Fourier recurrent units.
  • Nomadic computing for big data analytics.
  • The elementary estimators for graphical models.
  • The memory-efficient kernel approximation.

Big Data Latest Research Topics

Do you know the latest research topics at the moment? These 15 topics will help you to dive into interesting research. You may even build on research done by other scholars.

  • Evaluate the data mining process.
  • The influence of the various dimension reduction methods and techniques.
  • The best data classification methods.
  • The simple linear regression modeling methods.
  • Evaluate the logistic regression modeling.
  • What are the commonly used theorems?
  • The influence of cluster analysis methods in big data.
  • The importance of smoothing methods analysis in big data.
  • How is fraud detection done through AI?
  • Analyze the use of GIS and spatial data.
  • How important is artificial intelligence in the modern world?
  • What is agile data science?
  • Analyze the behavioral analytics process.
  • Semantic analytics distribution.
  • How is domain knowledge important in data analysis?

Big Data Debate Topics

If you want to prosper in the field of big data, you need to try even hard topics. These big data debate topics are interesting and will help you to get a better understanding.

  • The difference between big data analytics and traditional data analytics methods.
  • Why do you think the organization should think beyond the Hadoop hype?
  • Does the size of the data matter more than how recent the data is?
  • Is it true that bigger data are not always better?
  • The debate of privacy and personalization in maintaining ethics in big data.
  • The relation between data science and privacy.
  • Do you think data science is a rebranding of statistics?
  • Who delivers better results between data scientists and domain experts?
  • According to your view, is data science dead?
  • Do you think analytics teams need to be centralized or decentralized?
  • The best methods to resource an analytics team.
  • The best business case for investing in analytics.
  • The societal implications of the use of predictive analytics within Education.
  • Is there a need for greater control to prevent experimentation on social media users without their consent?
  • How is the government using big data; for the improvement of public statistics or to control the population?

University Dissertation Topics on Big Data

Are you doing your Masters or Ph.D. and wondering the best dissertation topic or thesis to do? Why not try any of these? They are interesting and based on various phenomena. While doing the research ensure you relate the phenomenon with the current modern society.

  • The machine learning algorithms are used for fall recognition.
  • The divergence and convergence of the internet of things.
  • The reliable data movements using bandwidth provision strategies.
  • How is big data analytics using artificial neural networks in cloud gaming?
  • How is Twitter accounts classification done using network-based features?
  • How is online anomaly detection done in the cloud collaborative environment?
  • Evaluate the public transportation insights provided by big data.
  • Evaluate the paradigm for cancer patients using the nursing EHR to predict the outcome.
  • Discuss the current data lossless compression in the smart grid.
  • How does online advertising traffic prediction helps in boosting businesses?
  • How is the hyperspectral classification done using the multiple kernel learning paradigm?
  • The analysis of large data sets downloaded from websites.
  • How does social media data help advertising companies globally?
  • Which are the systems recognizing and enforcing ownership of data records?
  • The alternate possibilities emerging for edge computing.

The Best Big Data Analysis Research Topics and Essays

There are a lot of issues that are associated with big data. Here are some of the research topics that you can use in your essays. These topics are ideal whether in high school or college.

  • The various errors and uncertainty in making data decisions.
  • The application of big data on tourism.
  • The automation innovation with big data or related technology
  • The business models of big data ecosystems.
  • Privacy awareness in the era of big data and machine learning.
  • The data privacy for big automotive data.
  • How is traffic managed in defined data center networks?
  • Big data analytics for fault detection.
  • The need for machine learning with big data.
  • The innovative big data processing used in health care institutions.
  • The money normalization and extraction from texts.
  • How is text categorization done in AI?
  • The opportunistic development of data-driven interactive applications.
  • The use of data science and big data towards personalized medicine.
  • The programming and optimization of big data applications.

The Latest Big Data Research Topics for your Research Proposal

Doing a research proposal can be hard at first unless you choose an ideal topic. If you are just diving into the big data field, you can use any of these topics to get a deeper understanding.

  • The data-centric network of things.
  • Big data management using artificial intelligence supply chain.
  • The big data analytics for maintenance.
  • The high confidence network predictions for big biological data.
  • The performance optimization techniques and tools for data-intensive computation platforms.
  • The predictive modeling in the legal context.
  • Analysis of large data sets in life sciences.
  • How to understand the mobility and transport modal disparities sing emerging data sources?
  • How do you think data analytics can support asset management decisions?
  • An analysis of travel patterns for cellular network data.
  • The data-driven strategic planning for citywide building retrofitting.
  • How is money normalization done in data analytics?
  • Major techniques used in data mining.
  • The big data adaptation and analytics of cloud computing.
  • The predictive data maintenance for fault diagnosis.

Interesting Research Topics on A/B Testing In Big Data

A/B testing topics are different from the normal big data topics. However, you use an almost similar methodology to find the reasons behind the issues. These topics are interesting and will help you to get a deeper understanding.

  • How is ultra-targeted marketing done?
  • The transition of A/B testing from digital to offline.
  • How can big data and A/B testing be done to win an election?
  • Evaluate the use of A/B testing on big data
  • Evaluate A/B testing as a randomized control experiment.
  • How does A/B testing work?
  • The mistakes to avoid while conducting the A/B testing.
  • The most ideal time to use A/B testing.
  • The best way to interpret results for an A/B test.
  • The major principles of A/B tests.
  • Evaluate the cluster randomization in big data
  • The best way to analyze A/B test results and the statistical significance.
  • How is A/B testing used in boosting businesses?
  • The importance of data analysis in conversion research
  • The importance of A/B testing in data science.

Amazing Research Topics on Big Data and Local Governments

Governments are now using big data to make the lives of the citizens better. This is in the government and the various institutions. They are based on real-life experiences and making the world better.

  • Assess the benefits and barriers of big data in the public sector.
  • The best approach to smart city data ecosystems.
  • The big analytics used for policymaking.
  • Evaluate the smart technology and emergence algorithm bureaucracy.
  • Evaluate the use of citizen scoring in public services.
  • An analysis of the government administrative data globally.
  • The public values are found in the era of big data.
  • Public engagement on local government data use.
  • Data analytics use in policymaking.
  • How are algorithms used in public sector decision-making?
  • The democratic governance in the big data era.
  • The best business model innovation to be used in sustainable organizations.
  • How does the government use the collected data from various sources?
  • The role of big data for smart cities.
  • How does big data play a role in policymaking?

Easy Research Topics on Big Data

Who said big data topics had to be hard? Here are some of the easiest research topics. They are based on data management, research, and data retention. Pick one and try it!

  • Who uses big data analytics?
  • Evaluate structure machine learning.
  • Explain the whole deep learning process.
  • Which are the best ways to manage platforms for enterprise analytics?
  • Which are the new technologies used in data management?
  • What is the importance of data retention?
  • The best way to work with images is when doing research.
  • The best way to promote research outreach is through data management.
  • The best way to source and manage external data.
  • Does machine learning improve the quality of data?
  • Describe the security technologies that can be used in data protection.
  • Evaluate token-based authentication and its importance.
  • How can poor data security lead to the loss of information?
  • How to determine secure data.
  • What is the importance of centralized key management?

Unique IoT and Big Data Research Topics

Internet of Things has evolved and many devices are now using it. There are smart devices, smart cities, smart locks, and much more. Things can now be controlled by the touch of a button.

  • Evaluate the 5G networks and IoT.
  • Analyze the use of Artificial intelligence in the modern world.
  • How do ultra-power IoT technologies work?
  • Evaluate the adaptive systems and models at runtime.
  • How have smart cities and smart environments improved the living space?
  • The importance of the IoT-based supply chains.
  • How does smart agriculture influence water management?
  • The internet applications naming and identifiers.
  • How does the smart grid influence energy management?
  • Which are the best design principles for IoT application development?
  • The best human-device interactions for the Internet of Things.
  • The relation between urban dynamics and crowdsourcing services.
  • The best wireless sensor network for IoT security.
  • The best intrusion detection in IoT.
  • The importance of big data on the Internet of Things.

Big Data Database Research Topics You Should Try

Big data is broad and interesting. These big data database research topics will put you in a better place in your research. You also get to evaluate the roles of various phenomena.

  • The best cloud computing platforms for big data analytics.
  • The parallel programming techniques for big data processing.
  • The importance of big data models and algorithms in research.
  • Evaluate the role of big data analytics for smart healthcare.
  • How is big data analytics used in business intelligence?
  • The best machine learning methods for big data.
  • Evaluate the Hadoop programming in big data analytics.
  • What is privacy-preserving to big data analytics?
  • The best tools for massive big data processing
  • IoT deployment in Governments and Internet service providers.
  • How will IoT be used for future internet architectures?
  • How does big data close the gap between research and implementation?
  • What are the cross-layer attacks in IoT?
  • The influence of big data and smart city planning in society.
  • Why do you think user access control is important?

Big Data Scala Research Topics

Scala is a programming language that is used in data management. It is closely related to other data programming languages. Here are some of the best scala questions that you can research.

  • Which are the most used languages in big data?
  • How is scala used in big data research?
  • Is scala better than Java in big data?
  • How is scala a concise programming language?
  • How does the scala language stream process in real-time?
  • Which are the various libraries for data science and data analysis?
  • How does scala allow imperative programming in data collection?
  • Evaluate how scala includes a useful REPL for interaction.
  • Evaluate scala’s IDE support.
  • The data catalog reference model.
  • Evaluate the basics of data management and its influence on research.
  • Discuss the behavioral analytics process.
  • What can you term as the experience economy?
  • The difference between agile data science and scala language.
  • Explain the graph analytics process.

Independent Research Topics for Big Data

These independent research topics for big data are based on the various technologies and how they are related. Big data will greatly be important for modern society.

  • The biggest investment is in big data analysis.
  • How are multi-cloud and hybrid settings deep roots?
  • Why do you think machine learning will be in focus for a long while?
  • Discuss in-memory computing.
  • What is the difference between edge computing and in-memory computing?
  • The relation between the Internet of things and big data.
  • How will digital transformation make the world a better place?
  • How does data analysis help in social network optimization?
  • How will complex big data be essential for future enterprises?
  • Compare the various big data frameworks.
  • The best way to gather and monitor traffic information using the CCTV images
  • Evaluate the hierarchical structure of groups and clusters in the decision tree.
  • Which are the 3D mapping techniques for live streaming data.
  • How does machine learning help to improve data analysis?
  • Evaluate DataStream management in task allocation.
  • How is big data provisioned through edge computing?
  • The model-based clustering of texts.
  • The best ways to manage big data.
  • The use of machine learning in big data.

Is Your Big Data Thesis Giving You Problems?

These are some of the best topics that you can use to prosper in your studies. Not only are they easy to research but also reflect on real-time issues. Whether in University or college, you need to put enough effort into your studies to prosper. However, if you have time constraints, we can provide professional writing help. Are you looking for online expert writers? Look no further, we will provide quality work at a cheap price.

annotated bibliography topics

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Comment * Error message

Name * Error message

Email * Error message

Save my name, email, and website in this browser for the next time I comment.

As Putin continues killing civilians, bombing kindergartens, and threatening WWIII, Ukraine fights for the world's peaceful future.

Ukraine Live Updates

Big Data Analytics: A Literature Review Paper

  • Conference paper
  • Cite this conference paper

big data thesis paper

  • Nada Elgendy 20 &
  • Ahmed Elragal 20  

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8557))

Included in the following conference series:

  • Industrial Conference on Data Mining

9193 Accesses

166 Citations

18 Altmetric

In the information era, enormous amounts of data have become available on hand to decision makers. Big data refers to datasets that are not only big, but also high in variety and velocity, which makes them difficult to handle using traditional tools and techniques. Due to the rapid growth of such data, solutions need to be studied and provided in order to handle and extract value and knowledge from these datasets. Furthermore, decision makers need to be able to gain valuable insights from such varied and rapidly changing data, ranging from daily transactions to customer interactions and social network data. Such value can be provided using big data analytics, which is the application of advanced analytics techniques on big data. This paper aims to analyze some of the different analytics methods and tools which can be applied to big data, as well as the opportunities provided by the application of big data analytics in various decision domains.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save.

  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Unable to display preview.  Download preview PDF.

Similar content being viewed by others

big data thesis paper

Big Data Analytics—Analysis and Comparison of Various Tools

big data thesis paper

Big Data and Analytics—A Journey Through Basic Concepts to Research Issues

big data thesis paper

The Nexus Between Big Data and Decision-Making: A Study of Big Data Techniques and Technologies

Adams, M.N.: Perspectives on Data Mining. International Journal of Market Research 52(1), 11–19 (2010)

Article   Google Scholar  

Asur, S., Huberman, B.A.: Predicting the Future with Social Media. In: ACM International Conference on Web Intelligence and Intelligent Agent Technology, vol. 1, pp. 492–499 (2010)

Google Scholar  

Bakshi, K.: Considerations for Big Data: Architecture and Approaches. In: Proceedings of the IEEE Aerospace Conference, pp. 1–7 (2012)

Cebr: Data equity, Unlocking the value of big data. in: SAS Reports, pp. 1–44 (2012)

Cohen, J., Dolan, B., Dunlap, M., Hellerstein, J.M., Welton, C.: MAD Skills: New Analysis Practices for Big Data. Proceedings of the ACM VLDB Endowment 2(2), 1481–1492 (2009)

Cuzzocrea, A., Song, I., Davis, K.C.: Analytics over Large-Scale Multidimensional Data: The Big Data Revolution! In: Proceedings of the ACM International Workshop on Data Warehousing and OLAP, pp. 101–104 (2011)

Economist Intelligence Unit: The Deciding Factor: Big Data & Decision Making. In: Capgemini Reports, pp. 1–24 (2012)

Elgendy, N.: Big Data Analytics in Support of the Decision Making Process. MSc Thesis, German University in Cairo, p. 164 (2013)

EMC: Data Science and Big Data Analytics. In: EMC Education Services, pp. 1–508 (2012)

He, Y., Lee, R., Huai, Y., Shao, Z., Jain, N., Zhang, X., Xu, Z.: RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems. In: IEEE International Conference on Data Engineering (ICDE), pp. 1199–1208 (2011)

Herodotou, H., Lim, H., Luo, G., Borisov, N., Dong, L., Cetin, F.B., Babu, S.: Starfish: A Self-tuning System for Big Data Analytics. In: Proceedings of the Conference on Innovative Data Systems Research, pp. 261–272 (2011)

Kubick, W.R.: Big Data, Information and Meaning. In: Clinical Trial Insights, pp. 26–28 (2012)

Lee, R., Luo, T., Huai, Y., Wang, F., He, Y., Zhang, X.: Ysmart: Yet Another SQL-to-MapReduce Translator. In: IEEE International Conference on Distributed Computing Systems (ICDCS), pp. 25–36 (2011)

Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., Byers, A.H.: Big Data: The Next Frontier for Innovation, Competition, and Productivity. In: McKinsey Global Institute Reports, pp. 1–156 (2011)

Mouthami, K., Devi, K.N., Bhaskaran, V.M.: Sentiment Analysis and Classification Based on Textual Reviews. In: International Conference on Information Communication and Embedded Systems (ICICES), pp. 271–276 (2013)

Plattner, H., Zeier, A.: In-Memory Data Management: An Inflection Point for Enterprise Applications. Springer, Heidelberg (2011)

Book   Google Scholar  

Russom, P.: Big Data Analytics. In: TDWI Best Practices Report, pp. 1–40 (2011)

Sanchez, D., Martin-Bautista, M.J., Blanco, I., Torre, C.: Text Knowledge Mining: An Alternative to Text Data Mining. In: IEEE International Conference on Data Mining Workshops, pp. 664–672 (2008)

Serrat, O.: Social Network Analysis. Knowledge Network Solutions 28, 1–4 (2009)

Shen, Z., Wei, J., Sundaresan, N., Ma, K.L.: Visual Analysis of Massive Web Session Data. In: Large Data Analysis and Visualization (LDAV), pp. 65–72 (2012)

Song, Z., Kusiak, A.: Optimizing Product Configurations with a Data Mining Approach. International Journal of Production Research 47(7), 1733–1751 (2009)

TechAmerica: Demystifying Big Data: A Practical Guide to Transforming the Business of Government. In: TechAmerica Reports, pp. 1–40 (2012)

Van der Valk, T., Gijsbers, G.: The Use of Social Network Analysis in Innovation Studies: Mapping Actors and Technologies. Innovation: Management, Policy & Practice 12(1), 5–17 (2010)

Zeng, D., Hsinchun, C., Lusch, R., Li, S.H.: Social Media Analytics and Intelligence. IEEE Intelligent Systems 25(6), 13–16 (2010)

Zhang, L., Stoffel, A., Behrisch, M., Mittelstadt, S., Schreck, T., Pompl, R., Weber, S., Last, H., Keim, D.: Visual Analytics for the Big Data Era—A Comparative Review of State-of-the-Art Commercial Systems. In: IEEE Conference on Visual Analytics Science and Technology (VAST), pp. 173–182 (2012)

Download references

Author information

Authors and affiliations.

Department of Business Informatics & Operations, German University in Cairo (GUC), Cairo, Egypt

Nada Elgendy & Ahmed Elragal

You can also search for this author in PubMed   Google Scholar

Editor information

Editors and affiliations.

Institute of Computer Vision and Applied Computer Sciences, IBaI, Leipzig, Germany

Petra Perner

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper.

Elgendy, N., Elragal, A. (2014). Big Data Analytics: A Literature Review Paper. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2014. Lecture Notes in Computer Science(), vol 8557. Springer, Cham. https://doi.org/10.1007/978-3-319-08976-8_16

Download citation

DOI : https://doi.org/10.1007/978-3-319-08976-8_16

Publisher Name : Springer, Cham

Print ISBN : 978-3-319-08975-1

Online ISBN : 978-3-319-08976-8

eBook Packages : Computer Science Computer Science (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 26 December 2022

Systematic analysis of healthcare big data analytics for efficient care and disease diagnosing

  • Sulaiman Khan 1 ,
  • Habib Ullah Khan 1 &
  • Shah Nazir 2  

Scientific Reports volume  12 , Article number:  22377 ( 2022 ) Cite this article

7445 Accesses

14 Citations

7 Altmetric

Metrics details

  • Biotechnology
  • Computational biology and bioinformatics

Big data has revolutionized the world by providing tremendous opportunities for a variety of applications. It contains a gigantic amount of data, especially a plethora of data types that has been significantly useful in diverse research domains. In healthcare domain, the researchers use computational devices to extract enriched relevant information from this data and develop smart applications to solve real-life problems in a timely fashion. Electronic health (eHealth) and mobile health (mHealth) facilities alongwith the availability of new computational models have enabled the doctors and researchers to extract relevant information and visualize the healthcare big data in a new spectrum. Digital transformation of healthcare systems by using of information system, medical technology, handheld and smart wearable devices has posed many challenges to researchers and caretakers in the form of storage, minimizing treatment cost, and processing time (to extract enriched information, and minimize error rates to make optimum decisions). In this research work, the existing literature is analysed and assessed, to identify gaps that result in affecting the overall performance of the available healthcare applications. Also, it aims to suggest enhanced solutions to address these gaps. In this comprehensive systematic research work, the existing literature reported during 2011 to 2021, is thoroughly analysed for identifying the efforts made to facilitate the doctors and practitioners for diagnosing diseases using healthcare big data analytics. A set of rresearch questions are formulated to analyse the relevant articles for identifying the key features and optimum management solutions, and laterally use these analyses to achieve effective outcomes. The results of this systematic mapping conclude that despite of hard efforts made in the domains of healthcare big data analytics, the newer hybrid machine learning based systems and cloud computing-based models should be adapted to reduce treatment cost, simulation time and achieve improved quality of care. This systematic mapping will also result in enhancing the capabilities of doctors, practitioners, researchers, and policymakers to use this study as evidence for future research.

Similar content being viewed by others

big data thesis paper

Big data in digital healthcare: lessons learnt and recommendations for general practice

big data thesis paper

Data harnessing to nurture the human mind for a tailored approach to the child

big data thesis paper

Utilizing big data from electronic health records in pediatric clinical care

Introduction.

Healthcare around the world is under high pressure due to limiting financial resources, over-population, and disease burden. In this modern technological age the healthcare paradigm is shifting from traditional, one-size-fits-all approach to a focus on personalized individual care 1 . Additionally, the healthcare data is varying both in type and amount. The healthcare providers are not only dealing with patient’s historical, physical and namely information, but they also deal with imaging information, labs, and other digital and analogue information consists of ECG, MRI etc. This data is voluminous, varying in type and formats, and of differing structure. These are the capabilities of Big Data to handle not only different types of and forms of data, but can handle 10 V structure including volume, variety, venue, varifocal, varmint, vocabulary, validity, volatility, veracity and velocity. Thus, the doctors facing an increasing burden of rising patient numbers coupled with progressively less time to spend with each patient. In other words, we are facing more patients, more data, and less time.

Big data has significantly attracted the researchers to explore different research fields including healthcare, banking, imaging, smart cities, internet of things (IoT) based smart applications, tracking and transportation system etc. 2 . Software engineers constantly develops new applications for patient’s health and well-being. Both government and non-government organizations develop infrastructure using big data analytics for improved decision making capabilities of both doctors and managers 3 . It was recorded that 80% increase in big data is due to cloud sources, big data analytics, mobile technology and social media technologies 4 . A number of research articles proposed using big data analytics in varying domains especially in healthcare such as Kumar et al. 5 proposed a cognitive technology-based healthcare evaluations system using big data analytics. Chen et al. 6 presented an intelligent healthcare application for brain hemorrhage detection using Big Data analytics and machine learning (ML) techniques. Smart health appointment system is developed by Liang and Zhao using big data analytics is 7 .

Some researchers explored big data analytics in healthcare domain in different ways. They presented survey papers and review papers to understand the meanings of big data analytics in healthcare such as Galetsi and Kasaliasi performed a review of healthcare big data analytics 8 while Lindell defined big data analytics in terms of accounting and business perspectives 9 . Alharthi proposed a review article on healthcare challenges facing in Saudi Arabia by performing analysis of the available literature 10 . Lee et al. 11 presented a survey paper to explore the applications and challenges of healthcare big data analytics. From the literature it is concluded that multiple new applications are developed for big data analysis. Review and survey papers are presented to outline the published literature, but most of these papers are region specific or limited to a few numbers of papers. On the other side systematic review process formulate multiple research question and identifies keywords to explore the available literature from different angles. Systematic analysis of the available literature is presented in many fields like PMIPv6 domain 12 , in smart homes 13 , navigation assistants 14 , and many others, but there is no significant work reported on systematic analysis for healthcare big data domain to find the gaps in the available literature and suggest future research directions.

The inspirational point that led us to pursue this systematic analysis was the pervasive and ubiquitous nature of big data. Efficient management and timely execution are the dire needs of big data, to extract enriched information regarding a certain problem of interest 15 . Many factors involved behind this systematic research work, but the most eminent reasons are:

The exiting research reported on big data does not provide significant information about the key features that should be considered to integrate both structured and unstructured big data in healthcare domain. The pervasiveness of big data features challenging the researchers in pursuing research in this specialized domain. The underlying research on finding the key features will not only help in integrating big data in healthcare domain, but it will also assist in findings new gateways for future research directions.

Digital transformation of healthcare systems after the integration of information system, medical technology and other imaging systems have posed a big barrier for the research community in the form of a vast amount of information to deal with. While the over-population, limited data access, and disease burdens have restricted the doctors and practitioners to check more patients in a limited time. So, finding a suitable model that can efficiently process healthcare big data to extract information for a certain disease symptoms will not only helps the practitioners to suggest accurate medication and check more patients in timely manners, but it will open future research directions for the industrialists and policymakers to develop optimal healthcare big data processing models.

Accurate disease diagnosing by processing of gigantic amount of data, especially a plethora of types of data, within an interested processing domain is a key concern for both researchers and practitioners. Developing an efficient model that can accurately diagnose a certain by classifying images or other historical details of patients will not only helps the doctors to diagnose disease in timely manner and suggest medicine accordingly, but it will encourage the researchers and developers to develop an accurate disease identification model.

The remaining research paper of the paper is organized as follows. Section  2 of the paper outlines the related work reported in the proposed field. Section  3 presents the research framework followed for this systematic research work. Quality assessment is detailed in Sect.  4 . Section  5 outlines the discussion on findings of the proposed systematic research work. Section  6 provides the limitations of this systematic study traced by the conclusion and future work in Sect.  7 of the paper.

Literature review

From the last few decades, we experienced an unprecedented transformation of traditional healthcare systems to digital and portable healthcare applications with the help of information systems, medical technology and other imaging resources 16 . Big data are radically changing the healthcare system by encouraging the healthcare organizations to embrace extraction of relevant information from imaginary data and other clinical records. This information will produce high throughput in terms of accurate disease diagnosing, plummeting treatment cost increase availability. In data visualization context the term ‘big data’, is firstly introduced in 1997 17 , posed an ambitious and exceptional challenge for both policy-makers and doctors with special emphasis on personalized medicine. Nonetheless, data gathering moves faster than both data analysis and data processing, emphasizing the widening gap between the rapid technological progress in data acquisition and the comparatively slow functional characterization of healthcare information. In this regard, the historical information (phonotypical and other genomic information) of an individual patient form electronic health records (EHR) are becoming of critical importance. Figure  1 represents the primary sources of big data.

figure 1

Main steps of the research protocol.

Significant research work has been reported in the domains of healthcare big data analytics. To process this vast amount of information in timely manner and identify someone’s health condition based on his her is more difficult. Researchers proposed numerous applications to address this problem such as; Syed et al. 18 proposed a machine learning-based healthcare system for providing remote healthcare services to both diseased and healthy population using big data analytics and IoT devices. Venkatesh et al. 19 developed heart disease prediction model using big data analytics and Naïve Bayes classification technique. Kaur et al. 20 suggested a machine learning (ML) based healthcare application for disease diagnosing and data privacy restrictions. This model works by considering different aspects like activity monitoring, granular access control and mask encryption. Some researchers presented review and survey papers to outline the recent published work in a specific directions such as Patel and Gandhi reviewed the literature for identifying the machine learning approaches proposed for healthcare big data analytics 21 . Rumbold et al. 22 reviewed the literature for find the research work reported for diabetic diagnosing using big data analytics.

From the above discussions, it is worth mentioning that most of the researchers and industrialists gave significant attention towards the development of new computational models or surveyed the literature in a specific research direction (heart disease detection, diabetes detection, storage and security analysis etc.), but no significant research work is reported to systematically analyze the literature with different perspectives. To address this problem, this research work presents a systematic literature review (SLR) work to analyze the literature reported in healthcare big data analytics domain. This systematic analysis will not only find the gaps in the available literature but it will also suggest new directions of future research to explore.

Research framework

Systematic literature reviews and meta-analysis has gained significant attention and became increasingly important in healthcare domain. Clinicians, developers and researchers follow SLR studies to get updated about new knowledge reported in their fields 23 , 24 , and they are often followed as a starting point for preparing basic records. Granting agencies mostly requires SLR studies to ensure justification of further research 25 , and even some healthcare journals follows this direction 26 . Keeping these SLR applications in mind the proposed systematic analysis is performed following the guidelines presented by Moher et al. 27 (PRISMA) and Kitchenham et al. 28 . This SLR work accumulates the most relevant research work from primary sources. These papers are then evaluated and analyzed to grab the best results for the selected research problem. Figure  2 represents the results after following the PRISMA guidelines. This systematic analysis are performed using the following preliminary steps:

Identification of research questions to systematically analyze the proposed field from different perspectives.

Selection of relevant keywords and queries to download the most relevant research articles.

Selection of peer-reviewed online databases to download relevant research articles published in healthcare big data domain during the period ranging from 2011 – 2021.

Perform inclusion and exclusion process based on title, abstract and the contents presented in the article to remove duplicate records.

Assess the finalized relevant articles for identifying gaps in the available literature and suggest new research directions to explore.

figure 2

PRISMA process model for articles accumulation, screening, and final selection.

Research questions

Selecting a well-constructed research question(s) is essential for a successful review process. We formulate a set of five research questions based on the Goal Questions Metrics approach proposed by Van Solingen et al. 29 . The formulated research questions are depicted in Table 1 below.

Search strategy

Search strategy is the key step in any systematic research work because this is the step that ensures the most relevant article for the analysis and the assessment process. To define a well-organized search strategy a search string is developed using the formulated relevant keywords. For the accumulation of most relevant articles for a certain research problem, only keywords are not sufficient. These keywords are concatenated in different strings for searching articles in multiple online repositories 30 . Inspired from the SLR work of Achimugu et al. 31 , in software requirement domain, our search strategy consists of four main steps includes identification of keywords relevant to selected research problem, formulation of search string based on the keywords, and selection of online repositories to accumulate relevant articles to the problem selected.

Selection of keywords

List of keywords are defined for each research question to download all relevant articles. Some researchers defined a generic query 32 and starts downloading articles. Although it is simple for the accumulation of articles from online database but mostly it tends to skip some most relevant articles. So, the correct option is to define keywords for each research question. In fact, it is a hectic job, but it ensures the retrieval of each relevant article from online databases regarding a certain research problem.

Formulation of search string

Search strings (queries) are formulated using the keywords identified from the selected research questions. The search string is tested in online databases and was modified according to retrieve each relevant articles from these databases. Inspired from the guidelines proposed by Wohlin 33 , following are the key steps undertaken to develop an optimal search string:

Identification of key terms from the formulated topic and research questions

Selection of alternate words or synonyms for key terms

Use “OR” operator for alternating words or synonyms during query formation

Link all major terms with Boolean “AND” operator to validate every single keyword.

Following all these preliminary steps a generic query/search-string is developed that is depicted in Table 2 . This generic query is further refined for each research question as depicted in Table 3 to retrieve each relevant article.

Selection of online repositories

After identifying keywords and formulating search strings the next step is to download relevant articles specific to the interested research problem. For the accumulation of relevant articles six well-known and peer-reviewed online repositories are selected, as depicted in Table 3 .

Articles accumulation and final database development

For relevant articles accumulation and final database development we followed the guidelines suggested by Kable et al. 34 . After specifying the research questions, identifying keywords, and formulating search queries, and selecting online repositories, the next key step is to develop a relevant articles database for the analysis and assessment purposes that includes three prime steps: (1) identification of inclusion/exclusion criteria for a certain research article(s), and (2) Relevant articles database development. These steps are discussed in detail below.

Inclusion and exclusion criteria

After selecting online database and starts the articles downloading process, the most tedious task that the author (s) facing, is the decision about whether a certain paper should be included in the final database or not? To overcome this problem an inclusion and exclusion criteria is defined for the inclusion of a certain article in the final set of articles. Table 4 represents the inclusion and exclusion criteria followed for this systematic research work.

A manual process is followed by the authors for the inclusion and exclusion of a certain article. These articles are evaluated based on title, abstract and information provided in the overall paper. If more than half authors agree upon the inclusion of a certain article based on these parameters (title, abstract, and contents presented in the article), then that paper was counted in the final database otherwise rejected. A total of 134 relevant primary studies are selected for the final assessment process. To ensure no skip of relevant article snowballing is applied to retrieve each relevant article.

Snowballing To extract each relevant primary article snowballing is applied in the proposed research work 33 . In this systematic analysis both types of snowballing backward and forward snowballing is applied to ensure extraction of each relevant primary article. 145 relevant articles retrieved after applying snowballing process. These articles are then filtered by title and resulted for 53 relevant articles. After further processing by abstract resulted into 19 articles, and at last when filtered by contents presented in the paper resulted into only 5 relevant articles. This overall process is depicted in Fig.  3 . After adding these articles to the accumulated relevant articles, a total of 139 articles added to the final database.

figure 3

Extraction of each relevant article using snowballing.

Relevant articles database development

After accumulating each primary article reported in the proposed field, a database of relevant articles is developed for the assessment and analysis work, to find the current available trends in healthcare big data analytical domain and investigate the gaps in these research articles to open new gates for future research work. A total of 139 relevant articles are added to the final database. The overall contribution of the selected online repositories in the relevant articles database development is depicted in Fig.  4 .

figure 4

Distribution of primary studies.

From Fig.  4 , it is concluded that IEEE Xplore and Science Direct contributing the more that reflects the interest of research community to present their work with.

After developing a database of relevant articles, it is evaluated using different parameters like type of article (conference proceedings, journal article, book chapter etc.), publication year, and contribution of individual library. Figure  5 represents the information regarding the total contribution of articles by type in the final database.

figure 5

Evolution of final database by type of article and year.

Figure  5 concludes that the researchers paid significant attention towards the development of new healthcare systems instead of finding the gaps in the available systems and develop enhanced solutions accordingly. This enhanced solution can accurately identify and diagnose a certain disease based on patient’s historical medical information. A small amount of work is reported using review articles, survey papers, but no systematic mechanism is followed to analyse the work in specific range of years followed by a set of research questions. The same problem can also be seen from Fig.  6 where highest percentage contribution is shown more comparative to book sections, conference papers etc.

figure 6

Percentage contribution by type of paper.

Figure  7 depicts the percentage contribution of each library in the proposed assessment work.

figure 7

Percentage contribution of each library.

Figure  8 represents the annual distribution of articles selected for the analysis and assessment purposes. Form Fig.  8 it is evident, that with passage of time number of articles increases, and that shows the maturity and interest of the researchers in this specific domain.

figure 8

Annual distribution of articles.

From Fig.  8 , it is concluded that IEEE Xplore contributing the more in the final database of relevant articles that shows the trend of researchers to present healthcare relevant works in the IEEE journals. Figure  9 represents the total number of journal articles, survey papers, conference papers, and book sections in the selected relevant articles database.

figure 9

Evolution of database by number of articles by type.

From Fig.  9 it is concluded that significant attention is given towards the development of new healthcare models. This shows the maturity of the proposed field. Dealing with such a mature field and extracting useful information is hectic job for the researchers. A systematic analysis of this research field is needed to provide an overview of the work reported during a specific range of years. This analysis will not only save precious time of the researchers, but it will also open gates for the future research work in this field.

Table 5 represents the annual contribution of studies in the final relevant database.

Overall information regarding type of paper, publication year and number of records is depicted in Fig.  10 below.

figure 10

Evolution of final database.

Quality assesment

After executing exclusion and inclusion process, all the relevant articles in the database are manually assessed by authors to check the relevancy of each article with the selected research problem. A quality criterion is defined to check every research article against the formulated research questions. This quality criteria is defined in Table 6 .

Weighted values are assigned against each quality criteria to check the relevancy of an article with a certain research question. These weighted values and description is depicted in Fig.  11 .

figure 11

Quality criteria for the proposed SLR work.

After the assessment process, the relevancy of each article is decided based on its aggregated weighting score. If the score is greater than 3 it represents the most relevancy of an article to the selected research topic. Figure  12 represents the aggregate score values of each article based on the defined quality assessment criteria.

figure 12

Quality assessment process.

Results and discussion

After executing the quality assessment work, the next key step of an SLR work is, to analyse all the relevant article to identify different techniques proposed for efficient communication between patient and practitioner, accurate feature extraction from healthcare big data and implement it in practical use.

This section of the paper performs a descriptive analysis of each article based on five research questions. In this systematic review process, a total of 139 research articles published during the period ranging from 2011 to 2021.

Healthcare big data

The researcher and data analysts suggested no contextual name for “big data” in healthcare, but for implementation and interpretation purposes they divided it into 5 V architecture. Figure  13 depicts a 5 V architecture of big data.

figure 13

Big Data 5Vs 15 .

The exponential increase in IoT-based smart devices and information systems resulted a plethora of information in healthcare domain. This information increases exponentially on daily basis. These smart IoT based healthcare devices produces a huge of data. An alternated term “Big Data” is selected for this gigantic amount of data. This is the data for which scale, diversity, and complexities require innovative structure, variables, design, and analytics for efficient utilization and management, accurate data extraction and visualization, and to grab hidden stored information regarding a specific problem of interest. Main idea behind the implementation of healthcare big data analytics is to retrieve enriched information from huge amount of data using different machine leering and data mining techniques 191 . These techniques help in improving quality of care, reducing cost of care, and helps the practitioners to suggest medicines based on clinical historical information.

RQ1. What are the key features adapted to integrate the structured and unstructured data in healthcare big data domain?

Big data comprises a huge amount of data to be processed, especially a plethora of types of data to process and extract enriched information regarding a problem of interest. Several features are assessed and analyzed especially in healthcare domain, to integrate both structural and non-structural data. Multiple researchers analyzed semantic based big data features for big data integration purposes while some researchers proposed behavior and structural based features for patient monitoring and activity management purposes 151 , 192 . While some performed real-time analysis using a group of people for data integrating and clustering purposes. Table 7 enlists the research work published for the structural and non-structural data integration purposes.

After analysing the available literature in Table 8 , it was concluded that mostly semantic based, structure-based, and real-time activity-based features are considered for the information extraction and organization purposes. If we consider geometric based feature and adapt clustering mechanism for data organization purposes, then this will not only integrate both structural and non-structural data efficiently, but it will improve the simulation capabilities of different applications.

RQ2. What are different techniques proposed to provide an easy and timely data-access interface for doctors?

Digital transformation of healthcare systems by using of information system, medical technology, handheld and smart wearable devices has posed many challenges for both the researchers and caretakers in the form of storage, dropping the cost of care and processing time (to extract relevant information for refining quality of care and reduce waste and error rates). Prime goal of healthcare big data analytics is, to process this vast amount of data using machine learning and other processing models to extract certain problem relevant information and use it for human well beings 195 . Several supervised and unsupervised classification techniques are followed for the said purposes. ML-based architectures and big data analytical techniques are integrated in healthcare domain for efficient information retrieval and exchange purposes, risk analysis, optimum decision-support system in clinics, and suggesting precise medicines using genomic information 196 . Table 8 represent the literature reported for the providence of an easy and timely data-access interface for the practitioners.

RQ3. What are different ways to improve communication between the doctor and patient?

Healthcare around the world is under high pressure due to limiting financial resources, over-population, and disease burden. In this modern technological age, the healthcare paradigm is shifting from traditional, one-size-fits-all approach to a focus on personalized individual care 1 . Additionally, the healthcare data is varying both in type and amount. The healthcare providers are not only dealing with patient’s historical, physical and namely information, but they also deal with imaging information, labs, and other digital and analogue information consists of ECG, MRI etc. This data is voluminous, varying in type and formats, and of differing structure. These are the capabilities of Big Data to handle not only different types of and forms of data, but can handle 5 V structure including volume, variety, value, veracity, and velocity. Thus, the doctors facing an increasing burden of rising patient numbers coupled with progressively less time to spend with each patient. In other words, we are dealing with more patients, more data, and less time.

Different techniques are proposed in the literature to provide an easy and timely communication interface for both doctors and patients. Table 9 depicts different information exchange tools/techniques reported in the literature.

RQ4. What are different types of classification models proposed for accurate disease diagnosing using patient historical information?

This research question aims to outline different disease diagnosing models proposed in the literature using healthcare big data. Around the world diverse approaches are proposed by researchers for healthcare big data analysis to ensure accurate disease diagnosing capabilities, provide healthcare facilities at doorstep, development of eHealth and mHealth applications, and many others. Multiple statistical and ML-based approaches proposed for accurate diagnosing purposes. Figure  14 represents multiple techniques proposed for automatic disease diagnosing purposes using healthcare big data domain.

figure 14

Multiple disease diagnosing techniques proposed in the literature.

All these techniques perform the diagnosing process using semantic-based features or structural based features. But no attention is given towards geometric feature extraction techniques that are prominent in extracting enriched information from data and results in high identification rates. Also, no advanced hybrid neural network and shallow architectures are proposed for the automatic diagnosing purposes. Keeping these gaps in mind, an optimum eHealth application can be developed by applying these hybrid techniques.

RQ5. What are different applications of big data analytics in healthcare domain?

Big data analytics has revolutionized our lives by presenting many state of the art applications in various domains ranging from eHealth to mHealth, weather forecasting to climate changes, traffic management to object detection, and many others. This research question mainly focusing on enlisting different applications of big data analytics in Table 10 .

Limitations

This article has a number of limitations. Some of these limitations are listed below.

For this systematic analysis articles are only accumulated from six different peer-reviewed libraries (ACM, SpringerLink, Taylor & Francis, Science Direct = IEEE Xplore, and Wiley online library), but there exist a number of multi-disciplinary databases for articles accumulation purposes.

This systematic analysis covers a specific range of years (2011 –2021), while a number of articles are reporting on daily basis.

Articles are accumulated from online libraries using search queries, so if a paper has no matching words to the query, then it was skipped during search process.

Google Scholar is skipped during the articles accumulation phase to shorten the searching time. Also, it gives access to both peer-reviewed and non-peer-reviewed journals and we only focused on peer-reviewed journals for the relevant articles.

Being a systematic literature work it can be broadened to grab the knowledge about other varying topics such as healthcare data commercialization, health sociology etc.

Besides these limitations we hope that this systematic research work will be an inspiration for future research in the recommended fields and will open gates for both industrialists and policymakers.

Conclusion and future work

In this research article, the existing research reported during 2011 to 2021 is thoroughly analysed for the efforts made by researchers to help caretakers and clinicians to make authentic decisions in disease diagnosing and suggest medicines accordingly. Based on the research problem and underlying requirements, the researchers proposed several feature extraction, identification, and remote communication frameworks to develop doctor and patient communication in a timely fashion. These real-time or nearer to real-time applications mostly use big data analytics and computational devices. This research work identified several key features and optimum management designs proposed in healthcare big data analytical domain to achieve effective outcomes in disease diagnosing. The results of this systematic work suggests that advanced hybrid machine learning-based models and cloud computing application should be adapted to reduce treatment cost, simulation time, and achieve improved quality of care. The findings of this research work will not only help the policymakers to encourage the researchers and practitioners to develop advanced disease diagnosing models, but it will also assist in presenting an improved quality of treatment mechanism for patients.

Advanced hybrid machine learning architectures for cognitive computing are considered as the future toolbox for the data-driven analysis of healthcare big data. Also, geometric-based features must be considered for feature extraction purposes instead of semantic and structural-based features. These geometric-based feature extraction techniques will not only reduce the simulation time, but it will also improve the identification and disease diagnosing capabilities of smart health devices. Additionally, these features can help in accurate identification of Alzheimer, tumours in PET or MRI images using upgraded machine learning and big data analytics. Cluster-based mechanism should be considered for data organization purposes to improve big data timely-access and easy-management capabilities. Promoting research in these areas will be crucial for future innovation in healthcare domain.

Data availability

The data used and/or analyzed during the current study available from the corresponding author on reasonable request.

Rahman, F. & Slepian, M. J. Application of big-data in healthcare analytics—Prospects and challenges. In 2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI) 13–16 (2016).

Khan, N. et al. Big data: Survey, technologies, opportunities, and challenges. Sci. World J. 2014 , 1–18 (2014).

Google Scholar  

Groves, P., Kayyali, B., Knott, D. & Van Kuiken, S. The ‘big data ‘revolution in healthcare. In McKinsey Quarterly (2013).

Andreu-Perez, J., Poon, C. C., Merrifield, R. D., Wong, S. T. & Yang, G.-Z. Big data for health. IEEE J. Biomed. Health Inform. 19 , 1193–1208 (2015).

Article   Google Scholar  

Kumar, M. A., Vimala, R. & Britto, K. A. A cognitive technology based healthcare monitoring system and medical data transmission. Measurement 146 , 322–332 (2019).

Article   ADS   Google Scholar  

Chen, H., Khan, S., Kou, B., Nazir, S., Liu, W. & Hussain, A. A smart machine learning model for the detection of brain hemorrhage diagnosis based internet of things in smart cities. Complexity 2020 (2020).

Liang, Y. & Zhao, L. Intelligent hospital appointment system based on health data bank. Procedia Comput. Sci. 159 , 1880–1889 (2019).

Galetsi, P. & Katsaliaki, K. A review of the literature on big data analytics in healthcare. J. Oper. Res. Soc. 1–19 (2019).

Lindell, J. What are big data and analytics?. In Analytics and Big Data for Accountants (2018).

Alharthi, H. Healthcare predictive analytics: An overview with a focus on Saudi Arabia. J. Infect. Public Health 11 , 749–756 (2018).

Lee, C. et al. "Big healthcare data analytics: Challenges and applications. In Handbook of Large-Scale Distributed Computing in Smart Healthcare 11–41 (Springer, 2017).

Chapter   Google Scholar  

Hussain, A., Nazir, S., Khan, S. & Ullah, A. Analysis of PMIPv6 extensions for identifying and assessing the efforts made for solving the issues in the PMIPv6 domain: A systematic review. Comput. Netw. 179 , 107366 (2020).

Khan, H.-U. et al. Systematic analysis of safety and security risks in smart homes. Comput. Mater. Contin. 68 , 1409–1428 (2021).

Khan, S., Nazir, S. & Khan, H.-U. Analysis of navigation assistants for blind and visually impaired people: A systematic review. IEEE Access 9 , 26712–26734 (2021).

Nazir, S. et al. A comprehensive analysis of healthcare big data management, analytics and scientific programming. IEEE Access 8 , 95714–95733 (2020).

Kitchin, R. Big Data, new epistemologies and paradigm shifts. Big Data Soc. 1 , 2053951714528481 (2014).

Cox, M. & Ellsworth, D. Application-controlled demand paging for out-of-core visualization. In Proceedings. Visualization’97 (Cat. No. 97CB36155) 235–244 (1997).

Syed, L., Jabeen, S., Manimala, S. & Elsayed, H. A. Data science algorithms and techniques for smart healthcare using IoT and big data analytics. In Smart Techniques for a Smarter Planet 211–241 (Springer, 2019).

Venkatesh, R., Balasubramanian, C. & Kaliappan, M. Development of big data predictive analytics model for disease prediction using machine learning technique. J. Med. Syst. 43 , 272 (2019).

Article   CAS   Google Scholar  

Kaur, P., Sharma, M. & Mittal, M. Big data and machine learning based secure healthcare framework. Procedia Comput. Sci. 132 , 1049–1059 (2018).

Patel, H. B. & Gandhi, S. A review on big data analytics in healthcare using machine learning approaches. In 2018 2nd International Conference on Trends in Electronics and Informatics (ICOEI) 84–90 (2018).

Rumbold, J. M. M., O’Kane, M., Philip, N. & Pierscionek, B. K. Big Data and diabetes: The applications of Big Data for diabetes care now and in the future. Diabetic Med. (2019).

Oxman, A. D. et al. Users’ guides to the medical literature: VI. How to use an overview. JAMA 272 , 1367–1371 (1994).

Swingler, G. H., Volmink, J. & Ioannidis, J. P. Number of published systematic reviews and global burden of disease: database analysis. BMJ 327 , 1083–1084 (2003).

Research, C. I. O. H. Randomized controlled trials registration/application checklist (12/2006). Available at: http://www.cihr-irsc.gc.ca/e/documents/rct_reg_e.pdf . Accessed 22 June 2009.

Young, C. & Horton, R. Putting clinical trials into context. Lancet 366 , 107–107 (2005).

P. Group, Moher, D., Liberati, A., Tetzlaff, J. & Altman, D. G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Med. 6 , e1000097 (2009).

Kitchenham, B. & Charters, S. Guidelines for performing systematic literature reviews in software engineering (2007).

Van Solingen, R., Basili, V., Caldiera, G. & Rombach, H. D. Goal question metric (gqm) approach. Encycl. Softw. Eng. (2002).

Brereton, P., Kitchenham, B. A., Budgen, D., Turner, M. & Khalil, M. Lessons from applying the systematic literature review process within the software engineering domain. J. Syst. Softw. 80 , 571–583 (2007).

Achimugu, P., Selamat, A., Ibrahim, R. & Mahrin, M. N. R. A systematic literature review of software requirements prioritization research. Inf. Softw. Technol. 56 , 568–585 (2014).

Nazir, S., Ali, Y., Ullah, N. & García-Magariño, I. Internet of things for healthcare using effects of mobile computing: A systematic literature review. Wirel. Commun. Mobile Comput. 109 , 5931315 (2019).

Wohlin, C. Guidelines for snowballing in systematic literature studies and a replication in software engineering. In Proceedings of the 18th International Conference on Evaluation and Assessment in Software Engineering 1–10 (2014).

Kable, A. K., Pich, J. & Maslin-Prothero, S. E. A structured approach to documenting a search strategy for publication: A 12 step guideline for authors. Nurse Educ. Today 32 , 878–886 (2012).

Helmer, A., Kretschmer, F., Müller, F., Eichelberg, M., Deparade, R., Tegtbur, U. et al. Integration of medical models in personal health records using the example of rehabilitation training for cardiopulmonary patients. In 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI) 1887–1892 (2011).

Tian, M. Integrated feature based medical image retrieval. In 2011 International Conference on Control, Automation and Systems Engineering (CASE) 1–3 (2011).

Chaves, R., Ramírez, J., Górriz, J. M., Illán, I. A. & Salas-Gonzalez, D. FDG and PIB biomarker PET analysis for the Alzheimer’s disease detection using Association Rules. In 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC) 2576–2579 (2012).

Chute, C. G. Obstacles and options for big-data applications in biomedicine: The role of standards and normalizations. In 2012 IEEE International Conference on Bioinformatics and Biomedicine (2012).

Goel, A. & Chandra, N. A prototype model for secure storage of medical images and method for detail analysis of patient records with PACS. In 2012 International Conference on Communication Systems and Network Technologies 167–170 (2012).

Huang, H. & Hsiao, I. Use of anatomical information in a Bayesian reconstruction with an edge-preserving median prior. In 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC) 3321–3323 (2012).

López, C. M., Welkenhuysen, M., Musa, S., Eberle, W., Bartic, C., Puers, R. et al. Towards a noise prediction model for in vivo neural recording. In 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society 759–762 (2012).

Ng, H., Chuang, C. & Hsu, C. Extraction and analysis of structural features of lateral ventricle in brain medical images. In 2012 Sixth International Conference on Genetic and Evolutionary Computing 35–38 (2012).

Patel, A. B., Birla, M. & Nair, U. Addressing big data problem using Hadoop and Map Reduce. In 2012 Nirma University International Conference on Engineering (NUiCONE) 1–5 (2012).

Zheng, G., Yu, L., Feng, Y., Han, Z., Chen, L., Zhang, S. et al. Seizure prediction model based on method of common spatial patterns and support vector machine. In 2012 IEEE International Conference on Information Science and Technology 29–34 (2012).

Li, L., Bagheri, S., Goote, H., Hasan, A. & Hazard, G. Risk adjustment of patient expenditures: A big data analytics approach. In 2013 IEEE International Conference on Big Data 12–14 (2013).

Loshin, D. Chapter 8—Developing big data applications. In Big Data Analytics (ed. Loshin, D.) 73–81 (Morgan Kaufmann, 2013).

Chapter   MATH   Google Scholar  

Loshin, D. Chapter 9—NoSQL data management for big data. In Big Data Analytics (ed. Loshin, D.) 83–90 (Morgan Kaufmann, 2013).

Loshin, D. Chapter 1—Market and business drivers for big data analytics. In Big Data Analytics (ed. Loshin, D.) 1–9 (Morgan Kaufmann, 2013).

MATH   Google Scholar  

Purkayastha, S. & Braa, J. Big data analytics for developing countries–Using the cloud for operational BI in health. Electron. J. Inf. Syst. Dev. Ctries. 59 , 1–17 (2013).

Lin, C.-H., Huang, L.-C., Chou, S.-C. T., Liu, C.-H., Cheng, H.-F. & Chiang, I. J. Temporal event tracing on big healthcare data analytics. In 2014 IEEE International Congress on Big Data 281–287 (2014)

Martínez, J. G., Ramos-Becerril, F. J., Leija, L., López, F., García, U., Vera, A. et al. Development of an electronic equipment for the pre medical diagnose in the progress of diabetic foot disease. In 2014 International Conference on Control, Decision and Information Technologies (CoDIT) 679–683 (2014).

Mian, M., Teredesai, A., Hazel, D., Pokuri, S. & Uppala, K. Work in progress—In-memory analysis for healthcare big data. In 2014 IEEE International Congress on Big Data 778–779 (2014).

Panahiazar, M., Taslimitehrani, V., Jadhav, A. & Pathak, J. Empowering personalized medicine with big data and semantic web technology: Promises, challenges, and use cases. In 2014 IEEE International Conference on Big Data (Big Data) 790–795 (2014).

Vargheese, R. Dynamic protection for critical health care systems using cisco CWS: Unleashing the power of big data analytics. In 2014 Fifth International Conference on Computing for Geospatial Research and Application 77–81 (2014).

Archenaa, J. & Anita, E. A. M. A survey of big data analytics in healthcare and government. Procedia Comput. Sci. 50 , 408–413 (2015).

Boman, M. & Sanches, P. Sensemaking in intelligent health data analytics. KI Künstliche Intell. 29 , 143–152 (2015).

Chong, D. & Shi, H. Big data analytics: A literature review. J. Manag. Anal. 2 , 175–201 (2015).

Dantanarayana, G., Sahama, T. & Wikramanayake, G. Quality of information for quality of life: Healthcare big data analytics. In 2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer) 281–281 (2015).

Gomathi, S. & Narayani, V. Implementing big data analytics to predict systemic lupus erythematosus. In 2015 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS) 1–5 (2015).

Hussain, S. & Lee, S. Semantic transformation model for clinical documents in big data to support healthcare analytics. In 2015 Tenth International Conference on Digital Information Management (ICDIM) 99–102 (2015).

Kuo, M., Chrimes, D., Moa, B. & Hu, W. Design and construction of a big data analytics framework for health applications. In 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity) 631–636 (2015).

Mehmood, R. & Graham, G. Big data logistics: A health-care transport capacity sharing model. Procedia Comput. Sci. 64 , 1107–1114 (2015).

Raj, P., Raman, A., Nagaraj, D. & Duggirala, S. Big data analytics for healthcare. In High-Performance Big-Data Analytics Computer Communications and Networks 1525–1525 (Springer, Cham, 2015).

Viceconti, M., Hunter, P. & Hose, R. Big data, big knowledge: Big data for personalized healthcare. IEEE J. Biomed. Health Inform. 19 , 1209–1215 (2015).

Wang, M. D. Biomedical big data analytics for patient-centric and outcome-driven precision health. In 2015 IEEE 39th Annual Computer Software and Applications Conference 1–2 (2015).

Batarseh, F. A. & Latif, E. A. Assessing the quality of service using big data analytics: With application to healthcare. Big Data Res. 4 , 13–24 (2016).

Buzzi, M. C. et al. Facebook: A new tool for collecting health data?. Multimed. Tools Appl. 76 , 10677–10700 (2016).

Chauhan, R., Jangade, R. & Mudunuru, V. K. A cloud based environment for big data analytics in healthcare. In International Conference on Soft Computing and Pattern Recognition 315–321 (2016).

Stefano, A. D., Corte, A. L., Lió, P. & Scatá, M. Bio-inspired ICT for big data management in healthcare. In Intelligent Agents in Data-intensive Computing 1–26 (Springer, 2016).

Gupta, S. & Tripathi, P. An emerging trend of big data analytics with health insurance in India. In 2016 International Conference on Innovation and Challenges in Cyber Security (ICICCS-INBUSH) 64–69 (2016).

Haas, M. et al. Big data to smart data in Alzheimer’s disease: Real-world examples of advanced modeling and simulation. Alzheimers Dement. 12 , 1022–1030 (2016).

Jiang, P. et al. An intelligent information forwarder for healthcare big data systems with distributed wearable sensors. IEEE Syst. J. 10 , 1147–1159 (2016).

Kankanhalli, A., Hahn, J., Tan, S. & Gao, G. Big data and analytics in healthcare: Introduction to the special section. Inf. Syst. Front. 18 , 233–235 (2016).

Kashyap, H., Ahmed, H. A., Hoque, N., Roy, S. & Bhattacharyya, D. K. Big data analytics in bioinformatics: Architectures, techniques, tools and issues. Netw. Model. Anal. Health Inform. Bioinform. 5 , 28 (2016).

Lv, Z., Chirivella, J. & Gagliardo, P. Bigdata oriented multimedia mobile health applications. J. Med. Syst. 40 , 120 (2016).

Pandey, M. K. & Subbiah, K. A novel storage architecture for facilitating efficient analytics of health informatics big data in cloud. In 2016 IEEE International Conference on Computer and Information Technology (CIT) 578–585 (2016).

Plachkinova, M., Vo, A., Bhaskar, R. & Hilton, B. A conceptual framework for quality healthcare accessibility: A scalable approach for big data technologies. Inf. Syst. Front. 20 , 289–302 (2016).

Rallapalli, S., Gondkar, R. R. & Ketavarapu, U. P. K. Impact of processing and analyzing healthcare big data on cloud computing environment by implementing hadoop cluster. Procedia Comput. Sci. 85 , 16–22 (2016).

Sakr, S. & Elgammal, A. Towards a comprehensive data analytics framework for smart healthcare services. Big Data Res. 4 , 44–58 (2016).

Xu, B. et al. Healthcare data analytics: Using a metadata annotation approach for integrating electronic hospital records. J. Manag. Anal. 3 , 136–151 (2016).

Tresp, V. et al. Going digital: A survey on digitalization and large-scale data analytics in healthcare. Proc. IEEE 104 , 2180–2206 (2016).

Straton, N., Hansen, K., Mukkamala, R. R., Hussain, A., Gronli, T., Langberg, H. et al. Big social data analytics for public health: Facebook engagement and performance. In 2016 IEEE 18th International Conference on e-Health Networking, Applications and Services (Healthcom) 1–6 (2016).

Abouelmehdi, K., Beni-Hssane, A., Khaloufi, H. & Saadi, M. Big data security and privacy in healthcare: A review. Procedia Comput. Sci. 113 , 73–80 (2017).

Alonso, S. G., de la Torre, Diez I., Rodrigues, J. J., Hamrioui, S. & Lopez-Coronado, M. A systematic review of techniques and sources of big data in the healthcare sector. J. Med. Syst. 41 , 183 (2017).

Anjum, A. et al. Big data analytics in healthcare: A cloud-based framework for generating insights. In Cloud Computing 153–170 (Springer, 2017).

Barik, R. K., Dubey, H. & Mankodiya, K. SOA-FOG: Secure service-oriented edge computing architecture for smart health big data analytics. In 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 477–481 (2017).

Cano, I., Tenyi, A., Vela, E., Miralles, F. & Roca, J. Perspectives on big data applications of health information. Curr. Opin. Syst. Biol. 3 , 36–42 (2017).

A. Di Meglio and M. Manca, "From Big Data to Big Insights: The Role of Platforms in Healthcare IT," in New Perspectives in Medical Records, ed: Springer, 2017, pp. 33–47.

Manogaran, G. et al. Big data analytics in healthcare Internet of Things. In Innovative Healthcare Systems for the 21st Century 263–284 (Springer, 2017).

Plageras, A. P., Stergiou, C., Kokkonis, G., Psannis, K. E., Ishibashi, Y., Kim, B. et al. Efficient large-scale medical data (eHealth Big Data) analytics in Internet of Things. In 2017 IEEE 19th Conference on Business Informatics (CBI) 21–27 (2017).

Pramanik, M. I., Lau, R. Y. K., Demirkan, H. & Azad, M. A. K. Smart health: Big data enabled health paradigm within smart cities. Expert Syst. Appl. 87 , 370–383 (2017).

Spanoudakis, G., Katrakazas, P., Koutsouris, D., Kikidis, D., Bibas, A. & Pontopidan, N. H. Public health policy for management of hearing impairments based on big data analytics: EVOTION at genesis. In 2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE) 525–530 (2017).

Wu, J., Li, H., Liu, L. & Zheng, H. Adoption of big data and analytics in mobile healthcare market: An economic perspective. Electron. Commer. Res. Appl. 22 , 24–41 (2017).

Aceto, G., Persico, V. & Pescape, A. The role of Information and Communication Technologies in healthcare: Taxonomies, perspectives, and challenges. J. Netw. Comput. Appl. 107 , 125–154 (2018).

Antoniou, C., Dimitriou, L. & Pereira, F. Mobility Patterns, Big Data and Transport Analytics: Tools and Applications for Modeling (Elsevier, 2018).

Bates, D. W., Heitmueller, A., Kakad, M. & Saria, S. Why policymakers should care about “big data” in healthcare. Health Policy Technol. 7 , 211–216 (2018).

Choi, T.-M., Wallace, S. W. & Wang, Y. Big data analytics in operations management. Prod. Oper. Manag. 27 , 1868–1883 (2018).

Forestiero, A. & Papuzzo, G. Distributed algorithm for big data analytics in healthcare. In 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI) 776–779 (2018).

Ganesh, S. & Talukder, A. K. Formal methods, artificial intelligence, big-data analytics, and knowledge engineering in medical care to reduce disease burden and health disparities. In International Conference on Big Data Analytics 307–321 (2018).

Giacalone, M., Cusatelli, C. & Santarcangelo, V. Big data compliance for innovative clinical models. Big Data Res. 12 , 35–40 (2018).

Guha, S. & Kumar, S. Emergence of big data research in operations management, information systems, and healthcare: Past contributions and future roadmap. Prod. Oper. Manag. 27 , 1724–1735 (2018).

Gupta, V., Singh Gill, H., Singh, P. & Kaur, R. An energy efficient fog-cloud based architecture for healthcare. J. Stat. Manag. Syst. 21 , 529–537 (2018).

Hopp, W. J., Li, J. & Wang, G. Big data and the precision medicine revolution. Prod. Oper. Manag. 27 , 1647–1664 (2018).

Huang, H. K. Big data in PACS-based multimedia medical imaging informatics. In PACS Based Multimedia Imaging Informatics (ed Huang, H.) 575–589 (2018).

Istepanian, R. S. H. & Al-Anzi, T. m-Health 2.0: New perspectives on mobile health, machine learning and big data analytics. Methods 151 , 34–40 (2018).

Khaloufi, H., Abouelmehdi, K., Beni-hssane, A. & Saadi, M. Security model for big healthcare data lifecycle. Procedia Comput. Sci. 141 , 294–301 (2018).

Krittanawong, C., Johnson, K. W., Hershman, S. G. & Tang, W. H. W. Big data, artificial intelligence, and cardiovascular precision medicine. Expert Rev. Precis. Med. Drug Dev. 3 , 305–317 (2018).

Ma, X., Wang, Z., Zhou, S., Wen, H. & Zhang, Y. Intelligent healthcare systems assisted by data analytics and mobile computing. In 2018 14th International Wireless Communications & Mobile Computing Conference (IWCMC) 1317–1322 (2018).

Manogaran, G. et al. A new architecture of Internet of Things and big data ecosystem for secured smart healthcare monitoring and alerting system. Future Gener. Comput. Syst. 82 , 375–387 (2018).

Mehta, N. & Pandit, A. Concurrence of big data analytics and healthcare: A systematic review. Int. J. Med. Inform. 114 , 57–65 (2018).

Miller, J. B. Big data and biomedical informatics: Preparing for the modernization of clinical neuropsychology. Clin. Neuropsychol. 33 , 287–304 (2018).

Moutselos, K., Kyriazis, D. & Maglogiannis, I. A web based modular environment for assisting health policy making utilizing big data analytics. In 2018 9th International Conference on Information, Intelligence, Systems and Applications (IISA) 1–5 (2018).

Nair, L. R., Shetty, S. D. & Shetty, S. D. Applying spark based machine learning model on streaming big data for health status prediction. Comput. Electr. Eng. 65 , 393–399 (2018).

Pashazadeh, A. & Navimipour, N. J. Big data handling mechanisms in the healthcare applications: A comprehensive and systematic literature review. J. Biomed. Inform. 82 , 47–62 (2018).

Ravishankar Rao, A., Clarke, D. & Vargas, M. Building an open health data analytics platform: A case study examining relationships and trends in seniority and performance in healthcare providers. J. Healthc. Inform. Res. 2 , 44–70 (2018).

Sahoo, P. K., Mohapatra, S. K. & Wu, S.-L. SLA based healthcare big data analysis and computing in cloud network. J. Parallel Distrib. Comput. 119 , 121–135 (2018).

Sarkar, B. K. & Sana, S. S. A conceptual distributed framework for improved and secured healthcare system. Int. J. Healthc. Manag. 1–13 (2018).

Sebaa, A., Chikh, F., Nouicer, A. & Tari, A. Medical big data warehouse: architecture and system design, a case study: Improving healthcare resources distribution. J. Med. Syst. 42 , 59 (2018).

Shafqat, S., Kishwer, S., Rasool, R. U., Qadir, J., Amjad, T. & Ahmad, H. F. Big data analytics enhanced healthcare systems: A review. J. Supercomput.

Sivaparthipan, C. B., Karthikeyan, N. & Karthik, S. Designing statistical assessment healthcare information system for diabetics analysis using big data. Multimed. Tools Appl.

Tang, V. et al. An adaptive clinical decision support system for serving the elderly with chronic diseases in healthcare industry. Expert. Syst. 36 , e12369 (2018).

Wang, Y., Kung, L. & Byrd, T. A. Big data analytics: Understanding its capabilities and potential benefits for healthcare organizations. Technol. Forecast. Soc. Change 126 , 3–13 (2018).

Agrawal, A. & Choudhary, A. Health services data: Big data analytics for deriving predictive healthcare insights. In Health Services Evaluation 3–18 (2019).

Ahmed, M., Choudhury, S. & Al-Turjman, F. Big data analytics for intelligent internet of things. In Artificial Intelligence in IoT 107–127 (Springer, 2019).

Ahmed, Z. & Liang, B. T. Systematically dealing practical issues associated to healthcare data analytics. In Future of Information and Communication Conference 599–613 (2019).

Bora, D. J. Chapter 3—Big data analytics in healthcare: A critical analysis. In Big Data Analytics for Intelligent Healthcare Management (eds Dey, N. et al. ) 43–57 (Academic Press, 2019).

Chanchaichujit, J., Tan, A., Meng, F. & Eaimkhong, S. Internet of Things (IoT) and big data analytics in healthcare. In Healthcare 4.0 17–36 (Springer, 2019).

Cirillo, D. & Valencia, A. Big data analytics for personalized medicine. Curr. Opin. Biotechnol. 58 , 161–167 (2019).

Dey, N., Das, H., Naik, B. & Behera, H. S. Big Data Analytics for Intelligent Healthcare Management (Academic Press, 2019).

Din, S. & Paul, A. Smart health monitoring and management system: Toward autonomous wearable sensing for Internet of Things using big data analytics. Future Gener. Comput. Syst. 91 , 611–619 (2019).

Galetsi, P., Katsaliaki, K. & Kumar, S. Values, challenges and future directions of big data analytics in healthcare: A systematic review. Soc. Sci. Med. 241 , 112533 (2019).

Guo, C. & Chen, J. Big data analytics in healthcare: data-driven methods for typical treatment pattern mining. J. Syst. Sci. Syst. Eng. 28 , 694–714 (2019).

Hussain, S. et al. Semantic preservation of standardized healthcare documents in big data. Int. J. Med. Inform. 129 , 133–145 (2019).

Mehta, N., Pandit, A. & Shukla, S. Transforming healthcare with big data analytics and artificial intelligence: A systematic mapping study. J. Biomed. Inform. 100 , 103311 (2019).

Muniasamy, A., Tabassam, S., Hussain, M. A., Sultana, H., Muniasamy, V. & Bhatnagar, R. Deep learning for predictive analytics in healthcare. In International Conference on Advanced Machine Learning Technologies and Applications 32–42 (2019).

Palanisamy, V. & Thirunavukarasu, R. Implications of big data analytics in developing healthcare frameworks–A review. J. King Saud Univ. Comput. Inf. Sci. 31 , 415–425 (2019).

Rajabion, L., Shaltooki, A. A., Taghikhah, M., Ghasemi, A. & Badfar, A. Healthcare big data processing mechanisms: The role of cloud computing. Int. J. Inf. Manag. 49 , 271–289 (2019).

Ramasamy, V., Gomathy, B. & Verma, R. K. Smart HIV/AIDS digital system using big data analytics. In Progress in Advanced Computing and Intelligent Engineering 415–421 (Springer, 2019).

Razzak, M. I., Imran, M. & Xu, G. Big data analytics for preventive medicine. Neural Comput. Appl.

Reiz, A. N., de la Hoz, M. A. & García, M. S. Big data analysis and machine learning in intensive care units. Med. Intensiva 43 , 416–426 (2019).

Saheb, T. & Izadi, L. Paradigm of IoT big data analytics in the healthcare industry: A review of scientific literature and mapping of research trends. Telematics Inform. 41 , 70–85 (2019).

Sahoo, A. K. et al. Chapter 9—Intelligence-based health recommendation system using big data analytics. In Big Data Analytics for Intelligent Healthcare Management (eds Dey, N. et al. ) 227–246 (Academic Press, 2019).

Shahbaz, M., Gao, C., Zhai, L., Shahzad, F. & Hu, Y. Investigating the adoption of big data analytics in healthcare: The moderating role of resistance to change. J. Big Data 6 , 6 (2019).

Sivaparthipan, C. B. et al. Innovative and efficient method of robotics for helping the Parkinson’s disease patient using IoT in big data analytics. Trans. Emerg. Telecommun. Technol. 31 , e3838 (2019).

Sousa, M. J., Pesqueira, A. N. M., Lemos, C., Sousa, M. & Rocha, Ãl. Decision-making based on big data analytics for people management in healthcare organizations. J. Med. Syst. 43 , 290 (2019).

Strang, K. D. Problems with research methods in medical device big data analytics. Int. J. Data Sci. Anal.

Thomas, J., Kneale, D., McKenzie, J. E., Brennan, S. E. & Bhaumik, S. Determining the scope of the review and the questions it will address. In Cochrane Handbook for Systematic Reviews of Interventions 13–31 (2019).

Wang, Y., Kung, L., Gupta, S. & Ozdemir, S. Leveraging big data analytics to improve quality of care in healthcare organizations: A configurational perspective. Br. J. Manag. 30 , 362–388 (2019).

Zetino, J. & Mendoza, N. Big data and its utility in social work: Learning from the big data revolution in business and healthcare. Soc. Work Public Health 34 , 409–417 (2019).

Nazir, S., Nawaz, M., Adnan, A., Shahzad, S. & Asadi, S. Big data features, applications, and analytics in cardiology—A systematic literature review. IEEE Access 7 , 143742–143771 (2019).

Shah, G., Shah, A. & Shah, M. Panacea of challenges in real-world application of big data analytics in healthcare sector. J. Data Inf. Manag. 1 , 107–116 (2019).

Galetsi, P., Katsaliaki, K. & Kumar, S. Big data analytics in health sector: Theoretical framework, techniques and prospects. Int. J. Inf. Manag. 50 , 206–216 (2020).

Iyengar, S. P., Acharya, H. & Kadam, M. Big data analytics in healthcare using spreadsheets. In Big Data Analytics in Healthcare 155–187 (Springer, 2002).

Kumar, S. A. & Venkatesulu, M. BrownBoost classifier-based bloom hash data storage for healthcare big data analytics. In Information and Communication Technology for Sustainable Development 53–69 (Springer, 2020).

Kumar, Y., Sood, K., Kaul, S. & Vasuja, R. Big data analytics and its benefits in healthcare. In Big Data Analytics in Healthcare 3–21 (Springer, 2020).

Naqishbandi, T. A. & Ayyanathan, N. Clinical big data predictive analytics transforming healthcare:-An integrated framework for promise towards value based healthcare. In Advances in Decision Sciences 545–561 (Springer, 2020).

Lambay, M. A. & Mohideen, S. P. Big data analytics for healthcare recommendation systems. In 2020 International Conference on System, Computation, Automation and Networking (ICSCAN) 1–6 (2020).

Katarya, R. & Jain, S. Exploration of big data analytics in healthcare analytics. In 2020 4th International Conference on Computer, Communication and Signal Processing (ICCCSP) 1–4 (2020).

Javid, T., Faris, M., Beenish, H. & Fahad, M. Cybersecurity and data privacy in the cloudlet for preliminary healthcare big data analytics. In 2020 International Conference on Computing and Information Technology (ICCIT-1441) 1–4 (2020).

Leung, C. K., Chen, Y., Hoi, C. S. H., Shang, S. & Cuzzocrea, A. Machine learning and OLAP on big COVID-19 data. In 2020 IEEE International Conference on Big Data (Big Data) 5118–5127 (2020).

Akhtar, U., Lee, J. W., Bilal, H. S. M., Ali, T., Khan, W. A. & Lee, S. The impact of big data in healthcare analytics. In 2020 International Conference on Information Networking (ICOIN) 61–63 (2020).

Mung, P. S. & Phyu, S. Effective analytics on healthcare big data using ensemble learning. In 2020 IEEE Conference on Computer Applications (ICCA) 1–4 (2002).

Georgakopoulos, S. V., Gallos, P. & Plagianakos, V. P. Using big data analytics to detect fraud in healthcare provision. In 2020 IEEE 5th Middle East and Africa Conference on Biomedical Engineering (MECBME) 1–3 (2020).

Leung, C. K., Chen, Y., Shang, S. & Deng, D. Big data science on COVID-19 Data. In 2020 IEEE 14th International Conference on Big Data Science and Engineering (BigDataSE) 14–21 (2020).

Juddoo, S. & George, C. A Qualitative assessment of machine learning support for detecting data completeness and accuracy issues to improve data analytics in big data for the healthcare industry. In 2020 3rd International Conference on Emerging Trends in Electrical, Electronic and Communications Engineering (ELECOM) 58–66 (2020).

Chauhan, R. & Yafi, E. Big data analytics for prediction modelling in healthcare databases. In 2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM) 1–5 (2021).

Islam, M., Karim, R., Khatun, M. A. & Reza, S. A research on big data analytics in healthcare industry. In 2020 International Conference on Information Science and Communications Technologies (ICISCT) 1–5 (2020).

Leung, C. K., Chen, Y., Hoi, C. S. H., Shang, S., Wen, Y. & Cuzzocrea, A. Big data visualization and visual analytics of COVID-19 data. In 2020 24th International Conference Information Visualisation (IV) 415–420 (2020).

Balaji, S. & Prasathkumar, V. Dynamic changes by big data in health care. In 2020 International Conference on Computer Communication and Informatics (ICCCI) 1–4 (2020).

Alahmar, A. & Benlamri, R. Optimizing hospital resources using big data analytics with standardized e-clinical pathways. In 2020 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech) 650–657 (2020).

Sadineni, P. K. Developing a model to enhance the quality of health informatics using big data. In 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC) 1267–1272 (2020).

Pramanik, M. I. et al. Healthcare informatics and analytics in big data. Expert Syst. Appl. 152 , 113388 (2020).

Ravikumaran, P., Vimala Devi, K., Kartheeban, K. & Narayanan Prasanth, N. Health data analytics: Framework & review on tool & technology. Mater. Today Proc. (2020).

Ramesh, T. & Santhi, V. Exploring big data analytics in health care. Int. J. Intell. Netw. 1 , 135–140 (2020).

Galetsi, P. & Katsaliaki, K. A review of the literature on big data analytics in healthcare. J. Oper. Res. Soc. 71 , 1511–1529 (2020).

Mehta, N., Pandit, A. & Kulkarni, M. Elements of healthcare big data analytics. In Big Data Analytics in Healthcare 23–43 (Springer, 2020).

Ehwerhemuepha, L. et al. HealtheDataLab–a cloud computing solution for data science and advanced analytics in healthcare with application to predicting multi-center pediatric readmissions. BMC Med. Inform. Decis. Mak. 20 , 1–12 (2020).

Sivasangari, A., Lakshmanan, L., Ajitha, P., Deepa, D. & Jabez, J. Big data analytics for 5G-enabled IoT healthcare. In Blockchain for 5G-Enabled IoT 261.

Ma, S. & Huai, J. Approximate computation for big data analytics. SIGWEB Newsl. (2021).

Uzunbaz, S. & Aref, W. G. Shared execution techniques for business data analytics over big data streams. In Presented at the 32nd International Conference on Scientific and Statistical Database Management, Vienna, Austria (2020).

Chalumporn, G. & Hewett, R. Health data analytics with an opportunistic big data algorithm. In Presented at the Proceedings of the 11th International Conference on Advances in Information Technology, Bangkok, Thailand (2020).

Minami, T. & Ohura, Y. Small data analysis for bigger data analysis. In Presented at the 2021 Workshop on Algorithm and Big Data, Fuzhou, China (2021).

Chakraborty, C. & Rathi, M. Chapter 2—Smart healthcare systems using big data. In Demystifying Big Data, Machine Learning, and Deep Learning for Healthcare Analytics (eds Kautish, P. N. S. & Peng, S.-L.) 17–32 (Academic Press, 2021).

Ilmudeen, A. Chapter 3—Big data-based frameworks for healthcare systems. In Demystifying Big Data, Machine Learning, and Deep Learning for Healthcare Analytics (eds Kautish, P. N. S. & Peng, S.-L.) 33–56 (Academic Press, 2021).

Mendhe, C. H., Henderson, N., Srivastava, G. & Mago, V. A scalable platform to collect, store, visualize, and analyze big data in real time. IEEE Trans. Comput. Soc. Syst. 8 , 260–269 (2021).

Sivabalaselvamani, D., Selvakarthi, D., Yogapriya, J., Thiruvenkatasuresh, M. P., Maruthappa, M. & Chandra, A. S. Artificial Intelligence in data-driven analytics for the personalized healthcare. In 2021 International Conference on Computer Communication and Informatics (ICCCI) 1–5 (2021)

Harb, H., Mansour, A., Nasser, A., Cruz, E. M. & de la Torre Diez, I. A sensor-based data analytics for patient monitoring in connected healthcare applications. IEEE Sens. J. 21 , 974–984 (2021).

Article   ADS   CAS   Google Scholar  

Jones, J. & Jones, J. Optimizing healthcare. In 2020 IEEE International Conference on E-health Networking, Application & Services (HEALTHCOM) 1–6 (2021).

Hassan, S., Dhali, M., Zaman, F. & Tanveer, M. Big data and predictive analytics in healthcare in Bangladesh: Regulatory challenges. Heliyon 7 , e07179 (2021).

Khan, S. et al. KNN and ANN-based recognition of handwritten pashto letters using zoning features. Mach. Learn. 9 , 570–577 (2018).

Pant, D., Kumar, V., Kishore, J. & Pal, R. Healthcare data modeling in R. In 2017 1st International Conference on Intelligent Systems and Information Management (ICISIM) 230–233 (2017).

Brennan, P. F. & Bakken, S. Nursing needs big data and big data needs nursing. J. Nurs. Scholarsh. 47 , 477–484 (2015).

Sreedevi, A. G., Nitya Harshitha, T., Sugumaran, V. & Shankar, P. Application of cognitive computing in healthcare, cybersecurity, big data and IoT: A literature review. Inform. Process. Manag. 59 , 102888 (2022).

Sinha, A., Hripcsak, G. & Markatou, M. Large datasets in biomedicine: A discussion of salient analytic issues. J. Am. Med. Inform. Assoc. JAMIA 16 , 759–767 (2009).

Alonso-Betanzos, A. & Bolón-Canedo, V. Big-Data analysis, cluster analysis, and machine-learning approaches (2018).

Dayal, M. & Singh, N. Indian health care analysis using big data programming tool. Procedia Comput. Sci. 89 , 521–527 (2016).

Jayaraman, P. P., Forkan, A. R. M., Morshed, A., Haghighi, P. D. & Kang, Y.-B. Healthcare 4.0: A review of frontiers in digital health. WIREs Data Min. Knowl. Discov. 10 , e1350 (2018).

Gallos, P. et al. CrowdHEALTH: Big data analytics and holistic health records. Stud. Health Technol. Inform. 258 , 255–256 (2019).

Wang, L., Ranjan, R., Kołodziej, J., Zomaya, A. & Alem, L. Software tools and techniques for big data computing in healthcare clouds. Future Gener. Comput. Syst. 43–44 , 38–39 (2015).

Kiourtis, A. et al. An autoscaling platform supporting graph data modelling big data analytics. Stud. Health Technol. Inform. 295 , 376–379 (2022).

Download references

Acknowledgements

This research work is performed by Department of Accounting and Information Systems, Collage of Business and Economics, Qatar University in collaboration with the Department of Computer Science, University of Swabi, Swabi, Pakistan.

Open Access funding provided by the Qatar National Library. This research was funded by Qatar University Internal Grant under Grant No. IRCC-2021–010. The findings achieved herein are solely the responsibility of the authors.

Author information

Authors and affiliations.

Department of Accounting and Information Systems, College of Business and Economics, Qatar University, Doha, Qatar

Sulaiman Khan & Habib Ullah Khan

Department of Computer Science, University of Swabi, Swabi, Pakistan

You can also search for this author in PubMed   Google Scholar

Contributions

S.K. wrote the original draft of the paper. He also revised the draft based on the reviewers suggestions. Dr. H.U.K. developed the experimental setup for the proposed systematic research work. Dr. S.N. performed articles accumulation and database development process.

Corresponding author

Correspondence to Habib Ullah Khan .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Khan, S., Khan, H.U. & Nazir, S. Systematic analysis of healthcare big data analytics for efficient care and disease diagnosing. Sci Rep 12 , 22377 (2022). https://doi.org/10.1038/s41598-022-26090-5

Download citation

Received : 09 September 2022

Accepted : 09 December 2022

Published : 26 December 2022

DOI : https://doi.org/10.1038/s41598-022-26090-5

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

This article is cited by

The usage of population and disease registries as pre-screening tools for clinical trials, a systematic review.

  • Juliette Foucher
  • Louisa Azizi
  • Caroline Ingre

Systematic Reviews (2024)

By submitting a comment you agree to abide by our Terms and Community Guidelines . If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

big data thesis paper

To read this content please select one of the options below:

Please note you do not have access to teaching notes, big data and performance measurement research: trends, evolution and future opportunities.

Measuring Business Excellence

ISSN : 1368-3047

Article publication date: 20 June 2020

Issue publication date: 30 November 2023

Current literature recognised big data as a digital revolution affecting all organisational processes. To obtain a competitive advantage from the use of big data, an efficient integration in a performance measurement system (PMS) is needed, but it is still a “great challenge” in performance measurement research. This paper aims to review the big data and performance measurement studies to identify the publications’ trends and future research opportunities.

Design/methodology/approach

The authors reviewed 873 documents on big data and performance carrying out an extensive bibliometric analysis using two main techniques, i.e. performance analysis and science mapping.

Results point to a significant increase in the number of publications on big data and performance, highlighting a shortage of studies on business, management and accounting areas, and on how big data can improve performance measurement. Future research opportunities are identified. They regard the development of further research to explain how performance measurement field can effectively integrate big data into a PMS and describe the main themes related to big data in performance measurement literature.

Originality/value

This paper gives a holistic view of big data and performance measurement research through the inclusion of numerous contributions on different research streams. It also encourages further study for developing concrete tools.

  • Performance measurement
  • Performance management
  • Bibliometric analysis
  • Strategic management
  • Performance management systems

Sardi, A. , Sorano, E. , Cantino, V. and Garengo, P. (2023), "Big data and performance measurement research: trends, evolution and future opportunities", Measuring Business Excellence , Vol. 27 No. 4, pp. 531-548. https://doi.org/10.1108/MBE-06-2019-0053

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited

Related articles

All feedback is valuable.

Please share your general feedback

Report an issue or find answers to frequently asked questions

Contact Customer Support

  • Open access
  • Published: 06 January 2022

The use of Big Data Analytics in healthcare

  • Kornelia Batko   ORCID: orcid.org/0000-0001-6561-3826 1 &
  • Andrzej Ślęzak 2  

Journal of Big Data volume  9 , Article number:  3 ( 2022 ) Cite this article

80k Accesses

142 Citations

28 Altmetric

Metrics details

The introduction of Big Data Analytics (BDA) in healthcare will allow to use new technologies both in treatment of patients and health management. The paper aims at analyzing the possibilities of using Big Data Analytics in healthcare. The research is based on a critical analysis of the literature, as well as the presentation of selected results of direct research on the use of Big Data Analytics in medical facilities. The direct research was carried out based on research questionnaire and conducted on a sample of 217 medical facilities in Poland. Literature studies have shown that the use of Big Data Analytics can bring many benefits to medical facilities, while direct research has shown that medical facilities in Poland are moving towards data-based healthcare because they use structured and unstructured data, reach for analytics in the administrative, business and clinical area. The research positively confirmed that medical facilities are working on both structural data and unstructured data. The following kinds and sources of data can be distinguished: from databases, transaction data, unstructured content of emails and documents, data from devices and sensors. However, the use of data from social media is lower as in their activity they reach for analytics, not only in the administrative and business but also in the clinical area. It clearly shows that the decisions made in medical facilities are highly data-driven. The results of the study confirm what has been analyzed in the literature that medical facilities are moving towards data-based healthcare, together with its benefits.

Introduction

The main contribution of this paper is to present an analytical overview of using structured and unstructured data (Big Data) analytics in medical facilities in Poland. Medical facilities use both structured and unstructured data in their practice. Structured data has a predetermined schema, it is extensive, freeform, and comes in variety of forms [ 27 ]. In contrast, unstructured data, referred to as Big Data (BD), does not fit into the typical data processing format. Big Data is a massive amount of data sets that cannot be stored, processed, or analyzed using traditional tools. It remains stored but not analyzed. Due to the lack of a well-defined schema, it is difficult to search and analyze such data and, therefore, it requires a specific technology and method to transform it into value [ 20 , 68 ]. Integrating data stored in both structured and unstructured formats can add significant value to an organization [ 27 ]. Organizations must approach unstructured data in a different way. Therefore, the potential is seen in Big Data Analytics (BDA). Big Data Analytics are techniques and tools used to analyze and extract information from Big Data. The results of Big Data analysis can be used to predict the future. They also help in creating trends about the past. When it comes to healthcare, it allows to analyze large datasets from thousands of patients, identifying clusters and correlation between datasets, as well as developing predictive models using data mining techniques [ 60 ].

This paper is the first study to consolidate and characterize the use of Big Data from different perspectives. The first part consists of a brief literature review of studies on Big Data (BD) and Big Data Analytics (BDA), while the second part presents results of direct research aimed at diagnosing the use of big data analyses in medical facilities in Poland.

Healthcare is a complex system with varied stakeholders: patients, doctors, hospitals, pharmaceutical companies and healthcare decision-makers. This sector is also limited by strict rules and regulations. However, worldwide one may observe a departure from the traditional doctor-patient approach. The doctor becomes a partner and the patient is involved in the therapeutic process [ 14 ]. Healthcare is no longer focused solely on the treatment of patients. The priority for decision-makers should be to promote proper health attitudes and prevent diseases that can be avoided [ 81 ]. This became visible and important especially during the Covid-19 pandemic [ 44 ].

The next challenges that healthcare will have to face is the growing number of elderly people and a decline in fertility. Fertility rates in the country are found below the reproductive minimum necessary to keep the population stable [ 10 ]. The reflection of both effects, namely the increase in age and lower fertility rates, are demographic load indicators, which is constantly growing. Forecasts show that providing healthcare in the form it is provided today will become impossible in the next 20 years [ 70 ]. It is especially visible now during the Covid-19 pandemic when healthcare faced quite a challenge related to the analysis of huge data amounts and the need to identify trends and predict the spread of the coronavirus. The pandemic showed it even more that patients should have access to information about their health condition, the possibility of digital analysis of this data and access to reliable medical support online. Health monitoring and cooperation with doctors in order to prevent diseases can actually revolutionize the healthcare system. One of the most important aspects of the change necessary in healthcare is putting the patient in the center of the system.

Technology is not enough to achieve these goals. Therefore, changes should be made not only at the technological level but also in the management and design of complete healthcare processes and what is more, they should affect the business models of service providers. The use of Big Data Analytics is becoming more and more common in enterprises [ 17 , 54 ]. However, medical enterprises still cannot keep up with the information needs of patients, clinicians, administrators and the creator’s policy. The adoption of a Big Data approach would allow the implementation of personalized and precise medicine based on personalized information, delivered in real time and tailored to individual patients.

To achieve this goal, it is necessary to implement systems that will be able to learn quickly about the data generated by people within clinical care and everyday life. This will enable data-driven decision making, receiving better personalized predictions about prognosis and responses to treatments; a deeper understanding of the complex factors and their interactions that influence health at the patient level, the health system and society, enhanced approaches to detecting safety problems with drugs and devices, as well as more effective methods of comparing prevention, diagnostic, and treatment options [ 40 ].

In the literature, there is a lot of research showing what opportunities can be offered to companies by big data analysis and what data can be analyzed. However, there are few studies showing how data analysis in the area of healthcare is performed, what data is used by medical facilities and what analyses and in which areas they carry out. This paper aims to fill this gap by presenting the results of research carried out in medical facilities in Poland. The goal is to analyze the possibilities of using Big Data Analytics in healthcare, especially in Polish conditions. In particular, the paper is aimed at determining what data is processed by medical facilities in Poland, what analyses they perform and in what areas, and how they assess their analytical maturity. In order to achieve this goal, a critical analysis of the literature was performed, and the direct research was based on a research questionnaire conducted on a sample of 217 medical facilities in Poland. It was hypothesized that medical facilities in Poland are working on both structured and unstructured data and moving towards data-based healthcare and its benefits. Examining the maturity of healthcare facilities in the use of Big Data and Big Data Analytics is crucial in determining the potential future benefits that the healthcare sector can gain from Big Data Analytics. There is also a pressing need to predicate whether, in the coming years, healthcare will be able to cope with the threats and challenges it faces.

This paper is divided into eight parts. The first is the introduction which provides background and the general problem statement of this research. In the second part, this paper discusses considerations on use of Big Data and Big Data Analytics in Healthcare, and then, in the third part, it moves on to challenges and potential benefits of using Big Data Analytics in healthcare. The next part involves the explanation of the proposed method. The result of direct research and discussion are presented in the fifth part, while the following part of the paper is the conclusion. The seventh part of the paper presents practical implications. The final section of the paper provides limitations and directions for future research.

Considerations on use Big Data and Big Data Analytics in the healthcare

In recent years one can observe a constantly increasing demand for solutions offering effective analytical tools. This trend is also noticeable in the analysis of large volumes of data (Big Data, BD). Organizations are looking for ways to use the power of Big Data to improve their decision making, competitive advantage or business performance [ 7 , 54 ]. Big Data is considered to offer potential solutions to public and private organizations, however, still not much is known about the outcome of the practical use of Big Data in different types of organizations [ 24 ].

As already mentioned, in recent years, healthcare management worldwide has been changed from a disease-centered model to a patient-centered model, even in value-based healthcare delivery model [ 68 ]. In order to meet the requirements of this model and provide effective patient-centered care, it is necessary to manage and analyze healthcare Big Data.

The issue often raised when it comes to the use of data in healthcare is the appropriate use of Big Data. Healthcare has always generated huge amounts of data and nowadays, the introduction of electronic medical records, as well as the huge amount of data sent by various types of sensors or generated by patients in social media causes data streams to constantly grow. Also, the medical industry generates significant amounts of data, including clinical records, medical images, genomic data and health behaviors. Proper use of the data will allow healthcare organizations to support clinical decision-making, disease surveillance, and public health management. The challenge posed by clinical data processing involves not only the quantity of data but also the difficulty in processing it.

In the literature one can find many different definitions of Big Data. This concept has evolved in recent years, however, it is still not clearly understood. Nevertheless, despite the range and differences in definitions, Big Data can be treated as a: large amount of digital data, large data sets, tool, technology or phenomenon (cultural or technological.

Big Data can be considered as massive and continually generated digital datasets that are produced via interactions with online technologies [ 53 ]. Big Data can be defined as datasets that are of such large sizes that they pose challenges in traditional storage and analysis techniques [ 28 ]. A similar opinion about Big Data was presented by Ohlhorst who sees Big Data as extremely large data sets, possible neither to manage nor to analyze with traditional data processing tools [ 57 ]. In his opinion, the bigger the data set, the more difficult it is to gain any value from it.

In turn, Knapp perceived Big Data as tools, processes and procedures that allow an organization to create, manipulate and manage very large data sets and storage facilities [ 38 ]. From this point of view, Big Data is identified as a tool to gather information from different databases and processes, allowing users to manage large amounts of data.

Similar perception of the term ‘Big Data’ is shown by Carter. According to him, Big Data technologies refer to a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data by enabling high velocity capture, discovery and/or analysis [ 13 ].

Jordan combines these two approaches by identifying Big Data as a complex system, as it needs data bases for data to be stored in, programs and tools to be managed, as well as expertise and personnel able to retrieve useful information and visualization to be understood [ 37 ].

Following the definition of Laney for Big Data, it can be state that: it is large amount of data generated in very fast motion and it contains a lot of content [ 43 ]. Such data comes from unstructured sources, such as stream of clicks on the web, social networks (Twitter, blogs, Facebook), video recordings from the shops, recording of calls in a call center, real time information from various kinds of sensors, RFID, GPS devices, mobile phones and other devices that identify and monitor something [ 8 ]. Big Data is a powerful digital data silo, raw, collected with all sorts of sources, unstructured and difficult, or even impossible, to analyze using conventional techniques used so far to relational databases.

While describing Big Data, it cannot be overlooked that the term refers more to a phenomenon than to specific technology. Therefore, instead of defining this phenomenon, trying to describe them, more authors are describing Big Data by giving them characteristics included a collection of V’s related to its nature [ 2 , 3 , 23 , 25 , 58 ]:

Volume (refers to the amount of data and is one of the biggest challenges in Big Data Analytics),

Velocity (speed with which new data is generated, the challenge is to be able to manage data effectively and in real time),

Variety (heterogeneity of data, many different types of healthcare data, the challenge is to derive insights by looking at all available heterogenous data in a holistic manner),

Variability (inconsistency of data, the challenge is to correct the interpretation of data that can vary significantly depending on the context),

Veracity (how trustworthy the data is, quality of the data),

Visualization (ability to interpret data and resulting insights, challenging for Big Data due to its other features as described above).

Value (the goal of Big Data Analytics is to discover the hidden knowledge from huge amounts of data).

Big Data is defined as an information asset with high volume, velocity, and variety, which requires specific technology and method for its transformation into value [ 21 , 77 ]. Big Data is also a collection of information about high-volume, high volatility or high diversity, requiring new forms of processing in order to support decision-making, discovering new phenomena and process optimization [ 5 , 7 ]. Big Data is too large for traditional data-processing systems and software tools to capture, store, manage and analyze, therefore it requires new technologies [ 28 , 50 , 61 ] to manage (capture, aggregate, process) its volume, velocity and variety [ 9 ].

Undoubtedly, Big Data differs from the data sources used so far by organizations. Therefore, organizations must approach this type of unstructured data in a different way. First of all, organizations must start to see data as flows and not stocks—this entails the need to implement the so-called streaming analytics [ 48 ]. The mentioned features make it necessary to use new IT tools that allow the fullest use of new data [ 58 ]. The Big Data idea, inseparable from the huge increase in data available to various organizations or individuals, creates opportunities for access to valuable analyses, conclusions and enables making more accurate decisions [ 6 , 11 , 59 ].

The Big Data concept is constantly evolving and currently it does not focus on huge amounts of data, but rather on the process of creating value from this data [ 52 ]. Big Data is collected from various sources that have different data properties and are processed by different organizational units, resulting in creation of a Big Data chain [ 36 ]. The aim of the organizations is to manage, process and analyze Big Data. In the healthcare sector, Big Data streams consist of various types of data, namely [ 8 , 51 ]:

clinical data, i.e. data obtained from electronic medical records, data from hospital information systems, image centers, laboratories, pharmacies and other organizations providing health services, patient generated health data, physician’s free-text notes, genomic data, physiological monitoring data [ 4 ],

biometric data provided from various types of devices that monitor weight, pressure, glucose level, etc.,

financial data, constituting a full record of economic operations reflecting the conducted activity,

data from scientific research activities, i.e. results of research, including drug research, design of medical devices and new methods of treatment,

data provided by patients, including description of preferences, level of satisfaction, information from systems for self-monitoring of their activity: exercises, sleep, meals consumed, etc.

data from social media.

These data are provided not only by patients but also by organizations and institutions, as well as by various types of monitoring devices, sensors or instruments [ 16 ]. Data that has been generated so far in the healthcare sector is stored in both paper and digital form. Thus, the essence and the specificity of the process of Big Data analyses means that organizations need to face new technological and organizational challenges [ 67 ]. The healthcare sector has always generated huge amounts of data and this is connected, among others, with the need to store medical records of patients. However, the problem with Big Data in healthcare is not limited to an overwhelming volume but also an unprecedented diversity in terms of types, data formats and speed with which it should be analyzed in order to provide the necessary information on an ongoing basis [ 3 ]. It is also difficult to apply traditional tools and methods for management of unstructured data [ 67 ]. Due to the diversity and quantity of data sources that are growing all the time, advanced analytical tools and technologies, as well as Big Data analysis methods which can meet and exceed the possibilities of managing healthcare data, are needed [ 3 , 68 ].

Therefore, the potential is seen in Big Data analyses, especially in the aspect of improving the quality of medical care, saving lives or reducing costs [ 30 ]. Extracting from this tangle of given association rules, patterns and trends will allow health service providers and other stakeholders in the healthcare sector to offer more accurate and more insightful diagnoses of patients, personalized treatment, monitoring of the patients, preventive medicine, support of medical research and health population, as well as better quality of medical services and patient care while, at the same time, the ability to reduce costs (Fig.  1 ).

figure 1

(Source: Own elaboration)

Healthcare Big Data Analytics applications

The main challenge with Big Data is how to handle such a large amount of information and use it to make data-driven decisions in plenty of areas [ 64 ]. In the context of healthcare data, another major challenge is to adjust big data storage, analysis, presentation of analysis results and inference basing on them in a clinical setting. Data analytics systems implemented in healthcare are designed to describe, integrate and present complex data in an appropriate way so that it can be understood better (Fig.  2 ). This would improve the efficiency of acquiring, storing, analyzing and visualizing big data from healthcare [ 71 ].

figure 2

Process of Big Data Analytics

The result of data processing with the use of Big Data Analytics is appropriate data storytelling which may contribute to making decisions with both lower risk and data support. This, in turn, can benefit healthcare stakeholders. To take advantage of the potential massive amounts of data in healthcare and to ensure that the right intervention to the right patient is properly timed, personalized, and potentially beneficial to all components of the healthcare system such as the payer, patient, and management, analytics of large datasets must connect communities involved in data analytics and healthcare informatics [ 49 ]. Big Data Analytics can provide insight into clinical data and thus facilitate informed decision-making about the diagnosis and treatment of patients, prevention of diseases or others. Big Data Analytics can also improve the efficiency of healthcare organizations by realizing the data potential [ 3 , 62 ].

Big Data Analytics in medicine and healthcare refers to the integration and analysis of a large amount of complex heterogeneous data, such as various omics (genomics, epigenomics, transcriptomics, proteomics, metabolomics, interactomics, pharmacogenetics, deasomics), biomedical data, talemedicine data (sensors, medical equipment data) and electronic health records data [ 46 , 65 ].

When analyzing the phenomenon of Big Data in the healthcare sector, it should be noted that it can be considered from the point of view of three areas: epidemiological, clinical and business.

From a clinical point of view, the Big Data analysis aims to improve the health and condition of patients, enable long-term predictions about their health status and implementation of appropriate therapeutic procedures. Ultimately, the use of data analysis in medicine is to allow the adaptation of therapy to a specific patient, that is personalized medicine (precision, personalized medicine).

From an epidemiological point of view, it is desirable to obtain an accurate prognosis of morbidity in order to implement preventive programs in advance.

In the business context, Big Data analysis may enable offering personalized packages of commercial services or determining the probability of individual disease and infection occurrence. It is worth noting that Big Data means not only the collection and processing of data but, most of all, the inference and visualization of data necessary to obtain specific business benefits.

In order to introduce new management methods and new solutions in terms of effectiveness and transparency, it becomes necessary to make data more accessible, digital, searchable, as well as analyzed and visualized.

Erickson and Rothberg state that the information and data do not reveal their full value until insights are drawn from them. Data becomes useful when it enhances decision making and decision making is enhanced only when analytical techniques are used and an element of human interaction is applied [ 22 ].

Thus, healthcare has experienced much progress in usage and analysis of data. A large-scale digitalization and transparency in this sector is a key statement of almost all countries governments policies. For centuries, the treatment of patients was based on the judgment of doctors who made treatment decisions. In recent years, however, Evidence-Based Medicine has become more and more important as a result of it being related to the systematic analysis of clinical data and decision-making treatment based on the best available information [ 42 ]. In the healthcare sector, Big Data Analytics is expected to improve the quality of life and reduce operational costs [ 72 , 82 ]. Big Data Analytics enables organizations to improve and increase their understanding of the information contained in data. It also helps identify data that provides insightful insights for current as well as future decisions [ 28 ].

Big Data Analytics refers to technologies that are grounded mostly in data mining: text mining, web mining, process mining, audio and video analytics, statistical analysis, network analytics, social media analytics and web analytics [ 16 , 25 , 31 ]. Different data mining techniques can be applied on heterogeneous healthcare data sets, such as: anomaly detection, clustering, classification, association rules as well as summarization and visualization of those Big Data sets [ 65 ]. Modern data analytics techniques explore and leverage unique data characteristics even from high-speed data streams and sensor data [ 15 , 16 , 31 , 55 ]. Big Data can be used, for example, for better diagnosis in the context of comprehensive patient data, disease prevention and telemedicine (in particular when using real-time alerts for immediate care), monitoring patients at home, preventing unnecessary hospital visits, integrating medical imaging for a wider diagnosis, creating predictive analytics, reducing fraud and improving data security, better strategic planning and increasing patients’ involvement in their own health.

Big Data Analytics in healthcare can be divided into [ 33 , 73 , 74 ]:

descriptive analytics in healthcare is used to understand past and current healthcare decisions, converting data into useful information for understanding and analyzing healthcare decisions, outcomes and quality, as well as making informed decisions [ 33 ]. It can be used to create reports (i.e. about patients’ hospitalizations, physicians’ performance, utilization management), visualization, customized reports, drill down tables, or running queries on the basis of historical data.

predictive analytics operates on past performance in an effort to predict the future by examining historical or summarized health data, detecting patterns of relationships in these data, and then extrapolating these relationships to forecast. It can be used to i.e. predict the response of different patient groups to different drugs (dosages) or reactions (clinical trials), anticipate risk and find relationships in health data and detect hidden patterns [ 62 ]. In this way, it is possible to predict the epidemic spread, anticipate service contracts and plan healthcare resources. Predictive analytics is used in proper diagnosis and for appropriate treatments to be given to patients suffering from certain diseases [ 39 ].

prescriptive analytics—occurs when health problems involve too many choices or alternatives. It uses health and medical knowledge in addition to data or information. Prescriptive analytics is used in many areas of healthcare, including drug prescriptions and treatment alternatives. Personalized medicine and evidence-based medicine are both supported by prescriptive analytics.

discovery analytics—utilizes knowledge about knowledge to discover new “inventions” like drugs (drug discovery), previously unknown diseases and medical conditions, alternative treatments, etc.

Although the models and tools used in descriptive, predictive, prescriptive, and discovery analytics are different, many applications involve all four of them [ 62 ]. Big Data Analytics in healthcare can help enable personalized medicine by identifying optimal patient-specific treatments. This can influence the improvement of life standards, reduce waste of healthcare resources and save costs of healthcare [ 56 , 63 , 71 ]. The introduction of large data analysis gives new analytical possibilities in terms of scope, flexibility and visualization. Techniques such as data mining (computational pattern discovery process in large data sets) facilitate inductive reasoning and analysis of exploratory data, enabling scientists to identify data patterns that are independent of specific hypotheses. As a result, predictive analysis and real-time analysis becomes possible, making it easier for medical staff to start early treatments and reduce potential morbidity and mortality. In addition, document analysis, statistical modeling, discovering patterns and topics in document collections and data in the EHR, as well as an inductive approach can help identify and discover relationships between health phenomena.

Advanced analytical techniques can be used for a large amount of existing (but not yet analytical) data on patient health and related medical data to achieve a better understanding of the information and results obtained, as well as to design optimal clinical pathways [ 62 ]. Big Data Analytics in healthcare integrates analysis of several scientific areas such as bioinformatics, medical imaging, sensor informatics, medical informatics and health informatics [ 65 ]. Big Data Analytics in healthcare allows to analyze large datasets from thousands of patients, identifying clusters and correlation between datasets, as well as developing predictive models using data mining techniques [ 65 ]. Discussing all the techniques used for Big Data Analytics goes beyond the scope of a single article [ 25 ].

The success of Big Data analysis and its accuracy depend heavily on the tools and techniques used to analyze the ability to provide reliable, up-to-date and meaningful information to various stakeholders [ 12 ]. It is believed that the implementation of big data analytics by healthcare organizations could bring many benefits in the upcoming years, including lowering health care costs, better diagnosis and prediction of diseases and their spread, improving patient care and developing protocols to prevent re-hospitalization, optimizing staff, optimizing equipment, forecasting the need for hospital beds, operating rooms, treatments, and improving the drug supply chain [ 71 ].

Challenges and potential benefits of using Big Data Analytics in healthcare

Modern analytics gives possibilities not only to have insight in historical data, but also to have information necessary to generate insight into what may happen in the future. Even when it comes to prediction of evidence-based actions. The emphasis on reform has prompted payers and suppliers to pursue data analysis to reduce risk, detect fraud, improve efficiency and save lives. Everyone—payers, providers, even patients—are focusing on doing more with fewer resources. Thus, some areas in which enhanced data and analytics can yield the greatest results include various healthcare stakeholders (Table 1 ).

Healthcare organizations see the opportunity to grow through investments in Big Data Analytics. In recent years, by collecting medical data of patients, converting them into Big Data and applying appropriate algorithms, reliable information has been generated that helps patients, physicians and stakeholders in the health sector to identify values and opportunities [ 31 ]. It is worth noting that there are many changes and challenges in the structure of the healthcare sector. Digitization and effective use of Big Data in healthcare can bring benefits to every stakeholder in this sector. A single doctor would benefit the same as the entire healthcare system. Potential opportunities to achieve benefits and effects from Big Data in healthcare can be divided into four groups [ 8 ]:

Improving the quality of healthcare services:

assessment of diagnoses made by doctors and the manner of treatment of diseases indicated by them based on the decision support system working on Big Data collections,

detection of more effective, from a medical point of view, and more cost-effective ways to diagnose and treat patients,

analysis of large volumes of data to reach practical information useful for identifying needs, introducing new health services, preventing and overcoming crises,

prediction of the incidence of diseases,

detecting trends that lead to an improvement in health and lifestyle of the society,

analysis of the human genome for the introduction of personalized treatment.

Supporting the work of medical personnel

doctors’ comparison of current medical cases to cases from the past for better diagnosis and treatment adjustment,

detection of diseases at earlier stages when they can be more easily and quickly cured,

detecting epidemiological risks and improving control of pathogenic spots and reaction rates,

identification of patients who are predicted to have the highest risk of specific, life-threatening diseases by collating data on the history of the most common diseases, in healing people with reports entering insurance companies,

health management of each patient individually (personalized medicine) and health management of the whole society,

capturing and analyzing large amounts of data from hospitals and homes in real time, life monitoring devices to monitor safety and predict adverse events,

analysis of patient profiles to identify people for whom prevention should be applied, lifestyle change or preventive care approach,

the ability to predict the occurrence of specific diseases or worsening of patients’ results,

predicting disease progression and its determinants, estimating the risk of complications,

detecting drug interactions and their side effects.

Supporting scientific and research activity

supporting work on new drugs and clinical trials thanks to the possibility of analyzing “all data” instead of selecting a test sample,

the ability to identify patients with specific, biological features that will take part in specialized clinical trials,

selecting a group of patients for which the tested drug is likely to have the desired effect and no side effects,

using modeling and predictive analysis to design better drugs and devices.

Business and management

reduction of costs and counteracting abuse and counseling practices,

faster and more effective identification of incorrect or unauthorized financial operations in order to prevent abuse and eliminate errors,

increase in profitability by detecting patients generating high costs or identifying doctors whose work, procedures and treatment methods cost the most and offering them solutions that reduce the amount of money spent,

identification of unnecessary medical activities and procedures, e.g. duplicate tests.

According to research conducted by Wang, Kung and Byrd, Big Data Analytics benefits can be classified into five categories: IT infrastructure benefits (reducing system redundancy, avoiding unnecessary IT costs, transferring data quickly among healthcare IT systems, better use of healthcare systems, processing standardization among various healthcare IT systems, reducing IT maintenance costs regarding data storage), operational benefits (improving the quality and accuracy of clinical decisions, processing a large number of health records in seconds, reducing the time of patient travel, immediate access to clinical data to analyze, shortening the time of diagnostic test, reductions in surgery-related hospitalizations, exploring inconceivable new research avenues), organizational benefits (detecting interoperability problems much more quickly than traditional manual methods, improving cross-functional communication and collaboration among administrative staffs, researchers, clinicians and IT staffs, enabling data sharing with other institutions and adding new services, content sources and research partners), managerial benefits (gaining quick insights about changing healthcare trends in the market, providing members of the board and heads of department with sound decision-support information on the daily clinical setting, optimizing business growth-related decisions) and strategic benefits (providing a big picture view of treatment delivery for meeting future need, creating high competitive healthcare services) [ 73 ].

The above specification does not constitute a full list of potential areas of use of Big Data Analysis in healthcare because the possibilities of using analysis are practically unlimited. In addition, advanced analytical tools allow to analyze data from all possible sources and conduct cross-analyses to provide better data insights [ 26 ]. For example, a cross-analysis can refer to a combination of patient characteristics, as well as costs and care results that can help identify the best, in medical terms, and the most cost-effective treatment or treatments and this may allow a better adjustment of the service provider’s offer [ 62 ].

In turn, the analysis of patient profiles (e.g. segmentation and predictive modeling) allows identification of people who should be subject to prophylaxis, prevention or should change their lifestyle [ 8 ]. Shortened list of benefits for Big Data Analytics in healthcare is presented in paper [ 3 ] and consists of: better performance, day-to-day guides, detection of diseases in early stages, making predictive analytics, cost effectiveness, Evidence Based Medicine and effectiveness in patient treatment.

Summarizing, healthcare big data represents a huge potential for the transformation of healthcare: improvement of patients’ results, prediction of outbreaks of epidemics, valuable insights, avoidance of preventable diseases, reduction of the cost of healthcare delivery and improvement of the quality of life in general [ 1 ]. Big Data also generates many challenges such as difficulties in data capture, data storage, data analysis and data visualization [ 15 ]. The main challenges are connected with the issues of: data structure (Big Data should be user-friendly, transparent, and menu-driven but it is fragmented, dispersed, rarely standardized and difficult to aggregate and analyze), security (data security, privacy and sensitivity of healthcare data, there are significant concerns related to confidentiality), data standardization (data is stored in formats that are not compatible with all applications and technologies), storage and transfers (especially costs associated with securing, storing, and transferring unstructured data), managerial skills, such as data governance, lack of appropriate analytical skills and problems with Real-Time Analytics (health care is to be able to utilize Big Data in real time) [ 4 , 34 , 41 ].

The research is based on a critical analysis of the literature, as well as the presentation of selected results of direct research on the use of Big Data Analytics in medical facilities in Poland.

Presented research results are part of a larger questionnaire form on Big Data Analytics. The direct research was based on an interview questionnaire which contained 100 questions with 5-point Likert scale (1—strongly disagree, 2—I rather disagree, 3—I do not agree, nor disagree, 4—I rather agree, 5—I definitely agree) and 4 metrics questions. The study was conducted in December 2018 on a sample of 217 medical facilities (110 private, 107 public). The research was conducted by a specialized market research agency: Center for Research and Expertise of the University of Economics in Katowice.

When it comes to direct research, the selected entities included entities financed from public sources—the National Health Fund (23.5%), and entities operating commercially (11.5%). In the surveyed group of entities, more than a half (64.9%) are hybrid financed, both from public and commercial sources. The diversity of the research sample also applies to the size of the entities, defined by the number of employees. Taking into account proportions of the surveyed entities, it should be noted that in the sector structure, medium-sized (10–50 employees—34% of the sample) and large (51–250 employees—27%) entities dominate. The research was of all-Poland nature, and the entities included in the research sample come from all of the voivodships. The largest group were entities from Łódzkie (32%), Śląskie (18%) and Mazowieckie (18%) voivodships, as these voivodships have the largest number of medical institutions. Other regions of the country were represented by single units. The selection of the research sample was random—layered. As part of medical facilities database, groups of private and public medical facilities have been identified and the ones to which the questionnaire was targeted were drawn from each of these groups. The analyses were performed using the GNU PSPP 0.10.2 software.

The aim of the study was to determine whether medical facilities in Poland use Big Data Analytics and if so, in which areas. Characteristics of the research sample is presented in Table 2 .

The research is non-exhaustive due to the incomplete and uneven regional distribution of the samples, overrepresented in three voivodeships (Łódzkie, Mazowieckie and Śląskie). The size of the research sample (217 entities) allows the authors of the paper to formulate specific conclusions on the use of Big Data in the process of its management.

For the purpose of this paper, the following research hypotheses were formulated: (1) medical facilities in Poland are working on both structured and unstructured data (2) medical facilities in Poland are moving towards data-based healthcare and its benefits.

The paper poses the following research questions and statements that coincide with the selected questions from the research questionnaire:

From what sources do medical facilities obtain data? What types of data are used by the particular organization, whether structured or unstructured, and to what extent?

From what sources do medical facilities obtain data?

In which area organizations are using data and analytical systems (clinical or business)?

Is data analytics performed based on historical data or are predictive analyses also performed?

Determining whether administrative and medical staff receive complete, accurate and reliable data in a timely manner?

Determining whether real-time analyses are performed to support the particular organization’s activities.

Results and discussion

On the basis of the literature analysis and research study, a set of questions and statements related to the researched area was formulated. The results from the surveys show that medical facilities use a variety of data sources in their operations. These sources are both structured and unstructured data (Table 3 ).

According to the data provided by the respondents, considering the first statement made in the questionnaire, almost half of the medical institutions (47.58%) agreed that they rather collect and use structured data (e.g. databases and data warehouses, reports to external entities) and 10.57% entirely agree with this statement. As much as 23.35% of representatives of medical institutions stated “I agree or disagree”. Other medical facilities do not collect and use structured data (7.93%) and 6.17% strongly disagree with the first statement. Also, the median calculated based on the obtained results (median: 4), proves that medical facilities in Poland collect and use structured data (Table 4 ).

In turn, 28.19% of the medical institutions agreed that they rather collect and use unstructured data and as much as 9.25% entirely agree with this statement. The number of representatives of medical institutions that stated “I agree or disagree” was 27.31%. Other medical facilities do not collect and use structured data (17.18%) and 13.66% strongly disagree with the first statement. In the case of unstructured data the median is 3, which means that the collection and use of this type of data by medical facilities in Poland is lower.

In the further part of the analysis, it was checked whether the size of the medical facility and form of ownership have an impact on whether it analyzes unstructured data (Tables 4 and 5 ). In order to find this out, correlation coefficients were calculated.

Based on the calculations, it can be concluded that there is a small statistically monotonic correlation between the size of the medical facility and its collection and use of structured data (p < 0.001; τ = 0.16). This means that the use of structured data is slightly increasing in larger medical facilities. The size of the medical facility is more important according to use of unstructured data (p < 0.001; τ = 0.23) (Table 4 .).

To determine whether the form of medical facility ownership affects data collection, the Mann–Whitney U test was used. The calculations show that the form of ownership does not affect what data the organization collects and uses (Table 5 ).

Detailed information on the sources of from which medical facilities collect and use data is presented in the Table 6 .

The questionnaire results show that medical facilities are especially using information published in databases, reports to external units and transaction data, but they also use unstructured data from e-mails, medical devices, sensors, phone calls, audio and video data (Table 6 ). Data from social media, RFID and geolocation data are used to a small extent. Similar findings are concluded in the literature studies.

From the analysis of the answers given by the respondents, more than half of the medical facilities have integrated hospital system (HIS) implemented. As much as 43.61% use integrated hospital system and 16.30% use it extensively (Table 7 ). 19.38% of exanimated medical facilities do not use it at all. Moreover, most of the examined medical facilities (34.80% use it, 32.16% use extensively) conduct medical documentation in an electronic form, which gives an opportunity to use data analytics. Only 4.85% of medical facilities don’t use it at all.

Other problems that needed to be investigated were: whether medical facilities in Poland use data analytics? If so, in what form and in what areas? (Table 8 ). The analysis of answers given by the respondents about the potential of data analytics in medical facilities shows that a similar number of medical facilities use data analytics in administration and business (31.72% agreed with the statement no. 5 and 12.33% strongly agreed) as in the clinical area (33.04% agreed with the statement no. 6 and 12.33% strongly agreed). When considering decision-making issues, 35.24% agree with the statement "the organization uses data and analytical systems to support business decisions” and 8.37% of respondents strongly agree. Almost 40.09% agree with the statement that “the organization uses data and analytical systems to support clinical decisions (in the field of diagnostics and therapy)” and 15.42% of respondents strongly agree. Exanimated medical facilities use in their activity analytics based both on historical data (33.48% agree with statement 7 and 12.78% strongly agree) and predictive analytics (33.04% agrees with the statement number 8 and 15.86% strongly agree). Detailed results are presented in Table 8 .

Medical facilities focus on development in the field of data processing, as they confirm that they conduct analytical planning processes systematically and analyze new opportunities for strategic use of analytics in business and clinical activities (38.33% rather agree and 10.57% strongly agree with this statement). The situation is different with real-time data analysis, here, the situation is not so optimistic. Only 28.19% rather agree and 14.10% strongly agree with the statement that real-time analyses are performed to support an organization’s activities.

When considering whether a facility’s performance in the clinical area depends on the form of ownership, it can be concluded that taking the average and the Mann–Whitney U test depends. A higher degree of use of analyses in the clinical area can be observed in public institutions.

Whether a medical facility performs a descriptive or predictive analysis do not depend on the form of ownership (p > 0.05). It can be concluded that when analyzing the mean and median, they are higher in public facilities, than in private ones. What is more, the Mann–Whitney U test shows that these variables are dependent from each other (p < 0.05) (Table 9 ).

When considering whether a facility’s performance in the clinical area depends on its size, it can be concluded that taking the Kendall’s Tau (τ) it depends (p < 0.001; τ = 0.22), and the correlation is weak but statistically important. This means that the use of data and analytical systems to support clinical decisions (in the field of diagnostics and therapy) increases with the increase of size of the medical facility. A similar relationship, but even less powerful, can be found in the use of descriptive and predictive analyses (Table 10 ).

Considering the results of research in the area of analytical maturity of medical facilities, 8.81% of medical facilities stated that they are at the first level of maturity, i.e. an organization has developed analytical skills and does not perform analyses. As much as 13.66% of medical facilities confirmed that they have poor analytical skills, while 38.33% of the medical facility has located itself at level 3, meaning that “there is a lot to do in analytics”. On the other hand, 28.19% believe that analytical capabilities are well developed and 6.61% stated that analytics are at the highest level and the analytical capabilities are very well developed. Detailed data is presented in Table 11 . Average amounts to 3.11 and Median to 3.

The results of the research have enabled the formulation of following conclusions. Medical facilities in Poland are working on both structured and unstructured data. This data comes from databases, transactions, unstructured content of emails and documents, devices and sensors. However, the use of data from social media is smaller. In their activity, they reach for analytics in the administrative and business, as well as in the clinical area. Also, the decisions made are largely data-driven.

In summary, analysis of the literature that the benefits that medical facilities can get using Big Data Analytics in their activities relate primarily to patients, physicians and medical facilities. It can be confirmed that: patients will be better informed, will receive treatments that will work for them, will have prescribed medications that work for them and not be given unnecessary medications [ 78 ]. Physician roles will likely change to more of a consultant than decision maker. They will advise, warn, and help individual patients and have more time to form positive and lasting relationships with their patients in order to help people. Medical facilities will see changes as well, for example in fewer unnecessary hospitalizations, resulting initially in less revenue, but after the market adjusts, also the accomplishment [ 78 ]. The use of Big Data Analytics can literally revolutionize the way healthcare is practiced for better health and disease reduction.

The analysis of the latest data reveals that data analytics increase the accuracy of diagnoses. Physicians can use predictive algorithms to help them make more accurate diagnoses [ 45 ]. Moreover, it could be helpful in preventive medicine and public health because with early intervention, many diseases can be prevented or ameliorated [ 29 ]. Predictive analytics also allows to identify risk factors for a given patient, and with this knowledge patients will be able to change their lives what, in turn, may contribute to the fact that population disease patterns may dramatically change, resulting in savings in medical costs. Moreover, personalized medicine is the best solution for an individual patient seeking treatment. It can help doctors decide the exact treatments for those individuals. Better diagnoses and more targeted treatments will naturally lead to increases in good outcomes and fewer resources used, including doctors’ time.

The quantitative analysis of the research carried out and presented in this article made it possible to determine whether medical facilities in Poland use Big Data Analytics and if so, in which areas. Thanks to the results obtained it was possible to formulate the following conclusions. Medical facilities are working on both structured and unstructured data, which comes from databases, transactions, unstructured content of emails and documents, devices and sensors. According to analytics, they reach for analytics in the administrative and business, as well as in the clinical area. It clearly showed that the decisions made are largely data-driven. The results of the study confirm what has been analyzed in the literature. Medical facilities are moving towards data-based healthcare and its benefits.

In conclusion, Big Data Analytics has the potential for positive impact and global implications in healthcare. Future research on the use of Big Data in medical facilities will concern the definition of strategies adopted by medical facilities to promote and implement such solutions, as well as the benefits they gain from the use of Big Data analysis and how the perspectives in this area are seen.

Practical implications

This work sought to narrow the gap that exists in analyzing the possibility of using Big Data Analytics in healthcare. Showing how medical facilities in Poland are doing in this respect is an element that is part of global research carried out in this area, including [ 29 , 32 , 60 ].

Limitations and future directions

The research described in this article does not fully exhaust the questions related to the use of Big Data Analytics in Polish healthcare facilities. Only some of the dimensions characterizing the use of data by medical facilities in Poland have been examined. In order to get the full picture, it would be necessary to examine the results of using structured and unstructured data analytics in healthcare. Future research may examine the benefits that medical institutions achieve as a result of the analysis of structured and unstructured data in the clinical and management areas and what limitations they encounter in these areas. For this purpose, it is planned to conduct in-depth interviews with chosen medical facilities in Poland. These facilities could give additional data for empirical analyses based more on their suggestions. Further research should also include medical institutions from beyond the borders of Poland, enabling international comparative analyses.

Future research in the healthcare field has virtually endless possibilities. These regard the use of Big Data Analytics to diagnose specific conditions [ 47 , 66 , 69 , 76 ], propose an approach that can be used in other healthcare applications and create mechanisms to identify “patients like me” [ 75 , 80 ]. Big Data Analytics could also be used for studies related to the spread of pandemics, the efficacy of covid treatment [ 18 , 79 ], or psychology and psychiatry studies, e.g. emotion recognition [ 35 ].

Availability of data and materials

The datasets for this study are available on request to the corresponding author.

Abouelmehdi K, Beni-Hessane A, Khaloufi H. Big healthcare data: preserving security and privacy. J Big Data. 2018. https://doi.org/10.1186/s40537-017-0110-7 .

Article   Google Scholar  

Agrawal A, Choudhary A. Health services data: big data analytics for deriving predictive healthcare insights. Health Serv Eval. 2019. https://doi.org/10.1007/978-1-4899-7673-4_2-1 .

Al Mayahi S, Al-Badi A, Tarhini A. Exploring the potential benefits of big data analytics in providing smart healthcare. In: Miraz MH, Excell P, Ware A, Ali M, Soomro S, editors. Emerging technologies in computing—first international conference, iCETiC 2018, proceedings (Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST). Cham: Springer; 2018. p. 247–58. https://doi.org/10.1007/978-3-319-95450-9_21 .

Bainbridge M. Big data challenges for clinical and precision medicine. In: Househ M, Kushniruk A, Borycki E, editors. Big data, big challenges: a healthcare perspective: background, issues, solutions and research directions. Cham: Springer; 2019. p. 17–31.

Google Scholar  

Bartuś K, Batko K, Lorek P. Business intelligence systems: barriers during implementation. In: Jabłoński M, editor. Strategic performance management new concept and contemporary trends. New York: Nova Science Publishers; 2017. p. 299–327. ISBN: 978-1-53612-681-5.

Bartuś K, Batko K, Lorek P. Diagnoza wykorzystania big data w organizacjach-wybrane wyniki badań. Informatyka Ekonomiczna. 2017;3(45):9–20.

Bartuś K, Batko K, Lorek P. Wykorzystanie rozwiązań business intelligence, competitive intelligence i big data w przedsiębiorstwach województwa śląskiego. Przegląd Organizacji. 2018;2:33–9.

Batko K. Możliwości wykorzystania Big Data w ochronie zdrowia. Roczniki Kolegium Analiz Ekonomicznych. 2016;42:267–82.

Bi Z, Cochran D. Big data analytics with applications. J Manag Anal. 2014;1(4):249–65. https://doi.org/10.1080/23270012.2014.992985 .

Boerma T, Requejo J, Victora CG, Amouzou A, Asha G, Agyepong I, Borghi J. Countdown to 2030: tracking progress towards universal coverage for reproductive, maternal, newborn, and child health. Lancet. 2018;391(10129):1538–48.

Bollier D, Firestone CM. The promise and peril of big data. Washington, D.C: Aspen Institute, Communications and Society Program; 2010. p. 1–66.

Bose R. Competitive intelligence process and tools for intelligence analysis. Ind Manag Data Syst. 2008;108(4):510–28.

Carter P. Big data analytics: future architectures, skills and roadmaps for the CIO: in white paper, IDC sponsored by SAS. 2011. p. 1–16.

Castro EM, Van Regenmortel T, Vanhaecht K, Sermeus W, Van Hecke A. Patient empowerment, patient participation and patient-centeredness in hospital care: a concept analysis based on a literature review. Patient Educ Couns. 2016;99(12):1923–39.

Chen H, Chiang RH, Storey VC. Business intelligence and analytics: from big data to big impact. MIS Q. 2012;36(4):1165–88.

Chen CP, Zhang CY. Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf Sci. 2014;275:314–47.

Chomiak-Orsa I, Mrozek B. Główne perspektywy wykorzystania big data w mediach społecznościowych. Informatyka Ekonomiczna. 2017;3(45):44–54.

Corsi A, de Souza FF, Pagani RN, et al. Big data analytics as a tool for fighting pandemics: a systematic review of literature. J Ambient Intell Hum Comput. 2021;12:9163–80. https://doi.org/10.1007/s12652-020-02617-4 .

Davenport TH, Harris JG. Competing on analytics, the new science of winning. Boston: Harvard Business School Publishing Corporation; 2007.

Davenport TH. Big data at work: dispelling the myths, uncovering the opportunities. Boston: Harvard Business School Publishing; 2014.

De Cnudde S, Martens D. Loyal to your city? A data mining analysis of a public service loyalty program. Decis Support Syst. 2015;73:74–84.

Erickson S, Rothberg H. Data, information, and intelligence. In: Rodriguez E, editor. The analytics process. Boca Raton: Auerbach Publications; 2017. p. 111–26.

Fang H, Zhang Z, Wang CJ, Daneshmand M, Wang C, Wang H. A survey of big data research. IEEE Netw. 2015;29(5):6–9.

Fredriksson C. Organizational knowledge creation with big data. A case study of the concept and practical use of big data in a local government context. 2016. https://www.abo.fi/fakultet/media/22103/fredriksson.pdf .

Gandomi A, Haider M. Beyond the hype: big data concepts, methods, and analytics. Int J Inf Manag. 2015;35(2):137–44.

Groves P, Kayyali B, Knott D, Van Kuiken S. The ‘big data’ revolution in healthcare. Accelerating value and innovation. 2015. http://www.pharmatalents.es/assets/files/Big_Data_Revolution.pdf (Reading: 10.04.2019).

Gupta V, Rathmore N. Deriving business intelligence from unstructured data. Int J Inf Comput Technol. 2013;3(9):971–6.

Gupta V, Singh VK, Ghose U, Mukhija P. A quantitative and text-based characterization of big data research. J Intell Fuzzy Syst. 2019;36:4659–75.

Hampel HOBS, O’Bryant SE, Castrillo JI, Ritchie C, Rojkova K, Broich K, Escott-Price V. PRECISION MEDICINE-the golden gate for detection, treatment and prevention of Alzheimer’s disease. J Prev Alzheimer’s Dis. 2016;3(4):243.

Harerimana GB, Jang J, Kim W, Park HK. Health big data analytics: a technology survey. IEEE Access. 2018;6:65661–78. https://doi.org/10.1109/ACCESS.2018.2878254 .

Hu H, Wen Y, Chua TS, Li X. Toward scalable systems for big data analytics: a technology tutorial. IEEE Access. 2014;2:652–87.

Hussain S, Hussain M, Afzal M, Hussain J, Bang J, Seung H, Lee S. Semantic preservation of standardized healthcare documents in big data. Int J Med Inform. 2019;129:133–45. https://doi.org/10.1016/j.ijmedinf.2019.05.024 .

Islam MS, Hasan MM, Wang X, Germack H. A systematic review on healthcare analytics: application and theoretical perspective of data mining. In: Healthcare. Basel: Multidisciplinary Digital Publishing Institute; 2018. p. 54.

Ismail A, Shehab A, El-Henawy IM. Healthcare analysis in smart big data analytics: reviews, challenges and recommendations. In: Security in smart cities: models, applications, and challenges. Cham: Springer; 2019. p. 27–45.

Jain N, Gupta V, Shubham S, et al. Understanding cartoon emotion using integrated deep neural network on large dataset. Neural Comput Appl. 2021. https://doi.org/10.1007/s00521-021-06003-9 .

Janssen M, van der Voort H, Wahyudi A. Factors influencing big data decision-making quality. J Bus Res. 2017;70:338–45.

Jordan SR. Beneficence and the expert bureaucracy. Public Integr. 2014;16(4):375–94. https://doi.org/10.2753/PIN1099-9922160404 .

Knapp MM. Big data. J Electron Resourc Med Libr. 2013;10(4):215–22.

Koti MS, Alamma BH. Predictive analytics techniques using big data for healthcare databases. In: Smart intelligent computing and applications. New York: Springer; 2019. p. 679–86.

Krumholz HM. Big data and new knowledge in medicine: the thinking, training, and tools needed for a learning health system. Health Aff. 2014;33(7):1163–70.

Kruse CS, Goswamy R, Raval YJ, Marawi S. Challenges and opportunities of big data in healthcare: a systematic review. JMIR Med Inform. 2016;4(4):e38.

Kyoungyoung J, Gang HK. Potentiality of big data in the medical sector: focus on how to reshape the healthcare system. Healthc Inform Res. 2013;19(2):79–85.

Laney D. Application delivery strategies 2011. http://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf .

Lee IK, Wang CC, Lin MC, Kung CT, Lan KC, Lee CT. Effective strategies to prevent coronavirus disease-2019 (COVID-19) outbreak in hospital. J Hosp Infect. 2020;105(1):102.

Lerner I, Veil R, Nguyen DP, Luu VP, Jantzen R. Revolution in health care: how will data science impact doctor-patient relationships? Front Public Health. 2018;6:99.

Lytras MD, Papadopoulou P, editors. Applying big data analytics in bioinformatics and medicine. IGI Global: Hershey; 2017.

Ma K, et al. Big data in multiple sclerosis: development of a web-based longitudinal study viewer in an imaging informatics-based eFolder system for complex data analysis and management. In: Proceedings volume 9418, medical imaging 2015: PACS and imaging informatics: next generation and innovations. 2015. p. 941809. https://doi.org/10.1117/12.2082650 .

Mach-Król M. Analiza i strategia big data w organizacjach. In: Studia i Materiały Polskiego Stowarzyszenia Zarządzania Wiedzą. 2015;74:43–55.

Madsen LB. Data-driven healthcare: how analytics and BI are transforming the industry. Hoboken: Wiley; 2014.

Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Hung BA. Big data: the next frontier for innovation, competition, and productivity. Washington: McKinsey Global Institute; 2011.

Marconi K, Dobra M, Thompson C. The use of big data in healthcare. In: Liebowitz J, editor. Big data and business analytics. Boca Raton: CRC Press; 2012. p. 229–48.

Mehta N, Pandit A. Concurrence of big data analytics and healthcare: a systematic review. Int J Med Inform. 2018;114:57–65.

Michel M, Lupton D. Toward a manifesto for the ‘public understanding of big data.’ Public Underst Sci. 2016;25(1):104–16. https://doi.org/10.1177/0963662515609005 .

Mikalef P, Krogstie J. Big data analytics as an enabler of process innovation capabilities: a configurational approach. In: International conference on business process management. Cham: Springer; 2018. p. 426–41.

Mohammadi M, Al-Fuqaha A, Sorour S, Guizani M. Deep learning for IoT big data and streaming analytics: a survey. IEEE Commun Surv Tutor. 2018;20(4):2923–60.

Nambiar R, Bhardwaj R, Sethi A, Vargheese R. A look at challenges and opportunities of big data analytics in healthcare. In: 2013 IEEE international conference on big data; 2013. p. 17–22.

Ohlhorst F. Big data analytics: turning big data into big money, vol. 65. Hoboken: Wiley; 2012.

Olszak C, Mach-Król M. A conceptual framework for assessing an organization’s readiness to adopt big data. Sustainability. 2018;10(10):3734.

Olszak CM. Toward better understanding and use of business intelligence in organizations. Inf Syst Manag. 2016;33(2):105–23.

Palanisamy V, Thirunavukarasu R. Implications of big data analytics in developing healthcare frameworks—a review. J King Saud Univ Comput Inf Sci. 2017;31(4):415–25.

Provost F, Fawcett T. Data science and its relationship to big data and data-driven decisionmaking. Big Data. 2013;1(1):51–9.

Raghupathi W, Raghupathi V. An overview of health analytics. J Health Med Inform. 2013;4:132. https://doi.org/10.4172/2157-7420.1000132 .

Raghupathi W, Raghupathi V. Big data analytics in healthcare: promise and potential. Health Inf Sci Syst. 2014;2(1):3.

Ratia M, Myllärniemi J. Beyond IC 4.0: the future potential of BI-tool utilization in the private healthcare, conference: proceedings IFKAD, 2018 at: Delft, The Netherlands.

Ristevski B, Chen M. Big data analytics in medicine and healthcare. J Integr Bioinform. 2018. https://doi.org/10.1515/jib-2017-0030 .

Rumsfeld JS, Joynt KE, Maddox TM. Big data analytics to improve cardiovascular care: promise and challenges. Nat Rev Cardiol. 2016;13(6):350–9. https://doi.org/10.1038/nrcardio.2016.42 .

Schmarzo B. Big data: understanding how data powers big business. Indianapolis: Wiley; 2013.

Senthilkumar SA, Rai BK, Meshram AA, Gunasekaran A, Chandrakumarmangalam S. Big data in healthcare management: a review of literature. Am J Theor Appl Bus. 2018;4:57–69.

Shubham S, Jain N, Gupta V, et al. Identify glomeruli in human kidney tissue images using a deep learning approach. Soft Comput. 2021. https://doi.org/10.1007/s00500-021-06143-z .

Thuemmler C. The case for health 4.0. In: Thuemmler C, Bai C, editors. Health 4.0: how virtualization and big data are revolutionizing healthcare. New York: Springer; 2017.

Tsai CW, Lai CF, Chao HC, et al. Big data analytics: a survey. J Big Data. 2015;2:21. https://doi.org/10.1186/s40537-015-0030-3 .

Wamba SF, Gunasekaran A, Akter S, Ji-fan RS, Dubey R, Childe SJ. Big data analytics and firm performance: effects of dynamic capabilities. J Bus Res. 2017;70:356–65.

Wang Y, Byrd TA. Business analytics-enabled decision-making effectiveness through knowledge absorptive capacity in health care. J Knowl Manag. 2017;21(3):517–39.

Wang Y, Kung L, Wang W, Yu C, Cegielski CG. An integrated big data analytics-enabled transformation model: application to healthcare. Inf Manag. 2018;55(1):64–79.

Wicks P, et al. Scaling PatientsLikeMe via a “generalized platform” for members with chronic illness: web-based survey study of benefits arising. J Med Internet Res. 2018;20(5):e175.

Willems SM, et al. The potential use of big data in oncology. Oral Oncol. 2019;98:8–12. https://doi.org/10.1016/j.oraloncology.2019.09.003 .

Williams N, Ferdinand NP, Croft R. Project management maturity in the age of big data. Int J Manag Proj Bus. 2014;7(2):311–7.

Winters-Miner LA. Seven ways predictive analytics can improve healthcare. Medical predictive analytics have the potential to revolutionize healthcare around the world. 2014. https://www.elsevier.com/connect/seven-ways-predictive-analytics-can-improve-healthcare (Reading: 15.04.2019).

Wu J, et al. Application of big data technology for COVID-19 prevention and control in China: lessons and recommendations. J Med Internet Res. 2020;22(10): e21980.

Yan L, Peng J, Tan Y. Network dynamics: how can we find patients like us? Inf Syst Res. 2015;26(3):496–512.

Yang JJ, Li J, Mulder J, Wang Y, Chen S, Wu H, Pan H. Emerging information technologies for enhanced healthcare. Comput Ind. 2015;69:3–11.

Zhang Q, Yang LT, Chen Z, Li P. A survey on deep learning for big data. Inf Fusion. 2018;42:146–57.

Download references

Acknowledgements

We would like to thank those who have touched our science paths.

This research was fully funded as statutory activity—subsidy of Ministry of Science and Higher Education granted for Technical University of Czestochowa on maintaining research potential in 2018. Research Number: BS/PB–622/3020/2014/P. Publication fee for the paper was financed by the University of Economics in Katowice.

Author information

Authors and affiliations.

Department of Business Informatics, University of Economics in Katowice, Katowice, Poland

Kornelia Batko

Department of Biomedical Processes and Systems, Institute of Health and Nutrition Sciences, Częstochowa University of Technology, Częstochowa, Poland

Andrzej Ślęzak

You can also search for this author in PubMed   Google Scholar

Contributions

KB proposed the concept of research and its design. The manuscript was prepared by KB with the consultation of AŚ. AŚ reviewed the manuscript for getting its fine shape. KB prepared the manuscript in the contexts such as definition of intellectual content, literature search, data acquisition, data analysis, and so on. AŚ obtained research funding. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Kornelia Batko .

Ethics declarations

Ethics approval and consent to participate.

Not applicable.

Consent for publication

Competing interests.

The author declares no conflict of interest.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Batko, K., Ślęzak, A. The use of Big Data Analytics in healthcare. J Big Data 9 , 3 (2022). https://doi.org/10.1186/s40537-021-00553-4

Download citation

Received : 28 August 2021

Accepted : 19 December 2021

Published : 06 January 2022

DOI : https://doi.org/10.1186/s40537-021-00553-4

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Big Data Analytics
  • Data-driven healthcare

big data thesis paper

THE IMPLICATIONS OF BIG DATA ANALYTICS ON BUSINESS INTELLIGENCE: A QUALITATIVE STUDY

Sudhir Allam. (2021). THE IMPLICATIONS OF BIG DATA ANALYTICS ON BUSINESS INTELLIGENCE: A QUALITATIVE STUDY. International Journal of Innovations in Engineering Research and Technology, 8(2), 76–81

6 Pages Posted: 28 May 2021

Sudhir Allam

Sr. Data Scientist

Date Written: February 20, 2021

This paper is a qualitative study on how big data analytics has helped shaped business intelligence and drive the success of many companies. The findings reveal that big data analytics has facilitated the extraction of information from the vast amounts of data generated by businesses every second. When combined with the massive amounts of external data accumulated daily – whether it's consumer reviews and experience, competitor knowledge, seasonal purchasing patterns, or something else – companies will amass a massive amount of data [1]. Big data applications collect data from a range of sources in real-time, independent of their formats or structures, while BI systems extract extensive data from pre-defined sources to expand them into insights. Networking sites have ushered in a transition and compelled a fundamental change in business operations around the world. As a consequence, large amounts of data have been collected from many social networking outlets, necessitating the utilization of this data for market intelligence purposes. Despite its relevance, no study has been done on the implications of using Big Data analytics in business intelligence [1]. This research addresses this information gap by exploring the role and implications of Big Data analytics on business intelligence with data obtained from social networking platforms such as Facebook. Since the thesis is qualitative, it uses a qualitative method for data analysis by examining the case of Facebook. The findings would have a huge effect on both theory and practice when it comes to developing plans and methods to maximize the benefits of social networking platforms for market gain.

Keywords: Big data, Business intelligence, Data analytics, Facebook, Social media analytics

Suggested Citation: Suggested Citation

Sudhir Allam (Contact Author)

Sr. data scientist ( email ), do you have a job opening that you would like to promote on ssrn, paper statistics, related ejournals, data science, data analytics & informatics ejournal.

Subscribe to this fee journal for more curated articles on this topic

Big Data, Artificial Intelligence, and Financial Economics

The proliferation of large unstructured datasets along with advances in artificial intelligence (AI) technology provides researchers in financial economics with new opportunities for data analysis, and it also changes the set of subjects these researchers are studying. As AI becomes increasingly important in making decisions using financial market data, it becomes crucial to study how AI interacts with both data resources and with human decisionmakers. 

To promote research on emerging issues related to the methodology, applications, and socioeconomic implications of the growing availability of large datasets and AI tools, the National Bureau of Economic Research (NBER), with the generous support of the Office of Financial Research (OFR) and in collaboration with the Review of Financial Studies (RFS) , will convene a research conference on December 13, 2024. The program will be organized by RFS  Executive Editor Tarun Ramadorai of Imperial College London, and NBER Research Associates Itay Goldstein of the University of Pennsylvania, Chester Spatt of Carnegie Mellon University, and Mao Ye of Cornell University.

The organizers will consider submissions on topics including, but not limited to: 

 •  Unstructured Data Analysis and AI: The impact on financial markets of the growing use of AI technology to analyze unstructured data, such as text, images, audio, and video.

 •  Trading and AI: The impact of using AI in high-frequency trading, algorithmic trading, and the impacts of this use on financial markets.

 •  Big Data and AI in Investment: The rise of machines in asset management, particularly the growing analysis of high-dimensional datasets using machine learning techniques.

 •  Big Data and AI in Corporate Decisions: The impact of AI as well as other means of analyzing unstructured datasets and automating decision-making on corporate decisions, such as capital budgeting, working capital management, and regulatory compliance and reporting.

 •  Financial Institutions and Financial Intermediation: The impact of AI, fintech, and the analysis of large datasets on traditional financial institutions.

 •  AI and Regulation: The role of AI in detecting improper market conduct, the regulation of algorithms and winner-take-all markets, and strategies for ensuring accountability, fairness and transparency in AI models.

The organizers welcome submissions of both empirical and theoretical research papers and encourage submissions from scholars who are early in their careers, who are not NBER affiliates, and who are from under-represented groups in the financial economics profession.  Papers that are submitted for presentation at the conference may also be submitted to the RFS  under its dual review system at no extra cost. Papers that are rejected at any stage of this process are not considered to have been “rejected” at the RFS .  Authors may submit a future version of the same paper to the RFS , even if the paper is not selected for presentation at the conference. For a paper to be considered under the dual submission option, it may not be under review or invited revision at any journal, including the RFS, until the author has been notified of the outcome of the dual submission process. The details of the dual submission program may be found at http://sfs.org/dualsubmissionpolicy/. To be considered for inclusion on the program, papers must be uploaded by 11:59 pm EDT on Thursday, September 12, 2024 to one of the following sites:

For submissions to both the conference and the Review of Financial Studies

For submissions to the conference alone  

Please do not submit papers that have been accepted for publication or that will be published before the conference. Authors chosen to present papers will be notified in October, 2024. All presenters are expected to attend the meeting in person. The NBER will cover the travel and lodging cost of up to two presenters per paper. 

Questions about this conference may be addressed to  [email protected] .

Research on Data Science, Data Analytics and Big Data

  • SSRN Electronic Journal 9(5):99-105
  • 9(5):99-105
  • This person is not on ResearchGate, or hasn't claimed this research yet.

Discover the world's research

  • 25+ million members
  • 160+ million publication pages
  • 2.3+ billion citations

Umesh Rijal

  • Yahye Abukar Ahmed

Shirsendu Kar

  • Alavudeen Basha
  • Azath Mubarak Ali

Parthasarathy Panchatcharam

  • Muhammed Shuja

Anurag Sinha

  • Sharmistha Roy
  • Muneeb Hashmi
  • Emmanuel Adewuyi Ojo
  • Adeola O. Opesade
  • Vasudev Trivedi

Michel Fathi

  • INT J HUM-COMPUT INT

Cheng Xu

  • SOFT COMPUT

Alavudeen Basha Ahath

  • Mohammed Shujauddin
  • S Dheva Rajan
  • Sivajee Vavilapalli
  • Shahriar Hasan

Iskandar Muda

  • Recruit researchers
  • Join for free
  • Login Email Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google Welcome back! Please log in. Email · Hint Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google No account? Sign up

IMAGES

  1. Big Data Thesis Topics

    big data thesis paper

  2. Solved WQD7007 Big Data Management Essay Write a structured

    big data thesis paper

  3. Big data thesis introduction

    big data thesis paper

  4. Big Data Short Paper 1 .docx

    big data thesis paper

  5. Big data notes 1652666718

    big data thesis paper

  6. Tips for writing best big data thesis

    big data thesis paper

COMMENTS

  1. Big data analytics in healthcare: a systematic literature review

    2.1. Characteristics of big data. The concept of BDA overarches several data-intensive approaches to the analysis and synthesis of large-scale data (Galetsi, Katsaliaki, and Kumar Citation 2020; Mergel, Rethemeyer, and Isett Citation 2016).Such large-scale data derived from information exchange among different systems is often termed 'big data' (Bahri et al. Citation 2018; Khanra, Dhir ...

  2. Big data analytics capabilities: Patchwork or progress? A systematic

    This paper presents a systematic literature review of the research field on big data analytics capabilities (BDACs). With the emergence of big data and digital transformation, a growing number of researchers have highlighted the need for organizations to develop BDACs. ... "Big data analytics capability is defined as the ability to acquire ...

  3. Explore Big Data Analytics Applications and Opportunities: A Review

    This paper reviews the literature on big data applications and analytics in four industries: healthcare, education, transportation, and banking. It compares the pre and peri-COVID-19 scenarios and discusses the effectiveness of data analytics models in the pandemic era.

  4. 214 Big Data Research Topics: Interesting Ideas To Try

    These 15 topics will help you to dive into interesting research. You may even build on research done by other scholars. Evaluate the data mining process. The influence of the various dimension reduction methods and techniques. The best data classification methods. The simple linear regression modeling methods.

  5. Big Data Analytics: A Literature Review Paper

    MSc Thesis, German University in Cairo, p. 164 ... and value by using big data analytics. This paper aims to research how big data analytics can be integrated into the decision making process ...

  6. PDF MASTER THESIS

    MASTER THESIS Big data and Sentiment Analysis considering reviews from e-commerce platforms to predict consumer behavior MSc IN BUSINESS RESEARCH University of Barcelona ... The first documented time the term BD appeared was in 1980 in a working paper from Charles Tilly that finally became a published article in 1984 titled 'The Old New Social

  7. PDF The Evolution of Big Data and Its Business Applications

    The Evolution of Big Data and Its Business Applications. Doctor of Philosophy (Information Science), May 2018, 113 pp., 12 tables, 10 figures, references, 203 titles. The arrival of the Big Data era has become a major topic of discussion in many sectors because of the premises of big data utilizations and its impact on decisionmaking. It is an -

  8. PDF Big Data Analytics: A Literature Review Perspective

    thesis work, various big data analytical techniques and tools are discussed to allow analysis of the application of big data analytics in several different domains. ... as well as whether the paper is from a peer-reviewed journal, conference, or other respectable source. These criteria are thus not randomly dependent on time

  9. Big Data Analytics: A Literature Review Paper

    Abstract. In the information era, enormous amounts of data have become available on hand to decision makers. Big data refers to datasets that are not only big, but also high in variety and velocity, which makes them difficult to handle using traditional tools and techniques. Due to the rapid growth of such data, solutions need to be studied and ...

  10. Big Data Research

    Read the latest articles of Big Data Research at ScienceDirect.com, Elsevier's leading platform of peer-reviewed scholarly literature

  11. Systematic analysis of healthcare big data analytics for ...

    Big data has revolutionized the world by providing tremendous opportunities for a variety of applications. ... Lee et al. 11 presented a survey paper to explore the applications and challenges of ...

  12. PDF Investigating the Impact of Big Data Analytics on Supply Chain

    Thesis Title: Investigating the Impact of Big Data Analytics on Supply Chain Operations: Case Studies from the UK Private Sector A thesis submitted for the degree of Doctor of Philosophy By Ruaa Hasan Brunel Business School Brunel University London 2021 . 2 | P a g e

  13. Big data and performance measurement research: trends, evolution and

    This paper aims to review the big data and performance measurement studies to identify the publications' trends and future research opportunities.,The authors reviewed 873 documents on big data and performance carrying out an extensive bibliometric analysis using two main techniques, i.e. performance analysis and science mapping.,Results ...

  14. Big data analytics and firm performance: Findings from a mixed-method

    This paper draws on complexity theory and investigates the configurations of resources and contextual factors that lead to performance gains from big data analytics investments. Our empirical investigation followed a mixed methods approach using survey data from 175 chief information officers and IT managers working in Greek firms, and three ...

  15. The use of Big Data Analytics in healthcare

    The introduction of Big Data Analytics (BDA) in healthcare will allow to use new technologies both in treatment of patients and health management. The paper aims at analyzing the possibilities of using Big Data Analytics in healthcare. The research is based on a critical analysis of the literature, as well as the presentation of selected results of direct research on the use of Big Data ...

  16. Business analytics and big data research in information systems

    The "Business Analytics and Big Data" track as a melting pot for topics in information systems (IS) and neighbouring disciplines has a long and successful history at the European Conference on Information Systems (ECIS). From its initial year in 2012 to 2021, the track has received 512 submissions.

  17. The Implications of Big Data Analytics on Business Intelligence ...

    This paper is a qualitative study on how big data analytics has helped shaped business intelligence and drive the success of many companies. The findings reveal that big data analytics has facilitated the extraction of information from the vast amounts of data generated by businesses every second. ... Since the thesis is qualitative, it uses a ...

  18. (PDF) Big Data Semantics

    Big Data technology has discarded traditional data modeling approaches as no longer applicable to distributed data processing. ... In this paper, we analyze the data gathered from the longitudinal ...

  19. (PDF) The Role of AI, Machine Learning, and Big Data in Digital

    Therefore, this article emphasizes the role of big data and AI-ML in the creation of digital twins (DTs) or DT-based systems for various industrial applications, by highlighting the current state ...

  20. Big Data: Challenges, Opportunities and Realities

    Big Data: Challenges, Opportunities, and Realities. n Institute of Technology Delhi, IndiaABSTRACT With the advent of Internet of Things (IoT) and Web 2.0 technologies, there has been a. remendous growth in the amount of data generated. This chapter emphasizes on the need for big data, technological advancements, tools and techni.

  21. Top 20 Latest Research Problems in Big Data and Data Science

    Fig 1: 8V's of Big data Courtesy: Elena. Having understood the 8V's of big data, let us look into details of research problems to be addressed. General big data research topics [3] are in the lines of: Scalability — Scalable Architectures for parallel data processing; Real-time big data analytics — Stream data processing of text, image ...

  22. Big Data, Artificial Intelligence, and Financial Economics

    • Big Data and AI in Investment: The rise of machines in asset management, particularly the growing analysis of high-dimensional datasets using machine learning techniques. ... Authors may submit a future version of the same paper to the RFS, even if the paper is not selected for presentation at the conference.

  23. (PDF) Security and Privacy Issues of Big Data

    10. Supervising the Security Level in Real-Time. These security and privacy challenges cov er the entire spectrum of the Big D ata lifecycle (Figure 2): sources of data production (devices), the ...

  24. Research on Data Science, Data Analytics and Big Data

    BIG DATA: Big Data refers to a huge volume of data of various t ypes, i.e., st ructured, semi st ructured, and unstructured. This data is ge nerated through various digital channels such as mobile ...