KEMBAR78
Data Science and Applications | PDF | Bioinformatics | Data Science
0% found this document useful (0 votes)
95 views10 pages

Data Science and Applications

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
95 views10 pages

Data Science and Applications

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/373370003

Data Science and Applications

Article in Journal of Data Science and Intelligent Systems · August 2023


DOI: 10.47852/bonviewJDSIS3202837

CITATIONS READS

15 5,498

2 authors:

Narender Chinthamu Manideep Karukuri


MIT 4 PUBLICATIONS 38 CITATIONS
51 PUBLICATIONS 121 CITATIONS
SEE PROFILE
SEE PROFILE

All content following this page was uploaded by Narender Chinthamu on 25 August 2023.

The user has requested enhancement of the downloaded file.


Received: 9 March 2023 | Revised: 5 May 2023 | Accepted: 4 July 2023 | Published online: 11 July 2023

Journal of Data Science and Intelligent Systems


RESEARCH ARTICLE 2023, Vol. 00(00) 1–9
DOI: 10.47852/bonviewJDSIS3202837

Data Science and Applications

Narender Chinthamu1,* and Manideep Karukuri2


1
WESCO International, USA
2
Macquaire Group, USA

Abstract: This paper investigates the significance of data science as an indispensable instrument for decision-making across multiple
domains. The study examines the history, concepts, methods, and applications of data science, as well as its impact on numerous
industries, such as artificial intelligence, manufacturing, fintech, government, Astro-informatics, e-commerce, education, and
biotechnology. Enterprise resource planning (ERP) software was first developed by SAP in the 1960s, with modern ERP systems
emerging in the 1990s, according to the research. This paper highlights the significance of data science in enhancing the functionality of
ERP systems, with artificial intelligence-based solutions such as those offered by MahaaAi and other firms automating human tasks,
chat-based ERP applications, and virtual assistant support to avoid human efforts. The conclusion of the study emphasizes the significant
benefits of data science in the ERP industry, including self-service analytics, predictions, and prescriptive analysis.
Keywords: data science, big data, machine learning, artificial intelligence analytics, applications, healthcare

1. Introduction policymaking. Data science has aided scientists extract knowledge


from astronomical data in Astro-informatics. Data science has
Data science has become an indispensable instrument for improved the consumer experience and facilitated targeted
decision-making in a variety of industries and domains. Data marketing in e-commerce. In education and bioinformatics, data
science is an ever-evolving discipline, with new techniques and science has contributed to the personalization of education and
applications arising frequently. With the exponential development advancements in the comprehension of biological systems.
of available data, data science has become an indispensable This paper concludes by emphasizing the significance of data
discipline that enables organizations to extract valuable insights science as an indispensable instrument for decision-making across
from data and make more informed decisions. multiple domains. As data volume and complexity continue to
This paper examines in depth the history, concepts, methods, and increase, the demand for data scientists will continue to rise
applications of data science. We investigate the application of data (Davenport & Patil, 2012). Data science has the potential to
science in various industries, including artificial intelligence (AI), transform how we live, work, and make decisions, as well as
manufacturing, fintech, government, Astro-informatics, e-commerce, resolve some of society’s most critical problems.
education, and biotechnology. We investigate how data science has
transformed decision-making processes in these domains.
This paper begins with a concise historical overview of data 1.1. History of data science
science, emphasizing the field’s development and the emergence
of new techniques and tools. The course then explores the Data science has a long history, beginning with John Graunt’s
concepts and methods of data science, including data acquisition, statistical analysis of mortality rates in the 17th century, followed
cleansing, transformation, analysis, and visualization. We also by Pierre-Simon Laplace’s use of probability theory to make
investigate the various data types and their relevance to data science. predictions in the 18th century. The 19th century witnessed the
The applications of data science in various industries are then emergence of statistics as a discipline, with Francis Galton and
examined. In AI, data science has facilitated the creation of Karl Pearson making significant contributions. Ronald Fisher
machine learning (ML) algorithms that can learn from data and introduced experimental design, a method for conducting controlled
progress over time. By identifying patterns and anomalies in data, experiments and analyzing data, in the early 20th century. Due to
data science has increased efficiency and decreased costs in the the pervasive use of computers during the middle of the 20th
manufacturing industry. According to Kumar (2021), data science century, automated and sophisticated data analysis rose to prominence.
has revolutionized the delivery of financial services in fintech The history of data dates to 19,000 B.C., when basic calculations
by facilitating personalized services and fraud detection. The were performed using primitive instruments. John Graunt revolutionized
government has utilized data science to enhance public services and the comprehension of health patterns through mortality statistics in the
1640s. In the 1880s, Herman Hollerith’s punch card system expedited
*
Corresponding author: Narender Chinthamu, WESCO International, USA. data processing, whereas Fritz Pfleumer’s magnetic tape in 1928
Email: Narender.chinthamu@gmail.com set the groundwork for data storage. Edgar Codd introduced the

© The Author(s) 2023. Published by BON VIEW PUBLISHING PTE. LTD. This is an open access article under the CC BY License (https://creativecommons.org/
licenses/by/4.0/).

01
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

relational database management system in the 1960s, laying the finance, and business. These applications involve using data
foundation for modern data tables. to develop predictive models, optimize processes, and inform
Through hypertext, hyperlinks, and search engines, the internet decision-making. Impact of Data Science: The impact of data
era facilitated the proliferation of big data. In the 21st century, data science can be seen in the improvements it brings to various fields.
science has emerged as a distinct discipline, evolving continuously For instance, in healthcare, data science can help improve patient
with new tools, techniques, and applications and has become an outcomes by enabling early detection of diseases. In education, data
integral part of industries such as finance and astronomy. science can help personalize learning and improve student
performance. In finance, data science can help detect fraud and
2. Literature Review improve risk assessment. In business, data science can help improve
customer segmentation and market analysis.
Data science has become an integral part of modern society, and In conclusion, data science is a multidisciplinary field that has
its applications are becoming increasingly important. With the applications in various fields. The field is built on the principles of
advancement of technology, data collection has become easier, and data collection, analysis, and interpretation, which are crucial for
the need for data-driven decision-making has increased. In recent informed decision-making. The applications of data science have
years, numerous studies have been undertaken to explore the the potential to revolutionize various industries and improve
applications of data science in various fields. This literature review decision-making processes.
will provide an overview of the studies conducted on data science
and its applications. 3. Data Science and Its Applications
Data science is a multidisciplinary field that combines statistics,
computer science, and domain-specific knowledge to extract insights Data science has a wide range of applications across various
and knowledge from data. The field is concerned with the extraction industries and domains. Some of the key applications of data
of useful information from data, and its applications range from science include:
business to healthcare, education to social sciences, and more. The Business: Data science is widely used in the business world to
field of data science has evolved significantly over the past few optimize operations, improve customer experience, and make better
decades, and with the growth of data collection, the need for data- decisions. Applications of data science in business include:
driven decision-making has become more important.
(a) Customer Segmentation: Data science is used to segment
Data science has found applications in various fields, including
customers based on their behavior, demographics, and
but not limited to healthcare, education, finance, and business. In
preferences. This information can be used to develop targeted
healthcare, data science is used to analyze patient data and
marketing campaigns, improve customer service, and increase
develop predictive models to aid in the diagnosis and treatment of
customer satisfaction.
diseases. In education, data science is used to analyze student
(b) Fraud Detection: Data science is used to detect fraudulent
performance and develop personalized learning plans. In finance,
transactions, identify suspicious patterns, and prevent financial
data science is used for fraud detection, credit scoring, and risk
losses. Fraud detection techniques include anomaly detection,
assessment. In business, data science is used for market analysis,
clustering, and classification.
customer segmentation, and supply chain optimization. According
(c) Predictive Modeling: Data science is used to develop predictive
to Data Science & AI Community (2022), data science has been
models that can forecast future trends and outcomes. Predictive
extensively used for fraud detection, tax evasion, defense,
modeling techniques include regression analysis, time series
cyberattacks, and terrorist activity.
analysis, and decision trees.
2.1. Theoretical framework
3.1. Healthcare
Data science is a multidisciplinary field that draws from statistics,
computer science, and domain-specific knowledge. The field is built on Science has revolutionized healthcare by allowing for more
the principles of data collection, analysis, and interpretation, which are precise diagnoses, disease prediction, and the development of
crucial for informed decision-making. The following theoretical individualized treatments. It also facilitates the streamlining of
framework outlines the key concepts and principles that underpin the operations and the improvement of patient outcomes, thereby
applications of data science. Data Collection: Data collection is the reducing costs and increasing efficiency. With data science,
process of gathering relevant information for analysis. The quality of healthcare providers can analyze vast quantities of medical data,
the data collected is crucial for accurate analysis and interpretation. such as patient records, medication interactions, and genetic
The collection of data can be done using various methods, including information, resulting in more informed decisions. This can also
surveys, interviews, observations, and experiments. Data Analysis: result in more efficient clinical trials and a shortened approval
Data analysis involves the process of examining, cleaning, and process for new medications. Data science is ultimately transforming
transforming data to extract insights and knowledge. Data analysis healthcare by making it more patient-centric and data-driven,
can be done using various techniques, including descriptive statistics, thereby enhancing the quality of treatment and patient outcomes.
inferential statistics, data visualization, and Kassambara (2017) has
Applications of data science in healthcare include:
made some interesting insights about network analysis and
visualization through data science. (a) Personalized Medicine: Data science is used to develop
Data Interpretation: Data interpretation involves making sense of personalized treatment plans based on a patient’s genetics,
the data analyzed. This process involves drawing conclusions and lifestyle, and medical history. This approach can improve the
making predictions based on the insights and knowledge extracted effectiveness of treatment and reduce the risk of adverse reactions.
from the data. Data interpretation can help inform decision-making (b) Disease Modeling: Data science is used to model the spread and
in various fields. Applications of Data Science: Data science has impact of diseases, such as COVID-19. These models can help
applications in various fields, including healthcare, education, healthcare providers and policymakers make informed decisions

02
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

about resource allocation, prevention measures, and treatment and student support using data science. It also helps forecast the
strategies. student’s success and performance. By incorporating data science into
(c) Electronic Health Records (EHRs): Data science is used to education, institutions can improve student outcomes, decrease
analyze EHRs to identify patterns and trends in patient data. attrition rates, and boost overall efficiency. Data science is an
This information can be used to develop new treatments, indispensable instrument for educators who wish to provide a high-
improve patient outcomes, and reduce healthcare costs. quality education that meets the requirements of all students.
(d) Data science has developed a robust image identification tool
that provides physicians with a comprehensive comprehension Applications of data science in education include:
of intricate medical imagery. Machines make it possible to (a) Learning Analytics: Data science is used to analyze student data,
identify the flaws in the image. Provost and Fawcett (2013a, such as grades, attendance, and engagement, to identify areas of
2013b) have demonstrated the impact of data science in business. improvement and develop personalized learning plans.
(b) Educational Research: Data science is used to analyze large-scale
“Based on a journal published by Johnson & Johnson, Data science is
educational datasets to identify trends, patterns, and correlations.
revolutionizing modern medicine” by accelerating and improving
This information can be used to inform policy decisions, develop
the understanding, diagnosis, and treatment of diseases.
new teaching strategies, and improve student outcomes.
Algorithms, ML, and AI enable this transformation by enabling
physicians to rapidly analyze huge amounts of data, which allows Many universities, including John Park University, are incorporating
for prompt and effective medical interventions. data science into their curricula in recognition of its significance in
Historically, comprehending a disease required the manual sorting modern healthcare. Sonography, computed tomography, magnetic
and analysis of data, which was a sluggish and often tedious process. resonance imaging, and nuclear medicine are available
This has drastically changed with the introduction of sophisticated specializations in the Bachelor of Science in Medical Imaging
data science instruments. Two years after the emergence of the program. The incorporation of data science into these courses
SARS-CoV-2 virus, for instance, scientists have gained extensive provides students with the skills necessary to manage complex
knowledge about its infectious nature, treatment, and methods to medical imaging data and prepares them for leadership positions
mitigate severe disease, primarily because of global data sharing. in a variety of healthcare settings. The initiative emphasizes the
Michael Morrissey, Global Head of Early Detection & Data significance of data science in enhancing the efficacy and
Science at Johnson & Johnson’s Lung Cancer Initiative, compares efficiency of modern medical image processing.
the process of extracting treatment indications from large datasets
to searching for a needle in a haystack. However, the 3.3. Government
implementation of rigorous statistical methods by data scientists
can more precisely pinpoint the “needle.” Data science has numerous applications in everyday life,
Johnson & Johnson employs these innovative techniques in especially in government operations, where it aids in the detection
over 120 projects, or roughly 90% of their pipeline, to enhance of fraud, tax evasion, terrorist activities, and cybercrime.
treatments and potentially prevent the advent of fatal diseases. Governments employ data analytics and intelligent data
According to Najat Khan, Chief Data Science Officer and Global technologies to combat fraud and financial irregularities, thereby
Head of Strategy and Operations for Research & Development at minimizing losses. Modern analytic techniques are also used to
Janssen Pharmaceutical Companies of Johnson & Johnson, data detect tax evasion by examining financial and social media data and
science is utilized from the disease discovery phase until a comparing individuals’ expenditure patterns to their reported
medicine is made available to patients. They combine AI, ML, incomes. Big data, ML, and AI technologies are crucial for
real-world evidence, and digital health with a vast quantity of decision-making and real-time threat detection in defense and anti-
anonymized patient data to obtain transformative insights and terrorism efforts. According to Chinthamu et al. (2023),
generate tangible results for their pipeline and patients. cybersecurity is another area where data science plays a significant
In addition to lung cancer, pulmonary arterial hypertension, and role by monitoring network activity for suspicious behavior.
the diversification of clinical trials, the company is utilizing data According to Open Access Government (2019), speeding up results
science in several other areas. Early lung cancer detection is one of in the public sector with data science, and the increasing demand
their primary initiatives. Early detection of lung cancer can for data scientists, is proof of the rising demand for these skills in
substantially improve a patient’s prognosis, but detection can be the public sector.
difficult due to ambiguous symptoms and a lack of screening Data science can provide the government with numerous
resources. Through the Lung Cancer Initiative, the data scientists at benefits. Utilizing advanced analytics and ML techniques, data
Johnson & Johnson are utilizing data and technology to assist science can assist in enhancing decision-making, optimizing
physicians in identifying and treating lung cancer before it progresses. resource allocation, identifying potential risks and opportunities,
In conclusion, the advent of data science ushers in a new era of and enhancing the delivery of public services. In addition, data
modern medicine, allowing for quicker disease comprehension, early science can aid in the detection of deception and corruption, the
detection, and effective treatments (Mascia, 2022). As Johnson & monitoring and evaluation of policy efficacy, and the
Johnson exemplifies, data science is not only transforming the medical improvement of public safety and security. By utilizing data
landscape but also offering optimism for improved patient outcomes. science, governments can become more efficient, effective, and
responsive to the requirements of their constituents, resulting in
3.2. Education improved governance and social outcomes.
Applications of data science in government include:
In numerous ways, data science has revolutionized education. It has
made it simpler to individualize instruction, monitor student progress, (a) Public Safety: Data science is used to analyze crime data to
and identify areas where students may require additional assistance. identify patterns and trends and to develop predictive models
Educators can make informed decisions about curriculum, instruction, that can help prevent crime and improve public safety.

03
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

(b) Disaster Response: Data science is used to model the impact of 3.5. E-Commerce
natural disasters and to develop response plans that can minimize
the impact on affected populations. The e-commerce companies can benefit from data science by
(c) Social Services: Data science is used to analyze social service using it to:
data, such as welfare and housing assistance, to identify areas
of need and to develop targeted programs and services. First, dividing up clients.
Second, tailored suggestions.
Emma Tomkins, Public Sector Specialist at SAS UK, discusses the Third, identifying instances of fraud.
potential of data science to improve citizen outcomes in the public Fourth, enhanced ability to make sound choices.
sector, despite budgetary restrictions and Brexit uncertainty. By Fifth, benefit from a superior edge.
integrating data analytics and data science, meaningful insights for
improved decision-making can be derived. For example, the police Rebate Key’s CEO and creator Ian Sells says that the site’s dual
force uses advanced analytics to focus on actual threats, and Rogers purpose of assisting shoppers in finding the best deals and assisting
Communications uses ML to predict customer behavior, resulting in merchants in increasing their sales and visibility in online
a 53% reduction in customer complaints. The lack of widespread marketplaces is reflected in the company’s name.
adoption of these solutions in the United Kingdom government is, Using data science, e-commerce companies can gain a deeper
however, due to difficulties in team collaboration, a focus on understanding of their customers’ online activities, channel
technology over outcomes, language preferences in coding, and preferences, and even the circumstances surrounding a purchase.
departmental data silos. To overcome these obstacles, a cultural The results of search engine suggestions are used to promote
transition toward decision-making based on evidence and additional our goods and services. Recommendations help us set the pace
investment in data analytics and data science solutions are required. for the market and boost revenue. We utilize data science to
achieve this.
Using algorithms and technology, data science provides insights
3.4. Fintech from various data categories, thereby enhancing the e-commerce
In the finance sector, data science can be used in several ways, consumer experience. It automates processes, creates consumer
including but not limited to the following: profiles based on their activities and social media profiles, and
provides extremely precise product suggestions. It helps prevent
First, methods for identifying and stopping fraud. fraud by identifying deviations from normal consumer behavior.
Second, risk assessment and price forecasting. Data science considerably enhances inventory management by
Third, focusing on specific groups of customers. preventing investments in low-selling products and predicting future
demand. It empowers recommendation systems using browsing
Fourth, evaluation of the financial market one of cane bay partners’ history and AI/ML, resulting in increased sales by suggesting
original partners, Kirk Chewning, has this to say, “According to Nixdorf products that correspond to customer interests. Therefore, data
(2021), decision-makers in the fintech industry are increasingly turning science is essential for success in the e-commerce industry.
to business intelligence tools developed on a foundation of Picture Source: datasciencecentral.com top ways in which data
multidimensional data analytics.” Our customers are demanding that science improves e-commerce sales.
we anticipate their needs and respond to them in a timely fashion by To be more precise, we use deep learning algorithms to assess
projecting historical data onto their projected daily business. user preferences and click through rates to make recommendations.
Kirk has spent the last 29 years working as an entrepreneur and The user’s search and purchase histories are also factored into the
consultant in the financial services sector, during which time he has suggestions provided.
developed data-driven underwriting frameworks for several The use of data science in Business in recent years, has emerged
businesses. The above citation is an extended excerpt from his as a crucial resource for companies. Data science allows companies
discussion of the current state of data science in the financial to monitor consumer actions, learn about product demand, and
technology industry and its potential future directions. anticipate market shifts. Using data science, stores can identify the
Some Uses of Data Science in Online Business. The field of most frequently returned items and restock those items.
data science has numerous uses in online business. It can be used Data science can be applied in numerous contexts to boost a
to enhance client loyalty, enhance marketing efforts, and lessen business’s efficiency. Predictive algorithms are just one
instances of fraud. As a bonus, data science can also be used to application of data science. Predictive modeling is the process of
boost productivity and revenue. using past data to make predictions about the future. Foreseeing
In the realm of electronic trade, data science can be put to many sales on Black Friday is just one use for data science in business.
uses. It can be used to enhance the quality of service provided to Because of this, the business will be able to better prepare for the
customers, boost revenue, and streamline business processes. occasion and ensure adequate supplies. Advertising strategies can
Cloud technology advancements increase our capacity to store also be improved with the help of data science. Businesses can
and process enormous quantities of data. This development has improve their decision-making through client data analysis.
spurred the digital transformation of all industries, including As LuckLuckGo’s Ryan Young puts it, when it comes to future
banking and finance, which now generate diverse data types and planning, data science is having a profound impact on how
offer multiple opportunities for decision-making using statistical, companies operate today. Companies like ours, LuckLuckGo, are
ML, and AI methods. In the BFSI industry, data science improves now using data science tools in place of the previously prevalent
operational efficiency, enhances customer experience, identifies practice of relying on the subjective opinions of a small group of
growth opportunities, and facilitates risk analysis and fraud employees. We have been able to analyze our company’s
detection. As data science and ML flourish on variegated data and structure, output, and productivity using the power of data science
specialized skills, it is anticipated that the fintech industry, which and its tools, and thus pinpoint areas where additional investment
possesses both, will implement AI and ML extensively. could enhance our overall performance.

04
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

We were terrible at getting a good return on investment in the The hunt for exoplanets orbiting other stars, the investigation of
business before we started using data science in our processes. Now stellar evolution, and the detection of supernovae are just a few
and in the future, data science will direct the company’s investments examples of how Astro-informatics is used in the study of stars. To
in enhancing its processes, ensuring that those investments generate a learn more about stars and their characteristics, data scientists use tools
high return on investment. Our vision is that the future of like the Hubble Space Telescope and the Chandra X-ray Observatory.
LuckLuckGo’s will be an efficient one powered by AI and data Astro computing is employed in a wide variety of fields,
science to ensure that we can remain ahead of the competition and including but not limited to those listed above, the study of black
continue to provide our customers with an exemplary experience holes, and the investigation of dark matter and dark energy. Data
worth their time. science plays a crucial part in all these fields by helping us make
sense of the massive amounts of astronomical data being generated.
3.6. Astro-informatics Handling massive databases is a major obstacle in the field of
Astro-informatics. Traditional methods of data storage and analysis
are no longer sufficient for the massive amounts of data being
generated by astronomical observations. Therefore, new methods of
data storage and analysis that can deal with the size and complexity
of astronomical data are developed using data science techniques.
ML is one such method; it uses algorithms that can analyze data and
draw conclusions or find trends. In Astro-informatics, ML is used to
evaluate massive datasets and spot patterns that would be invisible to
human eyes. For brand new supernovae detection, for instance,
scientists are using ML methods to sift through Dark Energy Survey data.
Data visualization is another method used in Astro-informatics;
it includes the creation of visual representations of data to aid in the
comprehension of large datasets. With the help of data visualization
methods, astronomical data are being made visually accessible for
The field of astrophysics is just one area where data science has scientists and the public in the field of Astro-informatics.
made significant strides. The application of data science to the To sum up, Astro-informatics is a subfield of computer science
analysis of massive amounts of astronomical data has given rise to that uses data science methods for celestial data analysis and
the discipline of Astro-informatics, which in turn has led to new interpretation.
discoveries and insights into the cosmos. In this piece, we will
delve into the intersection of data science and Astro-informatics, 3.7. Bioinformatics
as well as examine the various applications of this field.
Analyzing, processing, and managing massive databases
derived from astronomical observations are the focus of Astro-
informatics, a branch of astronomy. Thanks to the development of
cutting-edge telescopes like the Hubble Space Telescope and the
Chandra X-ray Observatory, scientists can now amass an
unprecedented amount of data. Conventional methods of data
analysis are insufficient because of the sheer magnitude of the
data being generated. So, Astro-informatics strongly depends on
data science for analysis and interpretation of astronomical data.
The field of data science applies statistical and analytical
techniques to large datasets to draw conclusions and useful
information. Data science methods are applied to the processing
and analysis of astronomical data in the field of Astro-informatics,
resulting in new understandings and discoveries about the cosmos.
Picture Source: Robohub Talking Machines with Data Science
Africa, with Dina Machuve
The study of the cosmos as a whole (cosmology) is one of the
main applications of Astro-informatics. Astronomical data are used Tools and methods for comprehending biological data are being
by cosmologists to learn about the universe, planets, and the developed in the interdisciplinary area of bioinformatics, which is
universe’s history and structure. Cosmologists can better expanding at a rapid rate (Guo & Zou, 2019). Next-generation
comprehend the structure and evolution of the universe, thanks to sequencing and other advances in technology have allowed
data science techniques applied to massive amounts of data scientists to collect vast quantities of biological data, necessitating
gathered from surveys of galaxies, galaxy clusters, and the cosmic the development of powerful computational methods for analyzing
microwave background radiation. this information. This paper will address the role of bioinformatics
The hunt for exoplanets (planets beyond our solar system) is in data science and the ways in which the two disciplines are
another significant application of Astro-informatics. One of the intertwined.
most important astronomical findings in recent decades is that of Altschul et al. (1990) talk about the role of data science in
exoplanets, and data science was instrumental in this finding. Data biology, and data science is transforming bioinformatics by
science methods are used by astronomers in the identification of facilitating the management and analysis of intricate biological
possible exoplanets and the determination of their characteristics data. It is useful for DNA sequencing, protein classification, and
like size, mass, and orbit. modeling protein structure. Dhar (2013) sheds light on prediction

05
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

and data science relationship. AI algorithms accelerate genome used, for instance, to investigate the human microbiome and its
sequencing, enabling personalized therapies and disease predictions function in digestion and illness. Analysis of microbiome data by
based on individual genomes. AI enhances gene expression analysis, bioinformatics is also used to determine which biochemical
allowing precision cancer therapy based on a tumor’s genetic pathways are operational in each ecosystem.
structure. Robinson et al. (2010) talk about this specific information. Bioinformatics is also being used in data science to create novel
AI also improves protein classification and structure prediction, instruments and techniques for analyzing data, in addition to the
which contributes to the efficacy of drug design. In addition, applications.
Generative Adversarial Network (GAN) can generate new data ML algorithms are being created to analyze biological data such
instances for training AI algorithms. As more biological data are as gene expression data to detect patterns and provide predictions, for
collected, the potential of AI in bioinformatics will continue to grow, instance. Complex biological data, such as protein structures and
promising significant time and cost reductions in biological research. networks, are a major focus of data visualization study.
The field of bioinformatics deals with the use of computers for the Integrating and analyzing different kinds of biological data are
purpose of analyzing and making sense of biological data. Gene major obstacles in the field of bioinformatics. To fully comprehend
expression analysis encompasses a broad range of tasks, such as the the underlying biological processes, researchers may wish to analyze
creation of sequence alignment algorithms, the discovery of new genomic, proteomic, and metabolomic data from the same biological
functional elements in genomes, and the interpretation of experimental sample. Data science equips us with the resources to combine and
results. Bioinformatics, like data science, is fundamentally concerned evaluate all this information.
with making sense of large amounts of information. To sum up, bioinformatics is an emerging discipline that is crucial
On the other hand, data science is concerned with the study of to making sense of biological data. It has many uses in data science,
how to gain knowledge and understanding from data through the from genomics and proteomics to microbiome analysis, and it
application of statistical and computational techniques. ML, data includes the application of computational techniques to the analysis
extraction, and data visualization are just some of the many and interpretation of biological data. New data analysis tools and
methods that fall under its umbrella. The sheer volume of methods, such as ML algorithms and data display strategies, are
biological data being produced makes data science. being developed with the help of bioinformatics. There will be an
A necessity in bioinformatics. As a result of the tools made ever-increasing demand for bioinformatics within the field of data
available by data science, new biological findings and insights can science as the quantity and intricacy of biological data continue to rise.
be gained from this information.
Mardis (2017) lights up the innovations in DNA sequencing 4. The Current Trend on the Data Science
technologies. Bioinformatics is widely used in genomics, a subfield
of computer science. Genomics entails investigating not only the Without a shadow of a question, data science is changing the
DNA code but also the entirety of an organism’s hereditary make- course of history. The importance of data science is growing as
up. Now that organisms’ genomes can be sequenced rapidly and companies increasingly rely on digital tools. Staying ahead of the
cheaply, thanks to next-generation sequencing technologies, huge curve requires learning this technology and entering one of the
quantities of genomic data are being generated. When trying to most potential fields within data science.
make sense of all this information, bioinformatics comes in very handy. The goal of data science is to gain understanding by analyzing
Genomic data are analyzed with the help of bioinformatics tools collected data. It combines elements of mathematics, statistics, and
and techniques to find genetic variants that may be linked to diseases computer science to better analyze and interpret big datasets and
or other characteristics. Genomic data analysis using bioinformatics, inform policymaking.
for instance, can be used to look for changes that might increase a McAfee et al. (2012) have discussed the importance of big data
person’s risk of developing cancer. Likewise, genomic data and the revolution it has brought. The goal of data science is to help
analysis by bioinformatics is used to determine gene functions and businesses improve their decision-making in areas such as resource
the regulatory factors that govern gene expression. allocation, process optimization, and client service.
Proteomics research is one area where bioinformatics has Information technology is not the only field where data science
become an integral part of data science. The term “proteomics” can be applied. But it has spread its wings to various sectors/
refers to the study of all the proteins that are made by a living industries. Wu et al. (2008) explain the importance of information
entity. Proteins catalyze chemical reactions, transport molecules technology in the modern world. Many fields are making use of
across cell membranes, and send signals between cells; they are data science’s resources and methods. It has become clear that this
crucial molecules with many roles in cells. technology brings incredible advantages to any area where it is
Proteomic data are analyzed with bioinformatics tools and implemented. As a result, many sectors are adopting this
techniques to determine proteins and their roles. Proteomic data technology, driving up demand for data scientists.
analysis, for instance, requires the use of bioinformatics to determine Signature Ly’s co-founder and data scientist William Cannon has
which proteins play a role in various disease processes. To better provided some insightful commentary on the field’s transformative
comprehend how proteins collaborate within cells, bioinformatics is potential.
also used to analyze proteomic data to reveal protein–protein interactions. Data Science: Shaping the future of industry business organizations
Microbiome research is another application of bioinformatics in the can benefit from data science because it teaches them how to sift through
data science realm. Microbiomes are collections of microorganisms massive amounts of data to find actionable insights.
found in a specific habitat, such as the human digestive tract or a Aspects of the economy that affect the need for data scientists
patch of dirt. The microbiome plays an important part in digestion, providing clear explanations to business customers, so that they can
immunity, and disease. have faith in data science. To be successful, they need to simplify
Tsai and Coyle (2009) discuss the effects of microbiomes and their models for use in business presentations. Lack of confidence
obesity, and microbiome data are analyzed using bioinformatics in the model could prevent some companies from implementing it.
tools and techniques to determine the types and functions of We need to get better at making machines work in the real
microorganisms existing in each ecosystem. Bioinformatics is world, as some of them are not very user-friendly and others may

06
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

behave differently based on the conditions. It is crucial that it works scientist’s job is to sift through piles of information and draw
everywhere and that its function is simplified to make it more conclusions that will aid in the decision-making process for a
intuitive for users. “Time to value” needs to be reduced because company. There is a growing shortage of skilled data scientists,
data handling takes so long, which is an issue when planning your making it challenging for companies to hire new employees.
day. Companies are unlikely to be amenable to the lengthy However, data science positions can be highly lucrative for those
process of continually testing a theory. who possess the necessary abilities and experience. Data science
Data analysts are in high demand, but there is currently a jobs are in great demand. When compared to other fields, why has
shortage of them. However, there remains a deficiency of people data science become so trendy? Expertise in mathematics,
interested in this field of employment. Perhaps this is because it is programming, and marketing are all helpful in data science. It
a challenging path to pursue. simplifies the process of analyzing and making sense of complex
To be successful in data science, it is necessary to make data data for companies. The need for data scientists is expected to grow
usable for analysis. Improving data quality through applicable in the coming years as an increasing number of companies come to
data science tasks that can be put into practice is crucial. rely on information obtained through data collection and analysis.
The most cutting-edge data science uses in the financial sector, Data science is used across industries for issue solving and
for example, data science has allowed banks to better manage their improved decision-making, and it has many potential applications
resources and detect and prevent fraud, as well as better handle their in the hiring process. By studying and evaluating data, data
customers’ information. scientists aid businesses in making more informed choices. The
The financial sector makes strategic choices with the help of financial sector, healthcare, the retail sector, the manufacturing
automated risk analytics. Detection, tracking, and mitigation of sector, and so on have all discovered uses for data science.
threats are all carried out with the aid of ML. Human resource management and employee selection are two
Transit is becoming more secure as data science increases the areas where data science is finding widespread application. Data
vehicle’s efficiency and provides the driver with more science is being used by human resources to analyze candidate
independence. The increasing use of human food necessitated a applications, social media profiles, and past and present employee
great deal of data science-related analysis in the logistics and performance to spot trends and make more informed hiring
supply chain area, as seen in the latest trends in e-commerce choices. Recruitment analytics, also known as talent analytics, is
ordering, which increased the demand for smart yard management the application of data science to the field of human resources.
and data science-based gate allocations for trucks to save time and Data science in HR refers to the analysis of information about
automate the truck assignments to save time and help companies current and prospective workers. Information about potential
serve online, time-sensitive orders. workers, existing employees, and employers is all included. Better
Factories now employ data analysts instead of regular workers hiring choices can be made with the aid of recruitment analytics.
because of this. Data scientists are preferred by businesses over regular The future enterprise resource planning (ERP) software
employees due to the positive financial impact they have on the company. with the data science: In the 1960s, the German company SAP
A loan advisor editor named Kevin Miles has this to say: (Systems, Applications, and Products in Data Processing) developed
expanding use cases, the need for more qualified professionals, the first ERP software. Not until the 1990s, however, did ERP
and greater effort by organizations to consolidate multiple data systems with their integrated modules and extensive functionalities
sources all contribute to the rising demand for data science. The emerge. The term “ERP” was coined in the early 1990s, and
most recent data show that many modern businesses use companies such as SAP, Oracle, and Baan published the first
predictive modeling and ML to improve their processes. modern ERP systems around the same time. Since then, ERP
Incorporating all these technological advancements has greatly systems have continued to develop and expand their functionality,
improved the precision and consistency of forecasts for the future. with cloud-based solutions and mobile access gaining popularity in
As companies become more and more reliant on data-driven recent years. The traditional software was built on relational
insights for decision-making, data science has emerged as a rapidly databases, but as the business expanded with globalization, the ERP
expanding field. The need for experts who can analyze and make systems became difficult to maintain data and applications. As a
sense of the massive amounts of data being produced at breakneck result, new ERP software companies like MahaaAi (2023) and
speeds is being fueled by these factors. Companies in every sector many startups are now coming up with AI-based zero extract,
are desperately seeking data scientists who can give them a leg up in transform, and load (ETL) data science usage solutions that will
the marketplace. Thus, data science is rapidly becoming one of the save the business a lot of time and effort in the future by
most sought-after competencies across all sectors and establishments. automating many human tasks, chat-based applications, and virtual
Renowned research from Wu et al. (2014) explained why the assistant assistants. With data science, analytics, predictions, and
need for data scientists is rising as more and more companies and prescriptive analysis are significant wins in the ERP industry.
groups recognize the importance of data. To gain meaning from
massive amounts of data, data scientists use a combination of 4.1. A new era of data science: Revolutionizing
techniques from data mining, ML, maths, and statistics. This means industry with zero ETL ERP and modern cloud
that it has found widespread application across sectors as disparate
as the financial sector, the healthcare sector, the industrial sector, In the fields of data management and data science, a paradigm
and the retail sector. The Bureau of Labor Statistics predicts a 28% shift is taking place. Once considered essential for data preparation,
increase in demand for data scientists from 2016 to 2026, which is traditional ETL processes are progressively being supplanted by zero
much higher than the average for all professions. Much of this ETL strategies enabled by modern cloud technologies. The advent of
expansion can be attributed to the creation of new data science zero ETL with the modern cloud is not merely a technical
positions, which have supplanted several older occupations. advancement; it is igniting an industry revolution with far-reaching
Jobs in the field of data science are in high demand. As more and effects on data science. The ERP industry is becoming to super
more companies recognize the potential in data for driving operational tough due to increasing the business rules and data type and it is
improvements, the need for data scientists continues to rise. A data heavily increasing the supporting cost and user efforts for the ERP

07
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

and supply chain industry, adopting the data science with ML exploration, and data democratization by eradicating the need for
technologies with zero ETL like MahaaAi ERP will save enormous laborious data transformation processes and leveraging the power
amount of cost and application supporting cost for the modern industry. of the cloud. In a world increasingly driven by data, organizations
AI and data science are transforming ERP systems by improving that embrace these new opportunities stand to gain a competitive
data analysis, user experience, and machine monitoring. AI tools and edge as this revolution unfolds.
ML analyze large datasets, identifying patterns and trends for Industry experts and senior leaders’ opinions: Moonchaser
immediate business decision-making and response. These tools CEO David Patterson-Cole adds, students and recent graduates are
enable accurate forecasting and inventory management by learning this lesson the hard way in data science, where skills are
monitoring suppliers, consumer behavior, production inefficiencies, constantly evolving, and new areas are opening. The days are
quality concerns, and customer inquiries. AI enhances the user gone when your first employment and degree determined your
experience by simplifying processes and implementing virtual agents area of expertise for the rest of your career. Now, you can
for real-time alerts and information. It also improves machine enroll in a bootcamp or study online to rapidly become
monitoring by providing performance metrics in real time to detect proficient in a field like natural language processing or a
and predict production anomalies. When combined with the Internet programming language like R and become a competitive
of Things, AI analyzes equipment performance, enabling proactive candidate for high-paying jobs in that field.
management, preventative maintenance scheduling, and enhanced On this, the last day of February 2022, I will attest that
overall equipment effectiveness, thereby preventing costly shutdowns. cybersecurity and cloud security have garnered massive attention.
Historically, ETL procedures have been the backbone of data The media’s constant coverage of cyberattacks should make the
management, allowing data to be extracted from various sources, danger obvious. Data scientists are not necessarily experts in this
transformed into an appropriate format, and inserted into a data area, but if you can demonstrate the relationship between what
warehouse. However, these processes are time-consuming, frequently you do with the data and how/why it is secured, you stand a
intricate, and demand substantial computational resources. The strong chance of being a preferred candidate for a given position.
introduction of zero ETL, which negates the requirement for data Data science, the practice of gaining understanding or wisdom
transformation prior to importing, is a game-changer. from data, is increasingly being used in digital marketing. To do this,
Zero ETL leverages the capabilities of modern cloud-based data it employs a wide range of statistical, ML, and AI-based processes
platforms, which enable the ingestion of unprocessed, unstructured and algorithms.
data. Data are no longer compelled to conform to predefined Marketing a product or service to customers via digital means.
schemas, which reduces the time, resources, and potential for Data science can be helpful here by allowing for more precise
error associated with conventional ETL. In addition, modern targeting, better insight into consumer behavior, and more
cloud platforms provide robust, scalable storage and accurate forecasting of future desires. Click-through rates, convert
computational resources, enabling real-time data processing and rates, and return on investment can all benefit from data science.
advanced analytics on vast quantities of unprocessed data. It can also be used to determine the best ways to contact your
This evolution in data administration is causing a revolution in target audience and adjust your marketing strategies accordingly.
the industry, especially in the field of data science. With zero ETL Data science can also be used to learn about the consumer
and contemporary cloud technologies, data scientists now have experience on a website and to assess its strengths and weaknesses.
direct access to raw, granular data, thereby expanding the scope of According to inmotionmktg.com’s Head of Marketing Bryan
in-depth analysis and data-driven insights. Phillips, with the help of data science, I can optimize my B2B sales
Data scientists can delve deeply into the data, recognize patterns channels and better focus my leads. In recent years, there has been a
and correlations, and extract valuable information that may have rise in the marketing industry’s need for data scientists as businesses
been lost during the traditional ETL transformation process. They seek more efficient and affordable digital advertising strategies.
can work with real-time data, allowing for more expeditious Increasing the efficiency of advertisements is a top priority for many
insights and the ability to react swiftly to shifting circumstances. companies, which is why the development of methods to micro-
In addition, the scalability of cloud resources enables data target particular audiences is so exciting.
scientists to manage massive amounts of data, allowing for more Businesses also want a program that provides instantaneous
thorough analyses and accurate predictions. data, such as how digital marketing campaigns have impacted
The effects of this revolution are already visible in numerous sales and expansion.
industries. Zero ETL enables real-time analysis of patient data in Data are gathered from online sources, search engine
the healthcare industry, thereby improving patient care and optimization, social media, and marketing statistics by data
outcomes. In finance, it facilitates real-time fraud detection and scientists. All these considerations are baked into the newest data
risk assessment. In the manufacturing industry, it enables real- analysis applications and websites for a more complete picture.
time monitoring and predictive maintenance of apparatus, thereby Tools like project management programs and communication
decreasing downtime and boosting output. suites can be seamlessly combined with these.
In addition, this transition to zero ETL and the modern cloud Many businesses now only gather data from customers who
fosters a culture of data democratization. Even non-technical specifically give them permission to do so. They are also
personnel can investigate data, generate insights, and make data- investing heavily in cybersecurity measures like data encryption
driven decisions when they have simple access to raw data and and the development of algorithms to prevent unauthorized access
intuitive cloud-based tools. It results in a more inclusive, data- to sensitive client information.
savvy organization in which data are no longer the exclusive The firm SERP helps businesses fill revenue gaps through
domain of data scientists but rather a valuable resource for everyone. sophisticated, data-driven digital marketing strategies. Provost and
In conclusion, the introduction of zero ETL in conjunction with Fawcett (2013a, 2013b) highlighted the effects of data science in
modern cloud technologies is transforming the landscape of data data-driven decision-making.
management and revolutionizing industries. It ushers in a new era Data science has had a profound impact on my job, and I can
of data science characterized by real-time analysis, in-depth data already see how it is influencing the direction of the company.

08
Journal of Data Science and Intelligent Systems Vol. 00 Iss. 00 2023

Today, data science allows for targeted ads to be launched on an Chinthamu, N., Gooda, S. K., Venkatachalam, C. S. S., & Malathy, G.
individual basis, rather than just on broad demographic groups. (2023). IoT-based secure data transmission prediction using deep
When I started out in digital marketing, the number of unique learning model in cloud computing. International Journal on
messages I could send to my clientele was limited by the number of Recent and Innovation Trends in Computing and Communication,
authors we employed or the number of segments we created from our 11(4S), 68–76.
clientele database. Increased human output Thanks to more efficient Data Science & AI Community (2022). Top five data science
machine processing, human output has been dramatically increased. Applications in government and public sectors. Retrieved
Machines are designed to replace some traditional human labor and from: https://community.nasscom.in/communities/data-science-ai-
save people from grunt work. Thus, enterprises can generate revenue community/top-5-uses-data-science-government-and-public-sector
and profits. Davenport, T. H., & Patil, D. J. (2012). Data scientist: The sexiest job
Uses of data science in the banking sector data plays a crucial role of the 21st century. Harvard Business Review, 90(5), 70–76.
in the finance sector, making it one of the most data-driven sectors of the Dhar, V. (2013). Data science and prediction. Communications of the
global economy. Demand for data scientists in the financial sector has ACM, 56(12), 64–73.
been propelled by the increasing complexity of financial products and Guo, M., & Zou, Q. (2019). Perspectives of bioinformatics in big
the need to make prudent investment choices. data era. Current Genomics, 20(2), 79.
Numerous sectors of the finance sector have benefited from data Kassambara, A. (2017). Network analysis and visualization in R:
scientists’ efforts. It has a wide variety of applications, including but Quick start guide. USA: CreateSpace.
not limited to fraud analysis, risk management, client segmentation, Kumar, J. (2021). Data science and machine learning in FinTech.
and cost forecasting. Many banks and other financial organizations Retrieved from: https://machine-learning.ciotechoutlook.com/
are starting to use data science to get ahead. Financial organizations cxoinsight/data-science-and-machine-learning-in-fintech-nid-6627-
hire data scientists to sift through mountains of information in search cid-170.html
of trends. To foresee potential developments in the industry, they MahaaAi. (2023). Smart ERP configuration. Retrieved from: http://
employ several ML algorithms. mahaaai.com/smart-erp-configurations/
In the financial sector, data science is being used to enhance Mardis, E. R. (2017). DNA sequencing technologies: 2006-2016.
support to customers. The finance sector can greatly benefit from big Nature Protocols, 12(2), 213–218.
data. Banks and other financial organizations can use it to improve Mascia, K. (2022). How data science is ushering in a new era of modern
their decision-making by looking at historical trends. There is a medicine. Retrieved from: https://www.jnj.com/innovation/how-
direct link between the success of financial organizations and the data-science-ushers-in-new-era-of-modern-medicine
application of data science. McAfee, A., Brynjolfsson, E., Davenport, T. H., Patil, D. J., &
Barton, D. (2012). Big data: The management revolution.
5. Conclusion Harvard Business Review, 90(10), 60–68.
Nixdorf, D. (2021). Data science and machine learning in FinTech.
In conclusion, data science has proved to be an indispensable Retrieved from: https://community.nasscom.in/communities/
aspect of decision-making in a variety of industries and fields. Its data-science-ai-community/data-science-and-machine-
application in AI, manufacturing, fintech, government, Astro- learning-fintech
informatics, e-commerce, education, and bioinformatics has resulted Open Access Government. (2019). Speeding up results in the public
in significant enhancements, resulting in more efficient and effective sector with data science. Retrieved from: https://www.
processes. With the availability of data increasing, the demand for openaccessgovernment.org/public-sector-with-data-science/67746/
data scientists will continue to rise. The ever-evolving discipline of Provost, F., & Fawcett, T. (2013a). Data science and its
data science provides professionals with a thrilling opportunity to relationship to big data and data-driven decision making.
make a difference in society by tackling some of its most pressing Big Data, 1(1), 51–59.
problems. It is evident that data science has the potential to Provost, F., & Fawcett, T. (2013b). Data science for business: What
revolutionize how we live, work, and make decisions, and it will you need to know about data mining and data-analytic
unquestionably play a significant role in moulding the future. thinking. USA: O’Reilly Media.
To reduce human effort, the ERP industry needed to implement Robinson, M. D., McCarthy, D. J., & Smyth, G. K. (2010). EdgeR: A
numerous data science-related features, such as chat-based ERP Bioconductor package for differential expression analysis of
applications and virtual assistant support. The study’s conclusion digital gene expression data. Bioinformatics, 26(1), 139–140.
highlights the significant advantages of data science in the ERP Tsai, F., & Coyle, W. J. (2009). The microbiome and obesity: Is
industry, including self-service analytics, predictions, and prescriptive obesity linked to our gut flora? Current Gastroenterology
analysis. Reports, 11(4), 307–313.
Wu, X., Kumar, V., Ross Quinlan, J., Ghosh, J., Yang, Q., Motoda,
Conflicts of Interest H., : : : , & Steinberg, D. (2008). Top 10 algorithms in data
mining. Knowledge and Information Systems, 14, 1–37.
The authors declare that they have not conflicts of interest to this
Wu, X., Zhu, X., Wu, G. Q., & Ding, W. (2014). Data mining with
work.
big data. IEEE Transactions on Knowledge and Data
Engineering, 26(1), 97–107.
References
How to Cite: Chinthamu, N. & Karukuri, M. (2023). Data Science and Applications.
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., & Lipman, D. J. Journal of Data Science and Intelligent Systems https://doi.org/10.47852/
(1990). Basic local alignment search tool. Journal of Molecular bonviewJDSIS3202837
Biology, 215(3), 403–410.

09

View publication stats

You might also like