  • Open access
  • Published: 16 October 2023

Forecasting the future of artificial intelligence with machine learning-based link prediction in an exponentially growing knowledge network

  • Mario Krenn   ORCID: orcid.org/0000-0003-1620-9207 1 ,
  • Lorenzo Buffoni 2 ,
  • Bruno Coutinho 2 ,
  • Sagi Eppel 3 ,
  • Jacob Gates Foster 4 ,
  • Andrew Gritsevskiy   ORCID: orcid.org/0000-0001-8138-8796 3 , 5 , 6 ,
  • Harlin Lee   ORCID: orcid.org/0000-0001-6128-9942 4 ,
  • Yichao Lu   ORCID: orcid.org/0009-0001-2005-1724 7 ,
  • João P. Moutinho 2 ,
  • Nima Sanjabi   ORCID: orcid.org/0009-0000-6342-5231 8 ,
  • Rishi Sonthalia   ORCID: orcid.org/0000-0002-0928-392X 4 ,
  • Ngoc Mai Tran 9 ,
  • Francisco Valente   ORCID: orcid.org/0000-0001-6964-9391 10 ,
  • Yangxinyu Xie   ORCID: orcid.org/0000-0002-1532-6746 11 ,
  • Rose Yu 12 &
  • Michael Kopp 6  

Nature Machine Intelligence volume 5, pages 1326–1335 (2023)

  • Complex networks
  • Computer science
  • Research data

A tool that could suggest new personalized research directions and ideas by taking insights from the scientific literature could profoundly accelerate the progress of science. A field that might benefit from such an approach is artificial intelligence (AI) research, where the number of scientific publications has been growing exponentially over recent years, making it challenging for human researchers to keep track of the progress. Here we use AI techniques to predict the future research directions of AI itself. We introduce a graph-based benchmark based on real-world data—the Science4Cast benchmark, which aims to predict the future state of an evolving semantic network of AI. For that, we use more than 143,000 research papers and build a knowledge network with more than 64,000 concept nodes. We then present ten diverse methods to tackle this task, ranging from pure statistical to pure learning methods. Surprisingly, the most powerful methods use a carefully curated set of network features, rather than an end-to-end AI approach. These results indicate that there is great untapped potential for purely ML approaches that do not rely on human domain knowledge. Ultimately, better predictions of new future research directions will be a crucial component of more advanced research suggestion tools.

The corpus of scientific literature grows at an ever-increasing speed. Specifically, in the field of artificial intelligence (AI) and machine learning (ML), the number of papers every month is growing exponentially with a doubling rate of roughly 23 months (Fig. 1 ). Simultaneously, the AI community is embracing diverse ideas from many disciplines such as mathematics, statistics and physics, making it challenging to organize different ideas and uncover new scientific connections. We envision a computer program that can automatically read, comprehend and act on AI literature. It can predict and suggest meaningful research ideas that transcend individual knowledge and cross-domain boundaries. If successful, it could greatly improve the productivity of AI researchers, open up new avenues of research and help drive progress in the field.

Figure 1

The doubling rate of papers per month is roughly 23 months, which might lead to problems for publishing in these fields, at some point. The categories are cs.AI, cs.LG, cs.NE and stat.ML.

In this work, we address the ambitious vision of developing a data-driven approach to predict future research directions 1 . As new research ideas often emerge from connecting seemingly unrelated concepts 2 , 3 , 4 , we model the evolution of AI literature as a temporal network. We construct an evolving semantic network that encapsulates the content and development of AI research since 1994, with approximately 64,000 nodes (representing individual concepts) and 18 million edges (connecting jointly investigated concepts).

We use the semantic network as an input to ten diverse statistical and ML methods to predict the future evolution of the semantic network with high accuracy. That is, we can predict which combinations of concepts AI researchers will investigate in the future. Being able to predict what scientists will work on is a first crucial step for suggesting new topics that might have a high impact.

Several methods were contributions to the Science4Cast competition hosted by the 2021 IEEE International Conference on Big Data (IEEE BigData 2021). Broadly, we can divide the methods into two classes: methods that use hand-crafted network-theoretical features and those that automatically learn features. We found that models using carefully hand-crafted features outperform methods that attempt to learn features autonomously. This (somewhat surprising) finding indicates a great potential for improvements of models free of human priors.

Our paper introduces a real-world graph benchmark for AI, presents ten methods for solving it, and discusses how this task contributes to the larger goal of AI-driven research suggestions in AI and other disciplines. All methods are available at GitHub 5 .

Semantic networks

The goal here is to extract knowledge from the scientific literature that can subsequently be processed by computer algorithms. At first glance, a natural first step would be to use large language models (such as GPT-3 6 , Gopher 7 , Megatron 8 or PaLM 9 ) on each article to extract concepts and their relations automatically. However, these methods still struggle with reasoning 10 , 11 ; thus, it is not yet clear how they can be used to identify and suggest new ideas and concept combinations.

Rzhetsky et al. 12 pioneered an alternative approach, creating semantic networks in biochemistry from co-occurring concepts in scientific papers. There, nodes represent scientific concepts, specifically biomolecules, and are linked when a paper mentions both in its title or abstract. This evolving network captures the field’s history and, using supercomputer simulations, provides insights into scientists’ collective behaviour and suggests more efficient research strategies 13 . Although creating semantic networks from concept co-occurrences extracts only a small amount of knowledge from each paper, it captures non-trivial and actionable content when applied to large datasets 2 , 4 , 13 , 14 , 15 . PaperRobot extends this approach by predicting new links from large medical knowledge graphs and formulating new ideas in human language as paper drafts 16 .

This approach was applied and extended to quantum physics 17 by building a semantic network of over 6,000 concepts. There, the authors (including one of us) formulated the prediction of new research trends and connections as an ML task, with the goal of identifying concept pairs not yet jointly discussed in the literature but likely to be investigated in the future. This prediction task was one component for personalized suggestions of new research ideas.

Link prediction in semantic networks

We formulate the prediction of future research topics as a link-prediction task in an exponentially growing semantic network in the AI field. The goal is to predict which unconnected nodes, representing scientific concepts not yet jointly researched, will be connected in the future.

Link prediction is a common problem in computer science, addressed with classical metrics and features, as well as ML techniques. Network theory-based methods include local motif-based approaches 18 , 19 , 20 , 21 , 22 , linear optimization 23 , global perturbations 24 and stochastic block models 25 . ML works optimized a combination of predictors 26 , with further discussion in a recent review 27 .

In ref. 17 , a set of 17 hand-crafted features was used for this task. In the Science4Cast competition, the goal was to find more precise methods for the link-prediction task in a semantic network of AI that is ten times larger than the one in ref. 17 .

Potential for idea generation in science

The long-term goal of predictions and suggestions in semantic networks is to provide new ideas to individual researchers. In a way, we hope to build a creative artificial muse in science 28 . We can bias or constrain the model to give topic suggestions related to the research interests of an individual scientist, or of a pair of scientists, thereby suggesting topics for interdisciplinary collaborations.

Generation and analysis of the dataset

Dataset construction.

We create a dynamic semantic network using papers published on arXiv from 1992 to 2020 in the categories cs.AI, cs.LG, cs.NE and stat.ML. The 64,719 nodes represent AI concepts extracted from 143,000 paper titles and abstracts using Rapid Automatic Keyword Extraction (RAKE) and normalized via natural language processing (NLP) techniques and custom methods 29 . Although high-quality taxonomies such as the Computer Science Ontology (CSO) exist 30 , 31 , we choose not to use them for two reasons: the rapid growth of AI and ML may result in new concepts not yet in the CSO, and not all scientific domains have high-quality taxonomies like CSO. Our goal is to build a scalable approach applicable to any domain of science. However, future research could investigate merging these approaches (see ‘Extensions and future work’).

Concepts form the nodes of the semantic network, and edges are drawn when concepts co-appear in a paper title or abstract. Edges carry time stamps based on the paper's publication date, and multiple time-stamped edges between the same two concepts are common. The network is edge-weighted, and the weight of an edge represents the number of papers that connect the two concepts. In total, this creates a time-evolving semantic network, depicted in Fig. 2 .
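To make the construction concrete, the following is a minimal sketch (not the authors' released code) of how such a time-stamped co-occurrence network could be assembled from per-paper concept lists; the `papers` list and its contents are hypothetical stand-ins.

```python
# Minimal sketch of the semantic-network construction described above.
import itertools
import networkx as nx

G = nx.MultiGraph()  # multiple time-stamped edges between the same concepts are allowed

papers = [  # hypothetical per-paper concept lists with publication dates
    {"date": "2020-05-01", "concepts": ["neural network", "reinforcement learning"]},
    {"date": "2021-02-15", "concepts": ["neural network", "transfer learning", "medical domain"]},
]

for paper in papers:
    # every pair of concepts co-appearing in a title/abstract becomes one time-stamped edge
    for u, v in itertools.combinations(sorted(set(paper["concepts"])), 2):
        G.add_edge(u, v, date=paper["date"])

# the edge weight between two concepts is the number of papers linking them
weight = G.number_of_edges("neural network", "transfer learning")
```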

Figure 2

Utilizing 143,000 AI and ML papers on arXiv from 1992 to 2020, we create a list of concepts using RAKE and other NLP tools, which form nodes in a semantic network. Edges connect concepts that co-occur in titles or abstracts, resulting in an evolving network that expands as more concepts are jointly investigated. The task involves predicting which unconnected nodes (concepts not yet studied together) will connect within a few years. We present ten diverse statistical and ML methods to address this challenge.

Network-theoretical analysis

The published semantic network has 64,719 nodes and 17,892,352 unique undirected edges, with a mean node degree of 553. Many hub nodes greatly exceed this mean degree, as shown in Fig. 3 . For example, the highest node degrees are 466,319 (neural network), 198,050 (deep learning), 195,345 (machine learning), 169,555 (convolutional neural network), 159,403 (real world), 150,227 (experimental result), 127,642 (deep neural network) and 115,334 (large scale). We fit a power-law curve to the degree distribution p(k) using ref. 32 and obtained p(k) ∝ k^−2.28 for degree k ≥ 1,672. However, real complex network degree distributions often follow power laws with exponential cut-offs 33 . Recent work 34 has indicated that lognormal distributions fit most real-world networks better than power laws. Likelihood ratio tests from ref. 32 suggest that truncated power law (P = 0.0031), lognormal (P = 0.0045) and lognormal positive (P = 0.015) fit better than a pure power law, while exponential (P = 3 × 10^−10) and stretched exponential (P = 6 × 10^−5) fit worse. We could not conclusively determine the best fit at the P ≤ 0.1 level.
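This degree-distribution analysis can be reproduced in a few lines with the powerlaw package of ref. 32; the sketch below is illustrative and assumes a hypothetical list `degrees` holding the degree of every node in the network.

```python
# Minimal sketch of the power-law fit and likelihood-ratio comparisons, assuming
# `degrees` is a hypothetical list of node degrees extracted from the network.
import powerlaw

fit = powerlaw.Fit(degrees, discrete=True)
print(fit.power_law.alpha, fit.power_law.xmin)  # e.g. exponent ~2.28, xmin ~1,672

# Likelihood-ratio comparison of candidate distributions; a positive R favours
# the first distribution, p gives the significance of the comparison.
R, p = fit.distribution_compare('power_law', 'lognormal')
R, p = fit.distribution_compare('power_law', 'truncated_power_law')
```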

Figure 3

Nodes with the highest (466,319) and lowest (2) non-zero degrees are neural network and video compression technique, respectively. The most frequent non-zero degree is 64 (which occurs 313 times). The plot, in log scale, omits 1,247 nodes with zero degrees.

We observe changes in network connectivity over time. Although degree distributions remained heavy-tailed, the ordering of nodes within the tail changed due to popularity trends. The most connected nodes and the years they became so include decision tree (1994), machine learning (1996), logic program (2000), neural network (2005), experimental result (2011), machine learning (2013, for a second time) and neural network (2015).

Connected component analysis in Fig. 4 reveals that the network grew more connected over time, with the largest group expanding and the number of connected components decreasing. Mid-sized connected components’ trajectories may expose trends, like image processing. A connected component with four nodes appeared in 1999 (brightness change, planar curve, local feature, differential invariant), and three more joined in 2000 (similarity transformation, template matching, invariant representation). In 2006, a paper discussing support vector machine and local feature merged this mid-sized group with the largest connected component.

Figure 4

Primary (left, blue) vertical axis: number of connected components with more than one node. Secondary (right, orange) vertical axis: number of nodes in the largest connected component. For example, the network in 2019 comprises one large connected component with 63,472 nodes and 1,247 isolated nodes, that is, nodes with no edges. The 2001 network, by contrast, has 19 connected components with size greater than one, the largest of which has 2,733 nodes.

The semantic network reveals increasing centralization over time, with a smaller percentage of nodes (concepts) contributing to a larger fraction of edges (concept combinations). Figure 5 shows that the fraction of edges for high-degree nodes rises, while it decreases for low-degree nodes. The decreasing average clustering coefficient over time supports this trend, suggesting nodes are more likely to connect to high-degree central nodes. This could be due to the AI community’s focus on a few dominating methods or more consistent terminology use.

Figure 5

This cumulative histogram illustrates the fraction of nodes (concepts) corresponding to the fraction of edges (connections) for given years (1999, 2003, 2007, 2011, 2015 and 2019). The graph was generated by adding edges and nodes dated before each year. Nodes are sorted by increasing degrees. The y value at x  = 80 represents the fraction of edges contributed by all nodes in and below the 80th percentile of degrees.

Problem formulation

At a high level, we aim to make predictions in an exponentially growing semantic network. The specific task is to predict which pairs of nodes v1 and v2, each with degree d(v1), d(v2) ≥ c and lacking an edge in the year (2021 − δ), will be connected by w edges in 2021. We use δ = 1, 3, 5, c = 0, 5, 25 and w = 1, 3, where c is a minimal degree. Note that c = 0 is an intriguing special case in which the nodes may not have any associated edge in the initial year, requiring the model to predict which nodes will form entirely new connections. The task w = 3 goes beyond simple link prediction and seeks to identify uninvestigated concept pairs that will appear together in at least three papers. An interesting alternative task could be predicting the fastest-growing links, denoted as ‘trend’ prediction.

In this task, we provide a list of 10 million unconnected node pairs (each node having a degree ≥ c ) for the year (2021 −  δ ), with the goal of sorting this list by descending probability that they will have at least w edges in 2021.

For evaluation, we employ the receiver operating characteristic (ROC) curve 35 , which plots the true-positive rate against the false-positive rate at various threshold settings. We use the area under the curve (AUC) of the ROC curve as our evaluation metric. The advantage of AUC over mean square error is its independence from the data distribution. Specifically, in our case, where the two classes are highly imbalanced (only about 1–3% of pairs become newly connected) and the distribution changes over time, AUC offers a meaningful interpretation. Perfect predictions yield AUC = 1, whereas random predictions result in AUC = 0.5. The AUC equals the probability that a randomly chosen true element is ranked higher than a randomly chosen false one. For other metrics, see ref. 36 .
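As an illustration of the evaluation, the sketch below computes the AUC with scikit-learn; the `scores` and `labels` arrays are random stand-ins for the model outputs and ground truth over the candidate pairs, and only the roc_auc_score call reflects the metric described above.

```python
# Minimal sketch of the AUC evaluation on hypothetical predictions.
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
scores = rng.random(10_000)           # stand-in for predicted link probabilities
labels = rng.random(10_000) < 0.03    # ~3% positives, mimicking the class imbalance

auc = roc_auc_score(labels, scores)   # 0.5 = random ranking, 1.0 = perfect ranking
print(f"AUC = {auc:.3f}")
```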

To tackle this task, models can use the complete information of the semantic network from the year (2021 −  δ ) in any way possible. In our case, all presented models generate a dataset for learning to make predictions from (2021 − 2 δ ) to (2021 −  δ ). Once the models successfully complete this task, they are applied to the test dataset to make predictions from (2021 −  δ ) to 2021. All reported AUCs are based on the test dataset. Note that solving the test dataset is especially challenging due to the δ -year shift, which causes systematic changes in, for example, the number of papers and the density of the semantic network.

AI-based solutions

We demonstrate various methods to predict new links in a semantic network, ranging from pure statistical approaches and neural networks with hand-crafted features (NF) to ML models without NF. The results are shown in Fig. 6 , with the highest AUC scores achieved by methods using NF as ML model inputs. Pure network features without ML are competitive, while pure ML methods have yet to outperform those with NF. Predicting links generated at least three times can achieve a quasi-deterministic AUC > 99.5%, suggesting an interesting target for computational sociology and science of science research. We have performed numerous tests to exclude data leakage in the benchmark dataset, overfitting or data duplication both in the set of articles and the set of concepts. We rank methods based on their performance, with model M1 as the best performing and model M8 as the least effective (for the prediction of a new edge with δ  = 3, c  = 0). Models M4 and M7 are subdivided into M4A, M4B, M7A and M7B, differing in their focus on feature or embedding selection (more details in Methods ).

Figure 6

Here we show the AUC values for different models that use machine learning techniques (ML), hand-crafted network features (NF) or a combination thereof. The left plot shows results for the prediction of a single new link (that is, w  = 1) and the right plot shows the results for the prediction of new triple links ( w  = 3). The task is to predict δ  = [1, 3, 5] years into the future, with cut-off values c  = [0, 5, 25]. We sort the models by the results for the task ( w  = 1,  δ  = 3,  c  = 0), which was the task in the Science4Cast competition. Data points that are not shown have an AUC below 0.6 or were not computed due to computational costs. All AUC values reported are computed on a validation dataset δ years ahead of the training dataset that the models have never seen. Note that the prediction of new triple edges can be performed nearly deterministically. It will be interesting to understand the origin of this quasi-deterministic pattern in AI research, for example, by connecting it to the research interests of scientists 88 .

Model M1: NF + ML. This approach combines tree-based gradient boosting with graph neural networks, using extensive feature engineering to capture node centralities, proximity and temporal evolution 37 . The Light Gradient Boosting Machine (LightGBM) model 38 is employed with heavy regularization to combat overfitting due to the scarcity of positive examples, while a time-aware graph neural network learns dynamic node representations.

Model M2: NF + ML. This method utilizes node and edge features (as well as their first and second derivatives) to predict link formation probabilities 39 . Node features capture popularity, and edge features measure similarity. A multilayer perceptron with rectified linear unit (ReLU) activation is used for learning. Cold start issues are addressed with feature imputation.

Model M3: NF + ML. This method captures hand-crafted node features over multiple time snapshots and employs a long short-term memory (LSTM) to learn time dependencies 40 . The features were selected to be highly informative while having a low computational cost. The final configuration uses degree centrality, degree of neighbours and common neighbours as features. The LSTM outperforms fully connected neural networks.

Model M4: pure NF. Two purely statistical methods, preferential attachment 41 and common neighbours 27 , are used 42 . Preferential attachment is based on node degrees, while common neighbours relies on the number of shared neighbours. Both methods are computationally inexpensive and perform competitively with some learning-based models.

Model M5: NF + ML. Here, ten groups of first-order graph features are extracted to obtain neighbourhood and similarity properties, with principal component analysis 43 applied for dimensionality reduction 44 . A random forest classifier is trained on the balanced dataset to predict new links.

Model M6: NF + ML. The baseline solution uses 15 hand-crafted features as input to a four-layer neural network, predicting the probability of link formation between node pairs 17 .

Model M7: end-to-end ML (auto node embedding). The baseline solution is modified to use node2vec 45 and ProNE embeddings 46 instead of hand-crafted features. The embeddings are input to a neural network with two hidden layers for link prediction.

Model M8: end-to-end ML (transformers). This method learns features in an unsupervised manner using transformers 47 . Node2vec embeddings 45 , 48 are generated for various snapshots of the adjacency matrix, and a transformer model 49 is pre-trained as a feature extractor. A two-layer ReLU network is used for classification.

Extensions and future work

Developing an AI that suggests research topics to scientists is a complex task, and our link-prediction approach in temporal networks is just the beginning. We highlight key extensions and future work directly related to the ultimate goal of AI for AI.

High-quality predictions without feature engineering. Interestingly, the most effective methods utilized carefully crafted features on a graph with extracted concepts as nodes and edges representing their joint publication history. Investigating whether end-to-end deep learning can solve tasks without feature engineering will be a valuable next step.

Fully automated concept extraction. Current concept lists, generated by RAKE’s statistical text analysis, demand time-consuming code development to address irrelevant term extraction (for example, verbs, adjectives). A fully automated NLP technique that accurately extracts meaningful concepts without manual code intervention would greatly enhance the process.

Leveraging ontology taxonomies. Alongside fully automated concept extraction, utilizing established taxonomies such as the CSO 30 , 31 , Wikipedia-extracted concepts, book indices 17 or PhySH key phrases is crucial. Although not comprehensive for all domains, these curated datasets often contain hierarchical and relational concept information, greatly improving prediction tasks.

Incorporating relation extraction. Future work could explore relation extraction techniques for constructing more accurate, sparser semantic networks. By discerning and classifying meaningful concept relationships in abstracts 50 , 51 , a refined AI literature representation is attainable. Using NLP tools for entity recognition, relationship identification and classification, this approach may enhance prediction performance and novel research direction identification.

Generation of new concepts. Our work predicts links between known concepts, but generating new concepts using AI remains a challenge. This unsupervised task, as explored in refs. 52 , 53 , involves detecting concept clusters with dynamics that signal new concept formation. Incorporating emerging concepts into the current framework for suggesting research topics is an intriguing future direction.

Semantic information beyond concept pairs. Currently, abstracts and titles are compressed into concept pairs, but more comprehensive information extraction could yield meaningful predictions. Exploring complex data structures such as hypergraphs 54 may be computationally demanding, but clever tricks could reduce complexity, as shown in ref. 55 . Investigating sociological factors or drawing inspiration from material science approaches 56 may also improve prediction tasks. A recent dataset for the study of the science of science also includes more complex data structures than the ones used in our paper, including data from social networks such as Twitter 57 .

Predictions of scientific success. While predicting new links between concepts is valuable, assessing their potential impact is essential for high-quality suggestions. Introducing a metric of success, like estimated citation numbers or citation growth rate, can help gauge the importance of these connections. Adapting citation prediction techniques from the science of science 58 , 59 , 60 , 61 to semantic networks offers a promising research direction.

Anomaly detections. Predicting likely connections may not align with finding surprising research directions. One method for identifying surprising suggestions involves constraining cosine similarity between vertices 62 , which measures shared neighbours and can be associated with semantic (dis)similarity. Another approach is detecting anomalies in semantic networks, which are potential links with extreme properties 63 , 64 . While scientists often focus on familiar topics 3 , 4 , greater impact results from unexpected combinations of distant domains 12 , encouraging the search for surprising associations.

End-to-end formulation. Our method breaks down the goal of extracting knowledge from scientific literature into subtasks, contrasting with end-to-end deep learning that tackles problems directly without subproblems 65 , 66 . End-to-end approaches have shown great success in various domains 67 , 68 , 69 . Investigating whether such an end-to-end solution can achieve similar success in our context would be intriguing.

Our method represents a crucial step towards developing a tool that can assist scientists in uncovering novel avenues for exploration. We are confident that our outlined ideas and extensions pave the way for achieving practical, personalized, interdisciplinary AI-based suggestions for new impactful discoveries. We firmly believe that such a tool holds the potential to become an influential catalyst, transforming the way scientists approach research questions and collaborate in their respective fields.

Details on concept set generation and application

In this section, we provide details on the generation of our list of 64,719 concepts. For more information, the code is accessible on GitHub . The entire approach is designed for immediate scalability to other domains.

Initially, we utilized approximately 143,000 arXiv papers from the categories cs.AI, cs.LG, cs.NE and stat.ML spanning 1992 to 2020. The omission of earlier data has a negligible effect on our research question, as we show below. We then iterated over each individual article, employing RAKE (with an extended stopword list) to suggest concept candidates, which were subsequently stored.

Following the iteration, we retained concepts composed of at least two words (for example, neural network) appearing in six or more articles, as well as concepts comprising a minimum of three words (for example, recurrent neural network) appearing in three or more articles. This initial filter substantially reduced noise generated by RAKE, resulting in a list of 104,948 concepts.

Lastly, we developed an automated filtering tool to further enhance the quality of the concept list. This tool identified common, domain-independent errors made by RAKE, which primarily included phrases that were not concepts (for example, dataset provided or discuss open challenge). We compiled a list of 543 words not part of meaningful concepts, including verbs, ordinal numbers, conjunctions and adverbials. Ultimately, this process produced our final list of 64,719 concepts employed in our study. No further semantic concept/entity linking is applied.
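For illustration, a minimal sketch of the candidate-extraction step is given below, assuming the rake_nltk package as one possible RAKE implementation; the extended stopword list and the frequency and blacklist filters described above are only indicated schematically, not reproduced exactly.

```python
# Minimal sketch of RAKE-based concept-candidate extraction from one abstract.
from rake_nltk import Rake

abstract = ("We propose a recurrent neural network architecture for "
            "reinforcement learning on neuromorphic hardware.")

rake = Rake()  # an extended stopword list could be supplied via the `stopwords` argument
rake.extract_keywords_from_text(abstract)
candidates = rake.get_ranked_phrases()

# In the full pipeline, two-word concepts must appear in >= 6 articles and
# three-word concepts in >= 3 articles, and phrases containing any of the
# 543 blacklisted words (verbs, adverbials, ...) are dropped.
concepts = [c for c in candidates if len(c.split()) >= 2]
print(concepts)
```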

By this construction, the test sets with c  = 0 could contain a very small amount of contamination, because every concept is guaranteed to have at least one edge in the final dataset. The effect, however, is negligible.

The distribution of concepts in the articles can be seen in Extended Data Fig. 1 . As an example, we show the extraction of concepts from five randomly chosen papers:

Memristor hardware-friendly reinforcement learning 70 : ‘actor critic algorithm’, ‘neuromorphic hardware implementation’, ‘hardware neural network’, ‘neuromorphic hardware system’, ‘neural network’, ‘large number’, ‘reinforcement learning’, ‘case study’, ‘pre training’, ‘training procedure’, ‘complex task’, ‘high performance’, ‘classical problem’, ‘hardware implementation’, ‘synaptic weight’, ‘energy efficient’, ‘neuromorphic hardware’, ‘control theory’, ‘weight update’, ‘training technique’, ‘actor critic’, ‘nervous system’, ‘inverted pendulum’, ‘explicit supervision’, ‘hardware friendly’, ‘neuromorphic architecture’, ‘hardware system’.

Automated deep learning analysis of angiography video sequences for coronary artery disease 71 : ‘deep learning approach’, ‘coronary artery disease’, ‘deep learning analysis’, ‘traditional image processing’, ‘deep learning’, ‘image processing’, ‘f1 score’, ‘video sequence’, ‘error rate’, ‘automated analysis’, ‘coronary artery’, ‘vessel segmentation’, ‘key frame’, ‘visual assessment’, ‘analysis method’, ‘analysis pipeline’, ‘coronary angiography’, ‘geometrical analysis’.

Demographic influences on contemporary art with unsupervised style embeddings 72 : ‘classification task’, ‘social network’, ‘data source’, ‘visual content’, ‘graph network’, ‘demographic information’, ‘social connection’, ‘visual style’, ‘historical dataset’, ‘novel information’

The utility of general domain transfer learning for medical language tasks 73 : ‘natural language processing’, ‘long short term memory’, ‘logistic regression model’, ‘transfer learning technique’, ‘short term memory’, ‘average f1 score’, ‘class classification model’, ‘domain transfer learning’, ‘weighted average f1 score’, ‘medical natural language processing’, ‘natural language process’, ‘transfer learning’, ‘f1 score’, ‘natural language’, ‘deep model’, ‘logistic regression’, ‘model performance’, ‘classification model’, ‘text classification’, ‘regression model’, ‘nlp task’, ‘short term’, ‘medical domain’, ‘weighted average’, ‘class classification’, ‘bert model’, ‘language processing’, ‘biomedical domain’, ‘domain transfer’, ‘nlp model’, ‘main model’, ‘general domain’, ‘domain model’, ‘medical text’.

Fast neural architecture construction using envelopenets 74 : ‘neural network architecture’, ‘neural architecture search’, ‘deep network architecture’, ‘image classification problem’, ‘neural architecture search method’, ‘neural network’, ‘reinforcement learning’, ‘deep network’, ‘image classification’, ‘objective function’, ‘network architecture’, ‘classification problem’, ‘evolutionary algorithm’, ‘neural architecture’, ‘base network’, ‘architecture search’, ‘training epoch’, ‘search method’, ‘image class’, ‘full training’, ‘automated search’, ‘generated network’, ‘constructed network’, ‘gpu day’.

Time gap between the generation of edges

We use articles from arXiv, which only goes back to the year 1992. However, the field of AI has existed since at least the 1960s 75 . This raises the question of whether the omission of the first 30–40 years of research has a crucial impact on the prediction task we formulate; specifically, whether edges that we consider as new might not be so new after all. Thus, in Extended Data Fig. 2 , we compute the time between the formation of edges between the same concepts, taking into account either all edges or just the first edge. We see that the vast majority of edges are formed within short time periods; thus, the omission of early publications has a negligible effect on our question. Of course, other questions might be crucially affected by the early data, so a careful choice of the data source is crucial 61 .

Positive examples in the test dataset

Table 1 shows the number of positive cases within the 10 million examples in the 18 test datasets that are used for evaluation.

Publication rates in quantum physics

Another field of research that has gained a lot of attention in recent years is quantum physics. This field is also a strong adopter of arXiv. Thus, we analyse it in the same way as we did for AI in Fig. 1 . We find in Extended Data Fig. 3 no obvious exponential increase in papers per month. A detailed analysis of other domains is beyond the current scope. It will be interesting to investigate the growth rates of different scientific disciplines in more detail, especially given that exponential increase has been observed in several aspects of the science of science 3 , 76 .

Details on models M1–M8

What follows are more detailed explanations of the models presented in the main text. All code is available on GitHub. The feature importance of the best model, M1, is shown here; those of the other models are analysed in the respective workshop contributions (cited in the subsections below).

Details on M1

The best-performing solution is based on a blend of a tree-based gradient boosting approach and a graph neural network approach 37 . Extensive feature engineering was conducted to capture the centralities of the nodes, the proximity between node pairs and their evolution over time. The centrality of a node is captured by the number of neighbours and the PageRank score 77 , while the proximity between a node pair is derived using the Jaccard index. We refer the reader to ref. 37 for the list of all features and their feature importance.

The tree-based gradient boosting approach uses LightGBM 38 and applies heavy regularization to combat overfitting due to the scarcity of positive samples. The graph neural network approach employs a time-aware graph neural network to learn node representations on dynamic semantic networks. The feature importance of model M1, averaged over 18 datasets, is shown in Table 2 . It shows that the temporal features contribute substantially to the model performance, but the model remains strong even when they are removed. An example of the evolution of the training set (from 2016 to 2019) and test set (2019 to 2021) for δ  = 3, c  = 25, w  = 1 is shown in Extended Data Fig. 4 .
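A minimal sketch of the gradient-boosting half of M1 is given below; the feature arrays (X_train, y_train, X_test) and the hyperparameter values are illustrative assumptions, not those of ref. 37.

```python
# Minimal sketch of a heavily regularized LightGBM ranker for the link-prediction task.
import lightgbm as lgb

params = {
    "objective": "binary",
    "metric": "auc",
    "learning_rate": 0.05,
    "num_leaves": 31,
    "lambda_l1": 1.0,        # strong L1/L2 regularization against overfitting
    "lambda_l2": 1.0,
    "feature_fraction": 0.8,
    "scale_pos_weight": 30,  # compensate the ~1-3% positive rate
}

# X_train/y_train: hypothetical arrays of hand-crafted pair features
# (degrees, PageRank, Jaccard index, their temporal evolution, ...) and binary labels
train_set = lgb.Dataset(X_train, label=y_train)
model = lgb.train(params, train_set, num_boost_round=500)
scores = model.predict(X_test)  # probabilities used to rank the 10 million candidate pairs
```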

Details on M2

The second method assumes that the probability that nodes u and v form an edge in the future is a function of the node features f ( u ), f ( v ) and some edge feature h ( u ,  v ). We chose node features f that capture popularity at the current time t 0 (such as degree, clustering coefficient 78 , 79 and PageRank 77 ). We also use these features’ first and second time derivatives to capture the evolution of the node’s popularity over time. After variable selection during training, we chose h to consist of the HOP-rec score (high-order proximity for implicit recommendation) 80 , 81 and a variation of the Dice similarity score 82 as a measure of similarity between nodes. In summary, we use 31 node features for each node, and two edge features, which gives 31 × 2 + 2 = 64 features in total. These features are then fed into a small multilayer perceptron (5 layers, each with 13 neurons) with ReLU activation.

Cold start is the problem that some nodes in the test set do not appear in the training set. Our strategy for a cold start is imputation. We say a node v is seen if it appeared in the training data, and unseen otherwise; similarly, we say that a node is born at time t if t is the first time stamp where an edge linking this node has appeared. The idea is that an unseen node is simply a node born in the future, so its features should look like a recently born node in the training set. If a node is unseen, then we impute its features as the average of the features of the nodes born recently. We found that with imputation during training, the test AUC scores across all models consistently increased by about 0.02. For a complete description of this method, we refer the reader to ref. 39 .
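The imputation idea can be sketched as follows; the data structures (`features`, `birth_year`) are hypothetical, and the window defining "recently born" nodes is an illustrative choice.

```python
# Minimal sketch of the cold-start imputation strategy of M2.
import numpy as np

def impute_unseen(features, birth_year, train_end_year, window=2):
    """Unseen nodes are treated as nodes 'born in the future': their features are
    imputed as the average feature vector of recently born training nodes."""
    recent = [v for v, y in birth_year.items() if y >= train_end_year - window]
    return np.mean([features[v] for v in recent], axis=0)

# usage: any test node absent from the training graph receives this average vector
default_vector = impute_unseen(features, birth_year, train_end_year=2019)
```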

Details on M3

This approach, detailed in ref. 40 , uses hand-crafted node features that have been captured in multiple time snapshots (for example, every year) and then uses an LSTM to benefit from learning the time dependencies of these features. The final configuration uses two main types of feature: node features including degree and degree of neighbours, and edge features including common neighbours. In addition, to balance the training data, the same number of positive and negative instances have been randomly sampled and combined.

One of the goals was to identify features that are very informative with a very low computational cost. We found that the degree centrality of the nodes is the most important feature, and the degree centrality of the neighbouring nodes and the degree of mutual neighbours gave us the best trade-off. As all of the extracted features’ distributions are highly skewed to the right, meaning most of the features take near zero values, using a power transform such as Yeo–Johnson 83 helps to make the distributions more Gaussian, which boosts the learning. Finally, for the link-prediction task, we saw that LSTMs perform better than fully connected neural networks.
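A minimal sketch of this preprocessing step is shown below, assuming a hypothetical feature array `X` with one axis per yearly snapshot; scikit-learn's PowerTransformer defaults to the Yeo–Johnson transform.

```python
# Minimal sketch of Gaussianizing skewed features before feeding them to the LSTM.
import numpy as np
from sklearn.preprocessing import PowerTransformer

# X: hypothetical array of shape (n_pairs, n_snapshots, n_features) of yearly node/edge features
n_pairs, n_snapshots, n_features = X.shape
transformer = PowerTransformer(method="yeo-johnson", standardize=True)

# fit on the flattened feature matrix, then restore the time-snapshot axis
X_flat = transformer.fit_transform(X.reshape(-1, n_features))
X_gaussianized = X_flat.reshape(n_pairs, n_snapshots, n_features)
# X_gaussianized is then fed into an LSTM that learns the time dependencies
```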

Details on M4

The following two methods are based on a purely statistical analysis of the test data and are explained in detail in ref. 42 .

Preferential attachment. In the network analysis, we concluded that the growth of this dataset tends to maintain a heavy-tailed degree distribution, often associated with scale-free networks. As mentioned before, the γ value of the degree distribution is very close to 2, suggesting that preferential attachment 41 is probably the main organizational principle of the network. As such, we implemented a simple prediction model following this procedure. Preferential attachment scores in link prediction are often quantified as

s_ij = k_i k_j,

with k_i and k_j the degrees of nodes i and j. However, this assumes the scoring of links between nodes that are already connected to the network, that is, k_i, k_j > 0, which is not the case for all the links we must score in the dataset. As a result, we define our preferential attachment model as

s_ij = (k_i + 1)(k_j + 1),

which remains well defined even for nodes without any edges.

Using this simple model with no free parameters, we could score new links and compare them with the other models. We immediately note that preferential attachment outperforms some learning-based models; although it never reaches the top AUC, it is extremely simple and has negligible computational cost.

Common neighbours. We explore another network-based approach to score the links. While the preferential attachment model we derived performed well, it uses no information about the distance between i and j, which is a popular feature used in link-prediction methods 27 . As such, we decided to test a method known as common neighbours 18 . We define Γ(i) as the set of neighbours of node i and Γ(i) ∩ Γ(j) as the set of common neighbours between nodes i and j. We can then score a pair of nodes with

s_ij = |Γ(i) ∩ Γ(j)|,

the intuition being that nodes that share a larger number of neighbours are more likely to become connected than distant nodes that do not share any.

Evaluating this score for each pair (i, j) in the dataset of unconnected pairs, which can be computed from the second power of the adjacency matrix, A², we obtained an AUC that is sometimes higher and sometimes lower than that of preferential attachment, but consistently close to the best learning-based models.
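Both M4 scores can be computed directly from a sparse adjacency matrix; the sketch below assumes a hypothetical SciPy matrix `A` (unweighted, symmetric) and a list `pairs` of unconnected node pairs, and mirrors the definitions above.

```python
# Minimal sketch of the two M4 scores on a sparse adjacency matrix.
import numpy as np
import scipy.sparse as sp

degrees = np.asarray(A.sum(axis=1)).ravel()
A2 = (A @ A).tocsr()  # entry (i, j) counts the common neighbours of i and j

def preferential_attachment(i, j):
    # +1 keeps the score defined for nodes that do not yet have any edge
    return (degrees[i] + 1) * (degrees[j] + 1)

def common_neighbours(i, j):
    return A2[i, j]

pa_scores = [preferential_attachment(i, j) for i, j in pairs]
cn_scores = [common_neighbours(i, j) for i, j in pairs]
```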

Details on M5

This method is based on ref. 44 . First, ten groups of first-order graph features are extracted to get some neighbourhood and similarity properties from each pair of nodes: degree centrality of nodes, pair’s total number of neighbours, common neighbours index, Jaccard coefficient, Simpson coefficient, geometric coefficient, cosine coefficient, Adamic–Adar index, resource allocation index and preferential attachment index. They are obtained for three consecutive years to capture the temporal dynamics of the semantic network, leading to a total of 33 features. Second, principal component analysis 43 is applied to reduce the correlation between features, speed up the learning process and improve generalization, which results in a final set of seven latent variables. Lastly, a random forest classifier is trained (using a balanced dataset) to estimate the likelihood of new links between the AI concepts.
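A minimal scikit-learn sketch of this pipeline is given below, assuming hypothetical feature and label arrays; the number of trees is an illustrative choice, while the seven principal components follow the description above.

```python
# Minimal sketch of the M5 pipeline: PCA on 33 pair features, then a random forest.
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import make_pipeline

# X: hypothetical (n_pairs, 33) array of neighbourhood/similarity features over three years
# y: hypothetical balanced binary labels for candidate concept pairs
model = make_pipeline(
    PCA(n_components=7),                      # 33 correlated features -> 7 latent variables
    RandomForestClassifier(n_estimators=200), # trained on the balanced dataset
)
model.fit(X, y)
link_probability = model.predict_proba(X_test)[:, 1]  # used to rank candidate links
```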

In this paper, a modification was made relative to the original formulation of the method 44 : two of the original features, average neighbour degree and clustering coefficient, were infeasible to extract for some of the tasks covered in this paper, as their computation is too heavy for such a large network, and they were therefore discarded. Owing to computational memory issues, it was not possible to run the model for some of the tasks covered in this study, so those results are missing.

Details on M6

The baseline solution for the Science4Cast competition was closely related to the model presented in ref. 17 . It uses 15 hand-crafted features of a pair of nodes v1 and v2: the degrees of v1 and v2 in the current year and the previous two years (six properties), the total numbers of neighbours of v1 and v2 in the current year and the previous two years (six properties), and the number of shared neighbours between v1 and v2 in the current year and the previous two years (three properties). These 15 features are the input of a neural network with four layers (15, 100, 10 and 1 neurons), intended to predict whether the nodes v1 and v2 will have w edges in the future. After training, the model computes the probability for all 10 million evaluation examples. This list is sorted and the AUC is computed.
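A minimal PyTorch sketch of the baseline classifier with the layer sizes given above follows; the training loop and the tensors X, y and X_eval are illustrative assumptions rather than the competition code.

```python
# Minimal sketch of the M6 baseline: 15 hand-crafted pair features -> link probability.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(15, 100), nn.ReLU(),
    nn.Linear(100, 10), nn.ReLU(),
    nn.Linear(10, 1),   # logit for "v1 and v2 will share w edges"
)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()

for _ in range(10):                     # a few illustrative epochs on hypothetical X, y
    optimizer.zero_grad()
    loss = loss_fn(model(X).squeeze(-1), y.float())
    loss.backward()
    optimizer.step()

probabilities = torch.sigmoid(model(X_eval).squeeze(-1))  # sorted to compute the AUC
```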

Details on M7

The solution M7 was not part of the Science4Cast competition and was therefore not described in the corresponding proceedings; thus, we provide more details here.

The most immediate way one can apply ML to this problem is by automating the detection of features. Quite simply, the baseline solution M6 is modified such that, instead of 15 hand-crafted features, the neural network is trained on features extracted from a graph embedding. We use two different embedding approaches. The first method employs node2vec (M7A) 45 , for which we use the implementation provided in the nodevectors Python package 84 . The second uses the ProNE embedding (M7B) 46 , which is based on sparse matrix factorizations modulated by the higher-order Cheeger inequality 85 .

The embeddings generate a 32-dimensional representation for each node, resulting in edge representations in [0, 1]^64. These features are input into a neural network with two hidden layers of size 1,000 and 30. Like M6, the model computes the probability for evaluation examples to determine the ROC. We compare ProNE to node2vec, a common graph embedding method using a biased random walk procedure with return and in–out parameters, which greatly affect network encoding. Initial experiments used default values for a 64-dimensional encoding before feeding it into the neural network. The higher variance in node2vec predictions is probably due to its sensitivity to hyperparameters. While ProNE is better suited for general multi-dataset link prediction, node2vec's sensitivity may help identify crucial network features for predicting temporal evolution.
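A minimal sketch of the embedding step is given below, assuming the nodevectors package mentioned above and a hypothetical NetworkX graph `G` of the semantic network at the training year; hyperparameters and node names are illustrative.

```python
# Minimal sketch of the M7 embedding step: node vectors -> concatenated edge vectors.
import numpy as np
from nodevectors import Node2Vec, ProNE

# M7A would use Node2Vec(n_components=32); here we show the ProNE variant (M7B)
embedder = ProNE(n_components=32)
embedder.fit(G)  # G: hypothetical semantic network at the training year

def edge_representation(u, v):
    # concatenate two 32-dimensional node vectors into one 64-dimensional edge vector
    return np.concatenate([embedder.predict(u), embedder.predict(v)])

x = edge_representation("neural network", "transfer learning")
# x is then the input to a neural network with two hidden layers (1,000 and 30 units)
```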

Details on M8

This model, which is detailed in ref. 47 , does not use any hand-crafted features but learns them in a completely unsupervised manner. To do so, we extract various snapshots of the adjacency matrix through time, capturing graphs in the form of A_t for t = 1994, …, 2019. We then embed each of these graphs into 128-dimensional Euclidean space via node2vec 45 , 48 . For each node u in the semantic graph, we extract different 128-dimensional vector embeddings n_u(A_1994), …, n_u(A_2019).

Transformers have performed extremely well in NLP tasks 49 ; thus, we apply them to learn the dynamics of the embedding vectors. We pre-train a transformer to help classify node pairs. For the transformer, the encoder and decoder had 6 layers each; we used 128 as the embedding dimension, 2,048 as the feed-forward dimension and 8-headed attention. This transformer acts as our feature extractor. Once we pre-train our transformer, we add a two-layer ReLU network with hidden dimension 128 as a classifier on top.
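A minimal sketch of the architecture sizes described above, using PyTorch's generic nn.Transformer as a stand-in, is shown below; the actual pre-training objective and classification head of ref. 47 are only indicated schematically, and the input tensors are illustrative.

```python
# Minimal sketch of the M8 transformer feature extractor and two-layer ReLU classifier.
import torch
import torch.nn as nn

transformer = nn.Transformer(
    d_model=128, nhead=8,
    num_encoder_layers=6, num_decoder_layers=6,
    dim_feedforward=2048,
)

classifier = nn.Sequential(   # two-layer ReLU classifier on top of the extracted features
    nn.Linear(2 * 128, 128), nn.ReLU(),
    nn.Linear(128, 1),
)

# hypothetical input: per-node sequences of yearly node2vec embeddings,
# shape (number of snapshots, batch, 128)
src = torch.randn(26, 4, 128)          # 1994-2019 snapshots for a batch of 4 nodes
features = transformer(src, src)[-1]   # last-step representation per node, shape (4, 128)
pair = torch.cat([features[0], features[1]]).unsqueeze(0)
logit = classifier(pair)               # score for the candidate node pair
```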

Data availability

All 18 datasets tested in this paper are available via Zenodo at https://doi.org/10.5281/zenodo.7882892 (ref. 86 ).

Code availability

All of the models and code described above can be found via GitHub at https://github.com/artificial-scientist-lab/FutureOfAIviaAI (ref. 5 ) and a permanent Zenodo record at https://zenodo.org/record/8329701 (ref. 87 ).

Clauset, A., Larremore, D. B. & Sinatra, R. Data-driven predictions in the science of science. Science 355 , 477–480 (2017).

Evans, J. A. & Foster, J. G. Metaknowledge. Science 331 , 721–725 (2011).

Fortunato, S. et al. Science of science. Science 359 , eaao0185 (2018).

Wang, D. & Barabási, A.-L. The Science of Science (Cambridge Univ. Press, 2021).

Krenn, M. et al. FutureOfAIviaAI. GitHub https://github.com/artificial-scientist-lab/FutureOfAIviaAI (2023).

Brown, T. et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33 , 1877–1901 (2020).

Rae, J. W. et al. Scaling language models: methods, analysis & insights from training gopher. Preprint at https://arxiv.org/abs/2112.11446 (2021).

Smith, S. et al. Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model. Preprint at https://arxiv.org/abs/2201.11990 (2022).

Chowdhery, A. et al. Palm: scaling language modeling with pathways. Preprint at https://arxiv.org/abs/2204.02311 (2022).

Kojima, T., Gu, S. S., Reid, M., Matsuo, Y. & Iwasawa, Y. Large language models are zero-shot reasoners. Preprint at https://arxiv.org/abs/2205.11916 (2022).

Zhang, H., Li, L. H., Meng, T., Chang, K.-W. & Broeck, G. V. d. On the paradox of learning to reason from data. Preprint at https://arxiv.org/abs/2205.11502 (2022).

Rzhetsky, A., Foster, J. G., Foster, I. T. & Evans, J. A. Choosing experiments to accelerate collective discovery. Proc. Natl Acad. Sci. USA 112 , 14569–14574 (2015).

Foster, J. G., Rzhetsky, A. & Evans, J. A. Tradition and innovation in scientists’ research strategies. Am. Sociol. Rev. 80 , 875–908 (2015).

Van Eck, N. J. & Waltman, L. Text mining and visualization using vosviewer. Preprint at https://arxiv.org/abs/1109.2058 (2011).

Van Eck, N. J. & Waltman, L. in Measuring Scholarly Impact: Methods and Practice (eds Ding, Y. et al.) 285–320 (Springer, 2014).

Wang, Q. et al. Paperrobot: Incremental draft generation of scientific ideas. Preprint at https://arxiv.org/abs/1905.07870 (2019).

Krenn, M. & Zeilinger, A. Predicting research trends with semantic and neural networks with an application in quantum physics. Proc. Natl Acad. Sci. USA 117 , 1910–1916 (2020).

Liben-Nowell, D. & Kleinberg, J. The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol. 58 , 1019–1031 (2007).

Albert, I. & Albert, R. Conserved network motifs allow protein–protein interaction prediction. Bioinformatics 20 , 3346–3352 (2004).

Zhou, T., Lü, L. & Zhang, Y.-C. Predicting missing links via local information. Eur. Phys. J. B 71 , 623–630 (2009).

Kovács, I. A. et al. Network-based prediction of protein interactions. Nat. Commun. 10 , 1240 (2019).

Muscoloni, A., Abdelhamid, I. & Cannistraci, C. V. Local-community network automata modelling based on length-three-paths for prediction of complex network structures in protein interactomes, food webs and more. Preprint at bioRxiv https://doi.org/10.1101/346916 (2018).

Pech, R., Hao, D., Lee, Y.-L., Yuan, Y. & Zhou, T. Link prediction via linear optimization. Physica A 528 , 121319 (2019).

Lü, L., Pan, L., Zhou, T., Zhang, Y.-C. & Stanley, H. E. Toward link predictability of complex networks. Proc. Natl Acad. Sci. USA 112 , 2325–2330 (2015).

Guimerà, R. & Sales-Pardo, M. Missing and spurious interactions and the reconstruction of complex networks. Proc. Natl Acad. Sci. USA 106 , 22073–22078 (2009).

Ghasemian, A., Hosseinmardi, H., Galstyan, A., Airoldi, E. M. & Clauset, A. Stacking models for nearly optimal link prediction in complex networks. Proc. Natl Acad. Sci. USA 117 , 23393–23400 (2020).

Zhou, T. Progresses and challenges in link prediction. iScience 24 , 103217 (2021).

Krenn, M. et al. On scientific understanding with artificial intelligence. Nat. Rev. Phys. 4 , 761–769 (2022).

Rose, S., Engel, D., Cramer, N. & Cowley, W. in Text Mining: Applications and Theory (eds Berry, M. W. & Kogan, J.) Ch. 1 (Wiley, 2010).

Salatino, A. A., Thanapalasingam, T., Mannocci, A., Osborne, F. & Motta, E. The computer science ontology: a large-scale taxonomy of research areas. In Proc. Semantic Web–ISWC 2018: 17th International Semantic Web Conference Part II Vol. 17, 187–205 (Springer, 2018).

Salatino, A. A., Osborne, F., Thanapalasingam, T. & Motta, E. The CSO classifier: ontology-driven detection of research topics in scholarly articles. In Proc. Digital Libraries for Open Knowledge: 23rd International Conference on Theory and Practice of Digital Libraries Vol. 23, 296–311 (Springer, 2019).

Alstott, J., Bullmore, E. & Plenz, D. powerlaw: a Python package for analysis of heavy-tailed distributions. PLoS ONE 9 , e85777 (2014).

Fenner, T., Levene, M. & Loizou, G. A model for collaboration networks giving rise to a power-law distribution with an exponential cutoff. Soc. Netw. 29 , 70–80 (2007).

Broido, A. D. & Clauset, A. Scale-free networks are rare. Nat. Commun. 10 , 1017 (2019).

Fawcett, T. ROC graphs: notes and practical considerations for researchers. Pattern Recognit. Lett. 31 , 1–38 (2004).

Sun, Y., Wong, A. K. & Kamel, M. S. Classification of imbalanced data: a review. Int. J. Pattern Recognit. Artif. Intell. 23 , 687–719 (2009).

Lu, Y. Predicting research trends in artificial intelligence with gradient boosting decision trees and time-aware graph neural networks. In 2021 IEEE International Conference on Big Data (Big Data) 5809–5814 (IEEE, 2021).

Ke, G. et al. LightGBM: a highly efficient gradient boosting decision tree. In Proc. 31st International Conference on Neural Information Processing Systems 3149–3157 (Curran Associates Inc., 2017).

Tran, N. M. & Xie, Y. Improving random walk rankings with feature selection and imputation Science4Cast competition, team Hash Brown. In 2021 IEEE International Conference on Big Data (Big Data) 5824–5827 (IEEE, 2021).

Sanjabi, N. Efficiently predicting scientific trends using node centrality measures of a science semantic network. In 2021 IEEE International Conference on Big Data (Big Data) 5820–5823 (IEEE, 2021).

Barabási, A.-L. Network science. Phil. Trans. R. Soci. A 371 , 20120375 (2013).

Moutinho, J. P., Coutinho, B. & Buffoni, L. Network-based link prediction of scientific concepts—a Science4Cast competition entry. In 2021 IEEE International Conference on Big Data (Big Data) 5815–5819 (IEEE, 2021).

Jolliffe, I. T. & Cadima, J. Principal component analysis: a review and recent developments. Phil. Trans. R. Soc. A 374 , 20150202 (2016).

Valente, F. Link prediction of artificial intelligence concepts using low computational power. In 2021 IEEE International Conference on Big Data (Big Data) 5828–5832 (2021).

Grover, A. & Leskovec, J. node2vec: scalable feature learning for networks. In Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 855–864 (ACM, 2016).

Zhang, J., Dong, Y., Wang, Y., Tang, J. & Ding, M. ProNE: fast and scalable network representation learning. In Proc. Twenty-Eighth International Joint Conference on Artificial Intelligence 4278–4284 (International Joint Conferences on Artificial Intelligence Organization, 2019).

Lee, H., Sonthalia, R. & Foster, J. G. Dynamic embedding-based methods for link prediction in machine learning semantic network. In 2021 IEEE International Conference on Big Data (Big Data) 5801–5808 (IEEE, 2021).

Liu, R. & Krishnan, A. PecanPy: a fast, efficient and parallelized python implementation of node2vec. Bioinformatics 37 , 3377–3379 (2021).

Vaswani, A. et al. Attention is all you need. In Proc. 31st International Conference on Neural Information Processing Systems 6000–6010 (Curran Associates Inc., 2017).

Zelenko, D., Aone, C. & Richardella, A. Kernel methods for relation extraction. J. Mach. Learn. Res. 3 , 1083–1106 (2003).

Bach, N. & Badaskar, S. A review of relation extraction. Literature Review for Language and Statistics II 2 , 1–15 (2007).

Salatino, A. A., Osborne, F. & Motta, E. How are topics born? Understanding the research dynamics preceding the emergence of new areas. PeerJ Comput. Sc. 3 , e119 (2017).

Salatino, A. A., Osborne, F. & Motta, E. AUGUR: forecasting the emergence of new research topics. In Proc. 18th ACM/IEEE on Joint Conference on Digital Libraries 303–312 (IEEE, 2018).

Battiston, F. et al. The physics of higher-order interactions in complex systems. Nat. Phys. 17 , 1093–1098 (2021).

Coutinho, B. C., Wu, A.-K., Zhou, H.-J. & Liu, Y.-Y. Covering problems and core percolations on hypergraphs. Phys. Rev. Lett. 124 , 248301 (2020).

Olivetti, E. A. et al. Data-driven materials research enabled by natural language processing and information extraction. Appl. Phys. Rev. 7 , 041317 (2020).

Lin, Z., Yin, Y., Liu, L. & Wang, D. SciSciNet: a large-scale open data lake for the science of science research. Sci. Data 10 , 315 (2023).

Azoulay, P. et al. Toward a more scientific science. Science 361 , 1194–1197 (2018).

Liu, H., Kou, H., Yan, C. & Qi, L. Link prediction in paper citation network to construct paper correlation graph. EURASIP J. Wirel. Commun. Netw. 2019 , 1–12 (2019).

Reisz, N. et al. Loss of sustainability in scientific work. New J. Phys. 24 , 053041 (2022).

Frank, M. R., Wang, D., Cebrian, M. & Rahwan, I. The evolution of citation graphs in artificial intelligence research. Nat. Mach. Intell. 1 , 79–85 (2019).

Newman, M. Networks (Oxford Univ. Press, 2018).

Kwon, D. et al. A survey of deep learning-based network anomaly detection. Cluster Comput. 22 , 949–961 (2019).

Pang, G., Shen, C., Cao, L. & Hengel, A. V. D. Deep learning for anomaly detection: a review. ACM Comput. Surv. 54 , 1–38 (2021).

Collobert, R. et al. Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12 , 2493–2537 (2011).

LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521 , 436–444 (2015).

Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Commun. ACM 60 , 84–90 (2017).

Mnih, V. et al. Human-level control through deep reinforcement learning. Nature 518 , 529–533 (2015).

Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529 , 484–489 (2016).

Wu, N., Vincent, A., Strukov, D. & Xie, Y. Memristor hardware-friendly reinforcement learning. Preprint at https://arxiv.org/abs/2001.06930 (2020).

Zhou, C. et al. Automated deep learning analysis of angiography video sequences for coronary artery disease. Preprint at https://arxiv.org/abs/2101.12505 (2021).

Huckle, N., Garcia, N. & Nakashima, Y. Demographic influences on contemporary art with unsupervised style embeddings. In Proc. Computer Vision–ECCV 2020 Workshops Part II Vol. 16, 126–142 (Springer, 2020).

Ranti, D. et al. The utility of general domain transfer learning for medical language tasks. Preprint at https://arxiv.org/abs/2002.06670 (2020).

Kamath, P., Singh, A. & Dutta, D. Fast neural architecture construction using envelopenets. Preprint at https://arxiv.org/abs/1803.06744 (2018).

Minsky, M. Steps toward artificial intelligence. Proc. IRE 49 , 8–30 (1961).

Bornmann, L., Haunschild, R. & Mutz, R. Growth rates of modern science: a latent piecewise growth curve approach to model publication numbers from established and new literature databases. Humanit. Soc. Sci. Commun. 8 , 224 (2021).

Brin, S. & Page, L. The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30 , 107–117 (1998).

Holland, P. W. & Leinhardt, S. Transitivity in structural models of small groups. Comp. Group Studies 2 , 107–124 (1971).

Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. Nature 393 , 440–442 (1998).

Yang, J.-H., Chen, C.-M., Wang, C.-J. & Tsai, M.-F. HOP-rec: high-order proximity for implicit recommendation. In Proc. 12th ACM Conference on Recommender Systems 140–144 (2018).

Lin, B.-Y. OGB_collab_project. GitHub https://github.com/brucenccu/OGB_collab_project (2021).

Sorensen, T. A. A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. Biol. Skar. 5 , 1–34 (1948).

Yeo, I.-K. & Johnson, R. A. A new family of power transformations to improve normality or symmetry. Biometrika 87 , 954–959 (2000).

Ranger, M. nodevectors. GitHub https://github.com/VHRanger/nodevectors (2021).

Bandeira, A. S., Singer, A. & Spielman, D. A. A Cheeger inequality for the graph connection Laplacian. SIAM J. Matrix Anal. Appl. 34 , 1611–1630 (2013).

Krenn, M. et al. Predicting the future of AI with AI. Zenodo https://doi.org/10.5281/zenodo.7882892 (2023).

Krenn, M. et al. FutureOfAIviaAI code. Zenodo https://zenodo.org/record/8329701 (2023).

Jia, T., Wang, D. & Szymanski, B. K. Quantifying patterns of research-interest evolution. Nat. Hum. Behav. 1 , 0078 (2017).

Acknowledgements

We thank IARAI Vienna and IEEE for supporting and hosting the IEEE BigData Competition Science4Cast. We are specifically grateful to D. Kreil, M. Neun, C. Eichenberger, M. Spanring, H. Martin, D. Geschke, D. Springer, P. Herruzo, M. McCutchan, A. Mihai, T. Furdui, G. Fratica, M. Vázquez, A. Gruca, J. Brandstetter and S. Hochreiter for helping to set up and successfully execute the competition and the corresponding workshop. We thank X. Gu for creating Fig. 2 , and M. Aghajohari and M. Sadegh Akhondzadeh for helpful comments on the paper. The work of H.L., R.S. and J.G.F. was supported by grant TWCF0333 from the Templeton World Charity Foundation. H.L. is additionally supported by NSF grant DMS-1952339. J.P.M. acknowledges the support of FCT (Portugal) through scholarship SFRH/BD/144151/2019. B.C. thanks the support from FCT/MCTES through national funds and when applicable co-funded EU funds under the project UIDB/50008/2020, and FCT through the project CEECINST/00117/2018/CP1495/CT0001. N.M.T. and Y.X. are supported by NSF grant DMS-2113468, the NSF IFML 2019844 award to the University of Texas at Austin, and the Good Systems Research Initiative, part of University of Texas at Austin Bridging Barriers.

Open access funding provided by Max Planck Society.

Author information

Authors and Affiliations

Max Planck Institute for the Science of Light (MPL), Erlangen, Germany

Mario Krenn

Instituto de Telecomunicações, Lisbon, Portugal

Lorenzo Buffoni, Bruno Coutinho & João P. Moutinho

University of Toronto, Toronto, Ontario, Canada

Sagi Eppel & Andrew Gritsevskiy

University of California Los Angeles, Los Angeles, CA, USA

Jacob Gates Foster, Harlin Lee & Rishi Sonthalia

Cavendish Laboratories, Cavendish, VT, USA

Andrew Gritsevskiy

Institute of Advanced Research in Artificial Intelligence (IARAI), Vienna, Austria

Andrew Gritsevskiy & Michael Kopp

Alpha 8 AI, Toronto, Ontario, Canada

Independent Researcher, Barcelona, Spain

Nima Sanjabi

University of Texas at Austin, Austin, TX, USA

Ngoc Mai Tran

Independent Researcher, Leiria, Portugal

Francisco Valente

University of Pennsylvania, Philadelphia, PA, USA

Yangxinyu Xie

University of California, San Diego, CA, USA

Contributions

M. Krenn and R.Y. initiated the research. M. Krenn and M. Kopp organized the Science4Cast competition. M. Krenn generated the datasets and initial codes. S.E. and H.L. analysed the network-theoretical properties of the semantic network. M. Krenn, L.B., B.C., J.G.F., A.G., H.L., Y.L., J.P.M., N.S., R.S., N.M.T., F.V., Y.X. and M. Kopp provided codes for the ten models. M. Krenn wrote the paper with input from all co-authors.

Corresponding author

Correspondence to Mario Krenn.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Machine Intelligence thanks Alexander Belikov, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Mirko Pieropan, in collaboration with the Nature Machine Intelligence team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1

Number of concepts per article.

Extended Data Fig. 2

Time Gap between the generation of edges. Here, left shows the time it takes to create a new edge between two vertices and right shows the time between the first and the second edge.

Extended Data Fig. 3

Publications in Quantum Physics.

Extended Data Fig. 4

Evolution of the AUC during training for Model M1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ .

About this article

Cite this article

Krenn, M., Buffoni, L., Coutinho, B. et al. Forecasting the future of artificial intelligence with machine learning-based link prediction in an exponentially growing knowledge network. Nat Mach Intell 5 , 1326–1335 (2023). https://doi.org/10.1038/s42256-023-00735-0

Received : 21 January 2023

Accepted : 11 September 2023

Published : 16 October 2023

Issue Date : November 2023

DOI : https://doi.org/10.1038/s42256-023-00735-0


Artificial Intelligence and Machine Learning in Sport Research: An Introduction for Non-data Scientists (Perspective Article)

  • Institute for Health and Sport, Victoria University, Melbourne, VIC, Australia

In the last two decades, artificial intelligence (AI) has transformed the way in which we consume and analyse sports. The role of AI in improving decision-making and forecasting in sports, amongst many other advantages, is rapidly expanding and gaining more attention in both the academic sector and the industry. Nonetheless, for many sports audiences, professionals and policy makers, who are not particularly au courant or experts in AI, the connection between artificial intelligence and sports remains fuzzy. Likewise, for many, the motivations for adopting a machine learning (ML) paradigm in sports analytics are still either faint or unclear. In this perspective paper, we present a high-level, non-technical overview of the machine learning paradigm that motivates its potential for enhancing sports (performance and business) analytics. We provide a summary of some relevant research literature on the areas in which artificial intelligence and machine learning have been applied to the sports industry and to sport research. Finally, we present some hypothetical scenarios of how AI and ML could shape the future of sports.

Introduction

It was in Moneyball (Lewis, 2004), the famous success story of the Major League Baseball team “Oakland Athletics,” that using in-game play statistics came under focus as a means to assemble an exceptional team. Despite Oakland Athletics' relatively small budget, the adoption of a rigorous data-driven approach to assemble a new team led to the playoffs in the year 2002. An economic evaluation of the Moneyball hypothesis (Hakes and Sauer, 2006) describes how, at the time, a baseball hitter's salary was not truly explained by the contribution of a player's batting skills to winning games. Oakland Athletics gained a big advantage over their competitors by identifying and exploiting this information gap. It has been almost two decades since Moneyball principles, or SABRmetrics (Lewis, 2004), were introduced to baseball. SABR stands for Society for American Baseball Research, and SABRmetricians are those scientists who gather the in-game data and analyse it to answer questions that will lead to improving team performance. Since the success of the Oakland Athletics, most MLB teams started employing SABRmetricians. The ongoing and exponential increase of computer processing power has further accelerated the ability to analyse “big data,” and indeed, computers increasingly are taking charge of the deeper analysis of data sets by means of artificial intelligence (AI). Likewise, the surge in high-quality data collection and data aggregation (accomplished by organisations like Baseball Savant/StatCast, ESPN and others) is a key ingredient in the spike in the accuracy and breadth of analytics observed in the MLB in recent years.

The adoption of AI and statistical modelling in sports has therefore become more prominent in recent years as new technologies and research applications are impacting professional sports at various levels of sophistication. The wide applicability of machine learning algorithms, combined with increasing computing processing power as well as access to more and new sources of data in recent years, has made sports organisations hungry for new applications and strategies. The overriding aim is still to make them more competitive on and off the field – in athletic and business performance. The benefits of leveraging the power of AI can, in that regard, take different forms, from optimising business or technical decision making to enhancing athlete/team performance, but also to increasing demand for attendance at sporting events, as well as promoting alternative entertainment formats of the sport.

We next list some areas where AI and machine learning (ML) have left their footprints in the world of sports ( Beal et al., 2019 ) and provide some examples of applications in each (some of the listed applications could overlap with one or more of the areas).

• Game activity/analytics: match outcome modelling, player/ball tracking, match event (e.g., shot) classification, umpire assistance, sports betting.

• Talent identification and acquisition: player recruitment, player performance measurement, biomechanics .

• Training and coaching: assessment of team formation efficacy, tactical planning, player injury modelling .

• Fan and business focused: measurement of a player's economic value, modelling demand for event attendance, ticket pricing optimisation (variable and dynamic), wearable and sensor design, highlight packaging, virtual and augmented reality sport applications, etc .

The field of AI (particularly ML) offers new methodologies that have proven to be beneficial for tackling the above challenges. In this perspective paper we aim to provide sports business professionals and non-technical sports audiences, coaches, business leaders, policy makers and stakeholders with an overview of the range of AI approaches used to analyse sport performance and business centric problems. We also discuss perspectives on how AI could shape the future of sports in the next few years.

Research on AI and ML in Sports

In this section, we will not be reviewing examples of how AI has been applied to sports for a specific application, but rather, we will look at the intersection of AI and sports at a more abstract level, discussing some research that either surveyed or summarised the application of AI and ML in sports.

One of the earliest works discussing the potential applications of artificial intelligence in sports performance, and its positive impact on improving decision-making is by Lapham and Bartlett (1995) . The paper discusses how expert systems (i.e., a knowledge-based database used for reasoning) can be used for sports biomechanics purposes. Bartlett (2006) reviewed developments in the use of AI in sports biomechanics (e.g., throwing, shot putting, football kicking, …) to show that, at the time of writing, expert systems were marginally used in sports biomechanics despite being popular for “gait analysis” whereas Artificial Neural Networks were used for applications such as performance patterns in training and movement patterns of sports performers. An Artificial Neural Network (ANN) is a system that mimics the functionality of a human brain. ANNs are used to solve computational problems or estimate functions from a given data input, by imitating the way neurons are fired or activated in the human brain. Several (layers of) artificial neurons, known as perceptrons, are connected to perform computations which return an output as a function of the provided input ( Anderson, 1995 ).

Bartlett (2006) predicted that multi-layer ANNs will play a big role in sports technique analysis in the future. Indeed, as we discuss later, multi-layer ANNs, now commonly referred to as Deep Learning, have become one of the most popular techniques in sports related analytics. Last but not least Bartlett (2006) described the applications of Evolutionary Computation and hybrid systems in the optimization of sports techniques and skill learning. Further discussion around the applications of AI in sports biomechanics can be found in Ratiu et al. (2010) . McCabe and Trevathan (2008) discussed the use of artificial intelligence for prediction of sporting outcomes, showing how the behaviour of teams can be modelled in different sporting contests using multi-layer ANNs.

Between 2006 and 2010, machine learning algorithms, particularly ANNs, were becoming more popular amongst computer scientists. This was aided by the impressive improvements in computer hardware, but also by a shift in mindset in the AI community. Large volumes of data were made public amongst researchers and scientists (e.g., ImageNet, a visual database delivered by Stanford University), and new open-source machine learning competitions were organised (such as the Netflix Prize and Kaggle). It is these types of events that have shaped the adoption of AI and machine learning in many different fields of study, from medicine to econometrics and sports, by facilitating access to training data and offering free open-source tools and frameworks for leveraging the power of AI. Note that, in addition to ANNs, other machine learning techniques are utilised in such competitions, and sometimes these can be used in combination with one another. For instance, some of the techniques that went into winning the Netflix Prize include singular value decomposition combined with restricted Boltzmann machines and gradient boosted decision trees.

Other examples discussing ANNs in sports include Novatchkov and Baca (2013), who discuss how ANNs can be used for understanding the quality of execution, assisting athletes and coaches, and training optimisation. However, the applications of AI to sports analytics go beyond the use of ANNs. For example, Fister et al. (2015) discussed how nature-inspired AI algorithms can be used to investigate unsolved research problems regarding safe and effective training plans. Their approach (Fister et al., 2015) relies on the notion of artificial collective intelligence (Chmait et al., 2016; Chmait, 2017) and the ability of algorithms to adapt to a changing environment. The authors show how such algorithms can be used to develop an artificial trainer that recommends an informed training strategy to athletes after taking into consideration various factors related to the athlete's physique and readiness. Other types of scientific methods that include Bayesian approaches have been applied to determining player abilities (Whitaker et al., 2021) but also to predicting match outcomes (Yang and Swartz, 2004). Bayesian analysis and learning is an approach for building (statistical and inference) models by updating the probability for a hypothesis as more evidence or information becomes available, using Bayes' theorem (Ghosh et al., 2007).

There are numerous research papers in which AI and ML are applied to sport, and it is not our aim to comprehensively discuss these works here 1 . However, we refer to a recent survey that elaborates on this topic. Beal et al. (2019) surveyed the applications of AI in team sports. The authors summarised existing academic work, in a range of sports, tackling issues such as match outcome modelling, in-game tactical decision making, player performance in fantasy sport games, and managing professional players' sport injuries. Work by Nadikattu (2020) presents, at an abstract level, discussions on how AI can be implemented in (American) sports, from enhancing player performance, to assisting coaches to come up with the right formations and tactics, to developing automated video highlights of sports matches and supporting referees using computer vision applications.

We emphasise that the application of AI in sports is not limited to topics of sports performance, athlete talent identification or the technical analysis of the game. The (off the field) business side of sports organisations is rapidly shifting towards a data driven culture led by developing profiles of their fans and their consumer preferences. As fans call for superior content and entertainment, sport organisations must react by delivering a customised experience to their patrons. This is often achieved by the use of statistical modelling as well as other machine learning solutions, for example, to understand the value of players from an economic perspective. As shown in Chmait et al. (2020a) , investigating the relationship between the talent and success of athletes (to determine the existence of what is referred to as superstardom phenomenon or star power) is becoming an important angle to explore value created in sport. To provide an idea of the extent of such work, we note some sports in which the relationship between famous players/teams and their effect on audience attendance or sport consumption has been studied:

• In soccer ( Brandes et al., 2008 ; Jewell, 2017 ),

• In Major League Baseball ( Ormiston, 2014 ; Lewis and Yoon, 2016 )

• In the National Basketball Association ( Berri et al., 2004 ; Jane, 2016 )

• In tennis: superstar player effect in demand for tennis tournament attendance ( Chmait et al., 2020a ), the presence of a stardom effect in social media ( Chmait et al., 2020b ), player effect on German television audience demand for live broadcast tennis matches ( Konjer et al., 2017 )

• And similarly, in Cricket ( Paton and Cooke, 2005 ), Hockey ( Coates and Humphreys, 2012 ), and in the Australian Football League ( Lenten, 2012 ).

AI algorithms are being used in Formula 1 (F1) to improve the racing tactics of competing teams by analysing data from hundreds of sensors in the F1 car. Recent work by Piccinotti (2021) shows how artificial intelligence can provide F1 with automated ways for identifying tyre replacement strategies by modelling pit-stop timing and frequency as sequential decision-making problems.

Researchers from Tennis Australia and Victoria University devised a racket recommendation technique based on real HawkEye (computer vision system) data. An algorithm was used to recommend a selection of rackets based on movement, hitting pattern and style of the player with the aim to improve the player's performance ( Krause, 2019 ).

Accurate and fair judging of sophisticated skills in sports like gymnastics is a difficult task. Recently, a judging system was developed by Fujitsu Ltd. The system scores a routine based on the angles of a gymnast's joints. It uses AI to analyse 3D laser sensors that capture the gymnasts' movements ( Atiković et al., 2020 ).

Finally, it is important to note the exceptionally successful adoption of AI in board games like Chess, Checkers, Shogi and the Chinese game of Go, as well as virtual games (like Dota 2 and StarCraft). In the last couple of decades, AI has delivered a staggering rise in performance in such areas, to the point that machines (almost) constantly defeat human world champions. We refer to some notable solutions like the checkers-solving algorithm of Schaeffer et al. (2007), Deep Blue defeating Kasparov in chess (Campbell et al., 2002), AlphaGo defeating Lee Sedol in Go and its successor AlphaGo Zero (Silver et al., 2017) (noting that AlphaZero is also unbeatable in chess), and the AlphaStar agent of Vinyals et al. (2019) in StarCraft II, as well as superhuman AI for multiplayer poker (Brown and Sandholm, 2019). Commonly, in these types of games or sports, AI algorithms rely on a reinforcement learning approach (which we will describe later) as well as techniques like Monte-Carlo tree search to explore the game and devise robust strategies to solve and play these games. Some of the recent testbeds used to evaluate AI agents and algorithms are discussed in Hernández-Orallo et al. (2017). For a broader investigation of AI in board and virtual/computer games, refer to Risi and Preuss (2020).

The rise of applying AI and ML is unstoppable and, to that end, one might be wondering how AI and ML tools work and why they are different from traditional summary analytics. We touch upon these considerations in the next section.

The Machine Learning Paradigm

To understand why ML is used in a wide range of applications, we need to take a look into the difference between recent AI approaches to learning and traditional analytics approaches. At a higher conceptual level, one can describe old or traditional approaches to sports analytics as starting off with some set of rules that constitute the problem definition and some data that is processed by a program/application, which then delivers answers to the given problem. In contrast, in a machine learning/predictive analytics paradigm, the way this process works is fundamentally different. For instance, in some approaches of the ML paradigm, one typically starts by feeding the program with answers and corresponding data for a specific problem, with an algorithm narrowing down the rules of the problem. These rules are later used for making predictions, and they are evaluated or validated by testing their accuracy on new (unseen) data.

To that end, machine learning is an area of AI that is concerned with algorithms that learn from data by performing some form of inductive learning. In simple terms, ML prediction could be described as a function 2 from a set of inputs i_1, i_2, …, i_n to forecast an unknown value y, as follows: f(w_1*i_1, w_2*i_2, …, w_n*i_n) = y, where w_t is the weight of input t.
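To make this concrete, the snippet below is a minimal sketch of such a weighted-input prediction, taking f to be a simple sum of the weighted inputs; the input values and weights are hypothetical, and in a real ML model the weights would be learned from data rather than set by hand.

```python
# Minimal sketch of f(w_1*i_1, ..., w_n*i_n) = y, with f taken to be a sum.
# Inputs and weights are hypothetical placeholders used only for illustration.
def predict(inputs, weights):
    """Combine the weighted inputs into a single forecast y."""
    return sum(w * i for w, i in zip(weights, inputs))

inputs = [0.8, 24.0, 310.0]    # e.g., training load, age, minutes played (made up)
weights = [1.5, -0.2, 0.01]    # in practice these would be learned from data
y = predict(inputs, weights)
print(y)                       # the forecast value y
```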

Different types or approaches of ML are used for different types of problems. Some of the most popular are supervised learning, unsupervised learning , and reinforcement learning :

• In supervised learning, we begin by observing and recording both inputs (the i 's) and outputs (the y 's) of a system, for a given period of time. This data (a collection of correct examples of inputs and their corresponding outputs) is then analysed to derive the rules that underlie the dynamics of the observed system, i.e., the rules that map a given input to its correct output.

• Unlike the above, in unsupervised learning, the correct examples or outputs from a given system are not available. The task of the algorithm is to discover (previously unnoticed) patterns in the input data.

• In reinforcement learning, an algorithm (usually referred to as an agent) is designed to take a series of actions that maximise its cumulative payoff or rewards over time. The agent then builds a policy (a map of action selection rules) that return a probability of taking a given action under different conditions of the problem.

For a thorough introduction to the fundamentals of machine learning and the popular ML algorithms see Bonaccorso (2017) . The majority of AI applications in sports are based on one or more of the above approaches to ML. In fact, in most predictive modelling applications, the nature of the output y that needs to be predicted or analysed could influence the architecture of the learning algorithm.

Explaining the details of how different ML techniques work is outside the scope of this paper. However, to provide an insight into how such algorithms function in layman's terms and the differences between them, we briefly present (hypothetical) supervised, unsupervised and reinforcement learning problems in the context of sports. These examples will assist the professionals but also applied researchers who work in sport to better understand the way that data scientists think so to facilitate talking to them about their approach and methodology, without requiring to dive deep into the details of the underlying analytics.

Supervised Learning: Predicting Player Injury

Many sports injuries (e.g., muscle strain) can be effectively treated or prevented if one is able to detect them early or predict the likelihood of sustaining them. There could be many different (combinations of) reasons/actions leading to injuries like muscle strain. For example, in the Australian Football League, some of the hypotheses put forward to explain muscle strain include: muscle weakness and lack of flexibility, fatigue, inadequate warm-up, and poor lumbar posture (Brockett et al., 2004). Detecting the patterns that can lead to such injuries is extremely important, both for the safety of the players and for the success and competitiveness of the team.

In a supervised learning scenario, data about the players would be collected from previous seasons including details such as the number of overall matches and consecutive matches they played, total time played in each match, categorised by age, number of metres run, whether or not they warmed up before the match, how many times they were tackled by other players, and so on , but more importantly, whether or not the players ended up injured and missed their next match.

The last point is very important as it is the principal difference between supervised learning and other approaches: the outcome (whether or not the player was injured) is known in the historical data that was collected from previous seasons. This historical data is then fed (with the outcome) to a machine learning algorithm with the objective of learning the patterns (combination of factors) which led to an injury (and usually assigning a probability of the likelihood of an injury given these patterns). Once these patterns are learnt, the algorithm or model is then tested on new (unseen data) to see if it performs well and indeed predicts/explains injury at a high level of accuracy (e.g., 70% of the time). If the accuracy of the model is not as required, the model is tuned (or trained with slightly different parameters) until it reaches the desired or acceptable accuracy. Note here that we did not single out a specific algorithm or technique to achieve the above. Indeed, this approach can be applied using many different ML algorithms such as Neural Networks, Decision Trees and regression models.
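The snippet below is a hedged sketch of this supervised workflow using scikit-learn; the feature names, the handful of synthetic player records and the choice of a decision tree are illustrative assumptions, not the model of any particular club.

```python
# A minimal sketch of supervised injury prediction; all data here is invented.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# Each row: [matches_played, consecutive_matches, minutes, age, metres_run, warmed_up]
X = np.array([
    [20, 5, 90, 31, 9500, 1],
    [12, 2, 70, 24, 8700, 1],
    [25, 8, 95, 33, 10100, 0],
    [18, 6, 85, 29, 9900, 0],
    [10, 1, 60, 22, 8000, 1],
    [22, 7, 92, 30, 10400, 0],
])
y = np.array([0, 0, 1, 1, 0, 1])   # 1 = injured before the next match, 0 = not injured

# Hold out part of the labelled history to test the model on unseen cases
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=0)

model = DecisionTreeClassifier(max_depth=3, random_state=0)
model.fit(X_train, y_train)                            # learn patterns from labelled history
print(accuracy_score(y_test, model.predict(X_test)))   # fraction of held-out cases predicted correctly
```

If the measured accuracy is below what is acceptable, the model would be retrained with different parameters or features, mirroring the tuning loop described above.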

Unsupervised Learning: Fan Segmentation

We will use a sport business example to introduce the unsupervised learning approach. Most sports organisations keep track of historical data about their patrons who attended their sporting events, recording characteristics such as their gender, postcode, age, nationality, education, income, marital status, etc. A natural question of interest here is to understand if the different segments of customers/patrons will purchase different categories (e.g., price, duration, class etc.) of tickets.

Some AI algorithms are designed to help split the available data, so that each data point (historical ticket sale) sits in a group/class that is similar to the other data points (other sales) in that same class given the recorded features. The algorithm will then use some sort of a similarity or distance metric to classify the patrons according to the category of tickets that they might purchase.

This is different from how supervised learning algorithms, like those discussed in the previous section, work. As we described before, in supervised learning we instruct the algorithm with the outcome in advance while training it (i.e., we classify/label each observation based on the outcome: injury or no injury, cheap or expensive seats, …). In the unsupervised learning approach, there is no such labelling or classification of existing historical data. It is the mission of the unsupervised learning algorithm to discover (previously unnoticed) patterns in the input data and group it into (two or more) classes.

Imagine the following use case where an Australian Football League club aims to identify a highly profitable customer segment within its entire set of stadium attendees, with the aim to enhance its marketing operations. Mathematical models can be used to discover (segments of) similar customers based on variations in some customer attributes within and across each segment. A popular unsupervised learning algorithm to achieve such goal is the K-means clustering algorithm which finds the class labels from the data. This is done by iteratively assigning the data points (e.g., customers) from the input into a group/class based on the characteristics of this input. The essence is that the groups or classes to which the data points are assigned to are not defined prior to exploring the input data (although the number of groups or segments can be pre-defined) but are rather dynamically formed as the K-means algorithm iterates over the data points. In the context of customer segmentation, when presenting the mathematical model (K-means algorithm) with customer data, there is no requirement to label a portion (or any of) of this data into groups in advance in order to train the model as usually done in supervised models.
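A minimal sketch of this segmentation idea with scikit-learn's K-means is shown below; the customer attributes, their values and the choice of three segments are hypothetical and serve only to illustrate that no labels are supplied in advance.

```python
# Unsupervised fan segmentation with K-means; the customer data is invented.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans

# Each row: [age, annual_spend_on_tickets, matches_attended_last_season]
customers = np.array([
    [19, 120, 3],
    [23, 150, 4],
    [45, 900, 18],
    [51, 1100, 20],
    [34, 400, 9],
    [38, 450, 10],
])

# Scale features so no single attribute dominates the distance metric
scaled = StandardScaler().fit_transform(customers)

# Ask for three segments; note that no class labels are provided up front
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(scaled)
print(kmeans.labels_)   # segment assignment for each customer, formed from the data itself
```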

Reinforcement Learning: Simulations and Fantasy Sports

As mentioned before, in reinforcement learning, an algorithm (such as the Q-learning or SARSA algorithm) learns how to complete a series of tasks (i.e., solve a problem) by interacting with an (artificial) environment that was designed to simulate the real environment/problem at hand. Unlike the case with supervised learning, the algorithm is not explicitly instructed about the right/accurate action in different states/conditions of the environment (or steps of the problem it is trying to solve). Rather, it incrementally learns such a protocol through reward maximisation.

In simple terms, reinforcement learning approaches represent problems using what are referred to as: an agent (a software algorithm), and a table of states and actions . When the agent executes an action, it transitions from one state to another and it receives a reward or a penalty (a positive or negative numerical score respectively) as a result. The reward/penalty associated with the action-state combination is then stored in the agent's table for future reference and refinement. The agent's goal is to take the action that maximises its reward. When the agent is still unaware of the expected rewards from executing a given action when at a given state, it takes a random action and updates its table following that action. After many (thousands of) iterations over the problem space, the agent's table holds (a weighted sum of) the expected values of the rewards of all future actions starting from the initial state.
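As an illustration, the following is a toy sketch of this table-based loop (tabular Q-learning); the five-state environment, the reward values and the hyperparameters are invented purely for demonstration and do not correspond to any real sports problem.

```python
# Toy tabular Q-learning: the agent must move right along five positions to reach
# a rewarding goal state. Environment and parameters are hypothetical.
import random

n_states, n_actions = 5, 2                          # actions: 0 = move left, 1 = move right
Q = [[0.0] * n_actions for _ in range(n_states)]    # the agent's state-action table
alpha, gamma, epsilon = 0.5, 0.9, 0.1               # learning rate, discount, exploration rate

def step(state, action):
    """Return (next_state, reward) for this toy environment."""
    nxt = max(0, min(n_states - 1, state + (1 if action == 1 else -1)))
    return nxt, (1.0 if nxt == n_states - 1 else 0.0)

def greedy(state):
    """Pick the best-known action, breaking ties randomly."""
    best = max(Q[state])
    return random.choice([a for a in range(n_actions) if Q[state][a] == best])

for _ in range(300):                                # repeated episodes refine the table
    state = 0
    for _ in range(100):                            # cap the episode length
        action = random.randrange(n_actions) if random.random() < epsilon else greedy(state)
        nxt, reward = step(state, action)
        # Q-learning update: nudge the estimate towards reward + discounted future value
        Q[state][action] += alpha * (reward + gamma * max(Q[nxt]) - Q[state][action])
        state = nxt
        if state == n_states - 1:
            break

print(Q)   # after training, "move right" should score higher than "move left" in every non-goal state
```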

Reinforcement learning has been applied to improve the selection of team formations in fantasy sports ( Matthews et al., 2012 ). Likewise, the use of reinforcement learning is prominent in online AI bots and simulators like chess, checkers, Go, poker, StarCraft, etc.

Finally, it is important to also note the existence of genetic or evolutionary algorithms, sometimes referred to as nature/bio-inspired algorithms. While such algorithms are not typically considered to be ML algorithms (but rather search techniques and heuristics), they are very popular in solving similar types of problems tackled by ML algorithms. In short, the idea behind such algorithms is to run (parallel) search, selection and mutation techniques, by going over possible candidate solutions of a problem. The solutions are gradually optimised until reaching a local (sub-optimal) or global maximum (optimal solution). To provide a high-level understanding of evolutionary algorithms, consider the following sequence of steps:

• We start by creating (a population of) initial candidate or random strategies/solutions to the problem at hand.

• We assess these candidate solutions (using a fitness function) and assign scores to each according to how well they solve the problem at hand.

• We then pick a selection of these candidate solutions that performed best at stage two above. We then combine ( crossbreed ) these together to generate ( breed) new solutions (e.g., take some attributes from one candidate solution and others from another candidate solution in order to come up with a new solution).

• We then apply random changes ( mutations ) to the resulting solutions from the previous step.

• We repeat the solution combination/crossbreeding process until a satisfactory solution is reached.

Evolutionary algorithms can be used as alternative means for training machine learning algorithms such as reinforcement learning algorithms and deep neural networks.
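The sketch below walks through the steps listed above in code; the notion of a "training plan", the assumed ideal values and the fitness function are hypothetical stand-ins used only to make the evolutionary loop concrete.

```python
# Compact sketch of an evolutionary loop; the fitness target is entirely made up.
import random

IDEAL = [3.0, 5.0, 2.0, 6.0]                        # assumed ideal weekly loads, for illustration only

def fitness(plan):
    # Higher is better: negative squared distance from the assumed ideal
    return -sum((p - t) ** 2 for p, t in zip(plan, IDEAL))

def crossbreed(a, b):
    # Take some attributes from one parent and the rest from the other
    cut = random.randint(1, len(a) - 1)
    return a[:cut] + b[cut:]

def mutate(plan, rate=0.2):
    # Apply small random changes to some attributes
    return [p + random.gauss(0, 0.5) if random.random() < rate else p for p in plan]

# Step 1: an initial population of random candidate solutions
population = [[random.uniform(0, 10) for _ in IDEAL] for _ in range(30)]

for _ in range(50):                                 # repeat until satisfactory
    # Step 2: score every candidate; keep the best performers as parents
    population.sort(key=fitness, reverse=True)
    parents = population[:10]
    # Steps 3-4: breed new candidates from the parents and mutate them
    children = [mutate(crossbreed(random.choice(parents), random.choice(parents)))
                for _ in range(20)]
    population = parents + children

print(max(population, key=fitness))                 # best plan found so far
```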

The Future of AI in Sport

There is no doubt that AI will continue to transform sports, and the ways in which we play, watch and analyse sports will be innovative and unexpected. In fact, machine learning has drastically changed the way we think about match strategies and player performance analytics, but also how we track, identify and learn about sport consumers. A Pandora's box of ethical issues is emerging and will increasingly need to be considered when machines invade the traditionally human-centred and naturally talented athlete base of sport. It is unlikely that AI will completely replace coaches and human experts, but there is no doubt that leveraging the power of AI will provide coaches and players with a big advantage and lead over those who only rely on human expertise. It will also provide sport business managers with deeper, real-time insights into the behaviours, needs and wants of sport consumers, and in turn AI will become a main producer of sport content that is personalised and custom-made for individual consumers. But human direction and intervention seem to be, at least in the near future, still essential in working towards elite sport performance and strategic decision making in sport business. The sporting performance on the field is often produced as an entertainment spectacle, where the sporting context is the platform for generating the business of sport. Replacing referees with automated AI is clearly possible and increasingly adopted in various sports, because it is more accurate and efficient, but is it what the fans want?

What might the future of sport with increasingly integrated AI look like? Currently, most of the research in AI and sports is specialised. That is, it aims to provide performance or business solutions and solve specific on- and off-field problems. For instance, scientists have successfully devised solutions to tackle problems like player performance measurement, and quantifying the effect of a player/team on demand for gate attendance. Nevertheless, our research has not (yet) identified studies that provide a 360-degree analysis of, for example, the absolute value of an athlete by taking into account all the dimensions of his or her performance and how much business can be developed, for example in regard to ticket sales or endorsement deals.

One of the main challenges to achieving such a comprehensive analysis is that data about players and teams, and commercial data such as ticket sales and attendance numbers, are kept proprietary and are not made public to avoid providing other parties with competitive information. Moreover, privacy is an important consideration as well. Regulations about data privacy and leakage of personal identification details must be put in place to govern the use and sharing of sports (performance and consumption) data. Data ownership, protection, security, privacy and access will all drive the need for comprehensive and tight legislation and regulation that will strongly influence the speed and comprehensiveness of the adoption of AI in sport. To that end, it is worth considering privacy and confidentiality implications independently when studying the leagues' journey of AI adoption compared to that of individual teams and ultimately the individual players. Eventually, the successful adoption of AI in a sports league will likely depend on the teams in that league and their players being willing to share proprietary data or insights with other teams in the league. Performance data of players in particular is becoming a hot topic of disputation. It may well be AI that will determine the bargaining power of players and their agents in regard to the value of their contracts. As an extension of this, it will then also be AI providing the information that will determine if players are achieving the performance objectives set by coaches and as agreed to in contracts. In other words, confidentiality and ownership of league-, team- or player-level data will become an increasing bone of legal contention, and this will be reflected in the complexity of contractual agreements and possible disputes in the change rooms and on the field of play. Being in control of which data can or cannot, and will or will not, be used is at stake.

From an economic perspective, relying on artificial algorithms could increase the revenue of sports organisations and event organisers by enabling them to apply efficient variable and dynamic pricing strategies and to build comprehensive, deep consumer-knowledge platforms. Different types of ML algorithms can be adopted to deliver more effective customer marketing via personalisation and to increase sales funnel conversion rates.

Finally, for a window on the future of data privacy, it might be useful to return to baseball where the addiction to big data started its spread across the high-performance sport industry. Hattery (2017 , p. 282) explains that in baseball “using advanced data collection systems … the MLB teams compete to create the most precise injury prediction models possible in order to protect and optimise the use of their player-assets. While this technology has the potential to offer tremendous value to both team and player, it comes with a potential conflict of interest. Players' goals are not always congruent with those of the organisation: the player strives to protect his own career while the team is attempting to capitalise on the value of an asset. For this reason, the player has an interest in accessing data that analyses his potential injury risk. This highlights a greater problem in big data: what rights will individuals possess regarding their own data points?”

This privacy issue can be further extended to the sport business space. Dezfouli et al. (2020) have shown how AI can be designed to manipulate human behaviour. Their algorithms learned from the responses of humans who were participating in controlled experiments. The algorithms identified and targeted vulnerabilities in human decision-making. The AI succeeded in steering participants towards executing particular actions. So, will AI one day be shaping the spending behaviour of sports fans by exploiting their fan-infused emotional vulnerabilities and monitoring their (for example) gambling inclinations? Will AI sacrifice the health of some athletes in favour of the bigger team winning the premiership? Or is this already happening? Time will tell.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Author Contributions

NC and HW made major contributions to the writing of this manuscript. NC contributed to the writing of the parts around artificial intelligence and machine learning and provided examples of these. HW shaped the scope of the manuscript and wrote and edited many of its sections, particularly the introduction and the discussion. Both authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

1. ^ For conferences and published articles on AI and sports analytics see Swartz (2020) .

2. ^ Note that such function is also found in regression techniques where the weights/coefficients are unknown. In ML, it is usually the case where both the function and its weights are unknown and are determined using various search techniques and algorithms.

Anderson, J. A. (1995). An introduction to Neural Networks . Cambridge, MA: MIT Press.

Atiković, A., Kamenjašević, E., Nožinović, M. A., Užičanin, E., Tabaković, M., and Curić, M. (2020). Differences between all-around results in women's artistic gymnastics and ways of minimizing them. Balt. J. Health Phys. Act. 12, 80–91. doi: 10.29359/BJHPA.12.3.08

Bartlett, R. (2006). Artificial intelligence in sports biomechanics: new dawn or false hope? J. Sports Sci. Med. 5, 474–479.

Beal, R., Norman, T. J., and Ramchurn, S. D. (2019). Artificial intelligence for team sports: a survey. Knowl. Eng. Rev. 34. doi: 10.1017/S0269888919000225

Berri, D. J., Schmidt, M. B., and Brook, S. L. (2004). Stars at the gate: the impact of star power on NBA gate revenues. J. Sports Econom. 5, 33–50. doi: 10.1177/1527002503254051

Bonaccorso, G. (2017). Machine Learning Algorithms . Birmingham: Packt Publishing Ltd.

Brandes, L., Franck, E., and Nuesch, S. (2008). Local heroes and superstars: an empirical analysis of star attraction in German soccer. J. Sports Econom. 9, 266–286. doi: 10.1177/1527002507302026

Brockett, C. L., Morgan, D. L., and Proske, U. W. E. (2004). Predicting hamstring strain injury in elite athletes. Med. Sci. Sports Exerc. 36, 379–387. doi: 10.1249/01.MSS.0000117165.75832.05

Brown, N., and Sandholm, T. (2019). Superhuman AI for multiplayer poker. Science 365, 885–890. doi: 10.1126/science.aay2400

Campbell, M., Hoane Jr, A. J., and Hsu, F. H. (2002). Deep blue. Artif. Intell. 134, 57–83. doi: 10.1016/S0004-3702(01)00129-1

Chmait, N. (2017). Understanding and measuring collective intelligence across different cognitive systems: an information-theoretic approach. in IJCAI (Melbourne), 5171–5172.

Chmait, N., Dowe, D. L., Li, Y. F., Green, D. G., and Insa-Cabrera, J. (2016). Factors of collective intelligence: how smart are agent collectives? in Proceedings of the Twenty-second European Conference on Artificial Intelligence (Prague), 542–550.

Chmait, N., Robertson, S., Westerbeek, H., Eime, R., Sellitto, C., and Reid, M. (2020a). Tennis superstars: the relationship between star status and demand for tickets. Sport Manag. Rev . 23, 330–347. doi: 10.1016/j.smr.2019.03.006

Chmait, N., Westerbeek, H., Eime, R., Robertson, S., Sellitto, C., and Reid, M. (2020b). Tennis influencers: the player effect on social media engagement and demand for tournament attendance. Telemat Inform. 50:101381. doi: 10.1016/j.tele.2020.101381

Coates, D., and Humphreys, B. R. (2012). Game attendance and outcome uncertainty in the National Hockey League. J. Sports Econom. 13, 364–377. doi: 10.1177/1527002512450260

Dezfouli, A., Nock, R., and Dayan, P. (2020). Adversarial vulnerabilities of human decision-making. Proc. Nat. Acad. Sci. U.S.A. 117, 29221–29228. doi: 10.1073/pnas.2016921117

Fister Jr, I., Ljubič, K., Suganthan, P. N., Perc, M., and Fister, I. (2015). Computational intelligence in sports: challenges and opportunities within a new research domain. Appl. Math. Comput. 262, 178–186. doi: 10.1016/j.amc.2015.04.004

Ghosh, J. K., Delampady, M., and Samanta, T. (2007). An Introduction to Bayesian Analysis: Theory and Methods . Berlin: Springer Science and Business Media.

Hakes, J. K., and Sauer, R. D. (2006). An economic evaluation of the Moneyball hypothesis. J. Econ. Perspect. 20, 173–186. doi: 10.1257/jep.20.3.173

Hattery, M. (2017). Major League Baseball players, big data, and the right to know: the duty of Major League Baseball teams to disclose health modelling analysis to their players. Marquette Sports Law Rev. 28, 257–283. https://scholarship.law.marquette.edu/sportslaw/vol28/iss1/9/

Hernández-Orallo, J., Baroni, M., Bieger, J., Chmait, N., Dowe, D. L., Hofmann, K., et al. (2017). A new AI evaluation cosmos: ready to play the game? AI Mag. 38, 66–69. doi: 10.1609/aimag.v38i3.2748

Jane, W.-J. (2016). The effect of star quality on attendance demand: the case of the National Basketball Association. J. Sports Econom. 17, 396–417. doi: 10.1177/1527002514530405

Jewell, R. T. (2017). The effect of marquee players on sports demand: the case of US Major League Soccer. J. Sports Econom. 18, 239–252. doi: 10.1177/1527002514567922

Konjer, M., Meier, H. E., and Wedeking, K. (2017). Consumer demand for telecasts of tennis matches in Germany. J. Sports Econom. 18, 351–375. doi: 10.1177/1527002515577882

Krause, L. (2019). Exploring the influence of practice design on the development of tennis players (Doctoral dissertation). Victoria University, Footscray, VIC, Australia.

Lapham, A. C., and Bartlett, R. M. (1995). The use of artificial intelligence in the analysis of sports performance: a review of applications in human gait analysis and future directions for sports biomechanics. J. Sports Sci. 13, 229–237. doi: 10.1080/02640419508732232

Lenten, L. J. (2012). Comparing attendances and memberships in the Australian Football League: the case of hawthorn. Econ Labour Relat. Rev. 23, 23–38. doi: 10.1177/103530461202300203

Lewis, M. (2004). Moneyball: The Art of Winning an Unfair Game . New York, NY: WW Norton and Company.

Lewis, M., and Yoon, Y. (2016). An empirical examination of the development and impact of star power in Major League Baseball. J. Sports Econom. 19, 155–187. doi: 10.1177/1527002515626220

Matthews, T., Ramchurn, S., and Chalkiadakis, G. (2012). Competing with humans at fantasy football: Team formation in large partially-observable domains. in Proceedings of the AAAI Conference on Artificial Intelligence , Vol. 26 (Vancouver, BC), 1394–1400.

McCabe, A., and Trevathan, J. (2008). Artificial intelligence in sports prediction. in Fifth International Conference on Information Technology: New Generations (IEEE: Las Vegas, NV), 1194–1197. doi: 10.1109/ITNG.2008.203

Nadikattu, R. R. (2020). Implementation of new ways of artificial intelligence in sports. J. Xidian Univ. 14, 5983–5997. doi: 10.2139/ssrn.3620017

Novatchkov, H., and Baca, A. (2013). Artificial intelligence in sports on the example of weight training. J. Sports Sci. Med. 12, 27–37.

Ormiston, R. (2014). Attendance effects of star pitchers in major league baseball. J. Sports Econom. 15, 338–364. doi: 10.1177/1527002512461155

Paton, D., and Cooke, A. (2005). Attendance at county cricket: an economic analysis. J. Sports Econom. 6, 24–45. doi: 10.1177/1527002503261487

Piccinotti, D. (2021). Open Loop Planning for Formula 1 Race Strategy Identification. Menlo Park, CA: Association for the Advancement of Artificial Intelligence.

Ratiu, O. G., Badau, D., Carstea, C. G., Badau, A., and Paraschiv, F. (2010). Artificial intelligence (AI) in sports, in Proceedings of the 9th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering, and Data Bases (Cambridge, UK), 93–97.

Risi, S., and Preuss, M. (2020). From chess and Atari to StarCraft and beyond: how game AI is driving the world of AI. KI-Künstliche Intell. 34, 7–17. doi: 10.1007/s13218-020-00647-w

Schaeffer, J., Burch, N., Björnsson, Y., Kishimoto, A., Müller, M., Lake, R., et al. (2007). Checkers is solved. Science 317, 1518–1522. doi: 10.1126/science.1144079

Silver, D., Schrittwieser, J., Simonyan, K., Antonoglou, I., Huang, A., Guez, A., et al. (2017). Mastering the game of go without human knowledge. Nature 550, 354–359. doi: 10.1038/nature24270

Swartz, T. B. (2020). Where should I publish my sports paper? Am. Stat. 74, 103–108. doi: 10.1080/00031305.2018.1459842

Vinyals, O., Babuschkin, I., Czarnecki, W. M., Mathieu, M., Dudzik, A., Chung, J., et al. (2019). Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575, 350–354. doi: 10.1038/s41586-019-1724-z

Whitaker, G. A., Silva, R., Edwards, D., and Kosmidis, I. (2021). A Bayesian approach for determining player abilities in football. J. R. Stat. Soc. Series C 70, 174–201. doi: 10.1111/rssc.12454

Yang, T. Y., and Swartz, T. (2004). A two-stage Bayesian model for predicting winners in major league baseball. J. Data Sci. 2, 61–73. doi: 10.6339/JDS.2004.02(1).142

Keywords: artificial intelligence, machine learning, sports business, sports analytics, sport research, future of sports

Citation: Chmait N and Westerbeek H (2021) Artificial Intelligence and Machine Learning in Sport Research: An Introduction for Non-data Scientists. Front. Sports Act. Living 3:682287. doi: 10.3389/fspor.2021.682287

Received: 18 March 2021; Accepted: 15 November 2021; Published: 08 December 2021.

Copyright © 2021 Chmait and Westerbeek. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Nader Chmait, nader.chmait@vu.edu.au

This article is part of the Research Topic: The Future of Sport Business


Artificial intelligence and machine learning

  • Fundamentals
  • Open access
  • Published: 09 November 2022
  • Volume 32 , pages 2235–2244, ( 2022 )


  • Niklas Kühl   ORCID: orcid.org/0000-0001-6750-0876 1 ,
  • Max Schemmer 1 ,
  • Marc Goutier 2 &
  • Gerhard Satzger 1  

28k Accesses

27 Citations

3 Altmetric

Within the last decade, the application of “artificial intelligence” and “machine learning” has become popular across multiple disciplines, especially in information systems. The two terms are still used inconsistently in academia and industry—sometimes as synonyms, sometimes with different meanings. With this work, we try to clarify the relationship between these concepts. We review the relevant literature and develop a conceptual framework to specify the role of machine learning in building (artificial) intelligent agents. Additionally, we propose a consistent typology for AI-based information systems. We contribute to a deeper understanding of the nature of both concepts and to more terminological clarity and guidance—as a starting point for interdisciplinary discussions and future research.

Introduction

Artificial Intelligence (AI) has been named as one of the most recent, fundamental developments of the convergence in electronic markets (Alt, 2021 ) and has become an increasingly relevant topic for information systems (IS) research (Abdel-Karim et al., 2021 ; Alt, 2018 ). While a large body of literature is concerned with designing AI to mimic and replace humans (Dunin-Barkowski, 2020 ; Fukuda et al., 2001 ), IS research in general, and decision support systems (DSS) research in particular, emphasize the support of humans with AI (Arnott & Pervan, 2005 ). Recent research in hybrid intelligence (HI) and human-AI collaboration offers a promising path in synthesizing AI research across different fields (Dellermann, 2019 ): The ultimate goal of HI is to leverage the individual advantages of both human and artificial intelligence to enable synergy effects (James & Paul, 2018 ) and to achieve complementarity (Hemmer et al., 2021 ).

However, in many cases in both research and practice, AI is simply equated with the concept of machine learning (ML)—negatively impacting terminological precision and effective communication. Ågerfalk ( 2020 , p.2) emphasizes that differentiating between AI and ML is especially important for IS research: “Is it not our responsibility as IS scholars to bring clarity to the discourse rather than contributing to its decline? (…) It would mean to distinguish between different types of AI and not talk of AI as synonymous with ML, which in itself is far from a monolithic concept.”

The practical relevance of a clear understanding is underlined by observing confusion and misuse of the terms AI and ML: During Mark Zuckerberg’s U.S. senate hearing in April 2018, he stressed that Facebook had “AI tools to identify hate speech” as well as “terrorist propaganda” (The Washington Post, 2018 ). Researchers, however, would usually describe tasks identifying specific social media platform instances as classification tasks in the field of (supervised) ML (Waseem & Hovy, 2016 ). The increasing popularity of AI (Fujii & Managi, 2018 ) has led to the term often being used interchangeably with ML. This does not only hold true for the statement of Facebook’s CEO above, but also across various theoretical and application-oriented contributions in recent literature (Brink, 2017 ; ICO, 2017 ; Nawrocki et al., 2018 ). Camerer ( 2017 ) even mentions that he still uses AI as a synonym for ML despite knowing it is inaccurate.

As the remainder of this paper shows, both concepts are not identical—although in many cases both terms will appear in the same context. Such ambiguity might lead to multiple imprecisions in both research and practice when conversing about the relevant concepts, methods, and results. This is especially important in IS research—being interdisciplinary by nature (D’Atri et al., 2008 ). Ultimately, misuse can either lead to fundamental misunderstandings (Carnap, 1955 ) or to research that ought to be undertaken not being conducted (Davey & Cope, 2008 ; Lange, 2008 ). After all, misunderstandings can potentially lead to low perceived trustworthiness of AI (Thiebes et al., 2021 ).

It seems surprising that despite the frequent use of the terms, there is hardly any helpful academic delineation—apart from the notion that ML is a (not well-defined) subset of AI (Campesato, 2020 ), comparable to other possible subdisciplines of AI: Expert systems, robotics, natural language processing, machine vision, and speech recognition (Collins et al., 2021 ; Dejoux & Léon, 2018 ). Consequently, this paper aims to shed light on the relationship between the two concepts: We analyze the role of ML in AI and, more precisely, in intelligent agents, which are defined by their capability to sense and act in an environment (Schleiffer, 2005 ). We do so by taking an ML perspective on intelligent agents’ capabilities and their relevant implementation—with IS research in mind. To this end, we review the relevant literature for both terms and synthesize and conceptualize the results.

Our article’s contributions are twofold: First, we identify different contributions of ML to intelligent agents as specific AI instantiations. We base this on an expansion of the existing AI framework by Russell and Norvig ( 2020 ) — explicitly breaking down intelligent agents’ capabilities into separate “execution” and “learning” capabilities. Second, we develop a typology to provide a common terminology for AI-based information systems, where we conceptualize which systems employ ML—and which do not. The result should provide guidance when designing and analyzing systems.

Next, in Section “ Terminology ”, we review relevant literature in the fields of AI and ML. In Section “ The role of rational agents in information systems ”, we then analyze the capabilities of intelligent agents in more depth and examine the role of ML in them. Section “ Towards a typology for machine learning in AI systems ” develops a framework and typology to differentiate the terms AI and ML and to explain their relationship. In Section “ Conclusion ”, we conclude with a summary.

  • Terminology

Over the last decade, both terms, artificial intelligence (AI) and machine learning (ML), have enjoyed increasing popularity in information systems (IS) research. An analysis of the “AIS Senior Scholars’ Basket” Footnote 1 journals since 2000 Footnote 2 illustrates how the occurrences of both terms increased in titles, abstracts, and keywords (Fig.  1 ). While over the last 21 years we observe a small but constant number of publications covering AI-related topics, ML only gained relevance in the literature after 2017: The late reflection of ML—despite the earlier adoption and spread in industry (Brynjolfsson & Mcafee, 2017 )—may raise questions about whether IS has picked up the topic early enough.

Fig. 1 Appearance of the terms “artificial intelligence” and “machine learning” in AIS Senior Scholars’ Basket journals

As the analysis demonstrates, the two terms have existed for quite some time, while their related subjects are now highly and increasingly topical. In this section, we will elaborate on the meaning of the terms.

  • Artificial intelligence

In 1956, a Dartmouth workshop, led by Minsky and McCarthy, coined the term “artificial intelligence” (McCarthy et al., 1956 ) —later taking in contributions from a variety of different research disciplines, such as computer science (K. He et al., 2016 ) and programming (Newell & Simon, 1961 ), neuroscience (Ullman, 2019 ), robotics (Brady, 1984 ), linguistics (Clark et al., 2010 ), philosophy (Witten et al., 2011 ), and futurology (Koza et al., 1996 ). While the terminology is not well defined across disciplines, even within the IS domain definitions do vary widely; Collins et al. ( 2021 ) provide a comprehensive overview. Recent AI definitions transfer the human intelligence concept to machines in its entirety as “the ability of a machine to perform cognitive functions that we associate with human minds, such as perceiving, reasoning, learning, interacting with the environment, problem solving, decision-making, and even demonstrating creativity” (Rai et al., 2019 , p.1). Still, over the last decades various debates have been raging on the depth and objectives of AI. These two dimensions span the space for different AI research streams in computer science and IS that were categorized by Russell and Norvig ( 2020 ): On the one hand (depth dimension), it may target either the thought process or a concrete action ( thinking vs. acting ); on the other hand (objective dimension), it may try to either replicate human decision making or to provide an ideal, “most rational” decision ( human-like vs. rational decision ). The resulting research streams are depicted in Table  1 .

According to the cognitive modeling (i.e., thinking humanly) stream, AI instantiations must be “machines with a mind” (Haugeland, 1989 ) that perform human thinking (Bellman, 1978 ). Not only should they arrive at the same output as a human when given the same input, but also apply the same reasoning steps leading to this conclusion (Newell & Simon, 1961 ). The laws of thought stream (i.e., thinking rationally) requires AI instantiations to arrive at a rational decision regardless of what a human might come up with. AI must therefore adhere to the laws of thought by using logic-based computational models (McDermott & Charniak, 1985 ). The Turing test stream (i.e., acting humanly) implies that AI must act intelligently when interacting with humans. To accomplish such tasks, AI instantiations must perform human tasks at least as well as humans (Rich & Knight, 1991 ), which can be tested via the Turing test (Turing, 1950 ). Finally, the rational agent stream considers AI as a rational (Russell & Norvig, 2020 ) or intelligent (Poole et al., 1998 ) agent. Footnote 3 This agent not only acts autonomously, but also pursues the objective of achieving the rationally ideal outcome.

  • Machine learning

Many researchers perceive ML as an (exclusive) part of AI (Collins et al., 2021 ; Copeland, 2016 ; Ongsulee, 2017 ). In general, learning is a key facet of human cognition (Neisser, 1967 ). Humans process a vast amount of information by utilizing abstract knowledge that helps them to better understand incoming input. Owing to their adaptive nature, ML models can mimic a human being’s cognitive abilities (Janiesch et al., 2021 ): ML describes a set of methods commonly used to solve a variety of real-world problems with the help of computer systems, which can learn to solve a problem instead of being explicitly programmed to do so (Koza et al., 1996 ). For instance, instead of explicitly telling a computer system which words within a tweet would indicate that it contains a customer need, the system (given a sufficient set of training samples) learns the typical patterns of words and their combinations that result in a need classification (Kühl et al., 2020 ).
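As a concrete illustration, the following minimal sketch shows how such a classifier could be learned from labeled examples rather than hand-written rules. The tweets, labels, and the scikit-learn pipeline are purely illustrative assumptions and are not the data or method of Kühl et al. (2020).

```python
# Minimal sketch of supervised ML for need classification (illustrative only;
# the tweets and labels below are made up, not a real data set).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

tweets = [
    "I wish this app had a dark mode",          # expresses a customer need
    "Great weather today, going for a run",     # no need expressed
    "Please add an export-to-CSV feature",      # expresses a customer need
    "Just watched a fantastic movie",           # no need expressed
]
labels = [1, 0, 1, 0]  # 1 = contains a customer need, 0 = does not

# The model learns word patterns from labeled examples instead of explicit rules.
model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(tweets, labels)

print(model.predict(["Would love an option to mute notifications"]))  # likely [1]
```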

In general, we differentiate between unsupervised , supervised , and reinforcement ML. Unsupervised ML comprises methods that reveal previously unknown patterns in data. Consequently, unsupervised learning tasks do not necessarily have a “correct” solution, as there is no ground truth (Wang et al., 2009 ).
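To illustrate this difference, the following sketch clusters made-up customer data without any labels; the data, the choice of k-means, and the two resulting segments are assumptions for demonstration only.

```python
# Minimal sketch of unsupervised ML: k-means groups observations without labels,
# so there is no single "correct" solution to check against (illustrative data).
import numpy as np
from sklearn.cluster import KMeans

# Synthetic customer data: [average purchases per month, average basket value]
customers = np.array([[1, 10], [2, 12], [1, 9],
                      [8, 80], [9, 95], [10, 90]])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(customers)
print(kmeans.labels_)  # e.g. [0 0 0 1 1 1]: two previously unknown segments
```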

Supervised ML refers to methods that allow the building of knowledge about a given task from a series of examples representing “past experience” (Mitchell, 1997 ). In the learning process, no manual adjustment or programming of rules or strategies to solve a problem is required, i.e., the model is capable of learning “by itself”. In more detail, supervised ML methods always aim to build a model by applying an algorithm to a set of known data points to gain insight into an unknown set of data (Hastie et al., 2017 ): Known data points are semantically labeled to create a target for the ML model. So-called semi-supervised learning combines elements from supervised and unsupervised ML by jointly using labeled and unlabeled data (Zhu, 2005 ).

Reinforcement learning refers to methods that are concerned with teaching intelligent agents to take those kinds of actions that increase their cumulative reward (Kaelbling et al., 1996 ). It differs from supervised learning in that no correctly matched features and targets are required for training. Instead, rewards and penalties allow the model to continuously learn over time. The focus is on a trade-off between the exploration of the uncharted environment and the exploitation of the existing knowledge base.
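The following minimal sketch illustrates this exploration-exploitation trade-off with tabular Q-learning on a toy corridor environment; the environment, rewards, and hyperparameters are purely illustrative assumptions, not a prescribed implementation.

```python
# Minimal sketch of reinforcement learning: tabular Q-learning on a toy corridor.
# The agent is never told the correct action; it only receives a reward at the
# right end of the corridor and learns from it (all values illustrative).
import random

n_states, actions = 5, [0, 1]            # action 0 = move left, action 1 = move right
Q = [[0.0, 0.0] for _ in range(n_states)]
alpha, gamma, epsilon = 0.5, 0.9, 0.2    # learning rate, discount, exploration rate

def choose_action(state):
    # Trade-off between exploring the environment and exploiting current knowledge.
    if random.random() < epsilon or Q[state][0] == Q[state][1]:
        return random.choice(actions)
    return int(Q[state][1] > Q[state][0])

for episode in range(200):
    state = 0
    while state != n_states - 1:
        action = choose_action(state)
        next_state = max(0, state - 1) if action == 0 else state + 1
        reward = 1.0 if next_state == n_states - 1 else 0.0
        Q[state][action] += alpha * (reward + gamma * max(Q[next_state]) - Q[state][action])
        state = next_state

print([int(q[1] > q[0]) for q in Q[:-1]])  # learned policy: [1, 1, 1, 1] = always move right
```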

The role of rational agents in information systems

To further elaborate on the role of ML within AI, we need to take a clear perspective on which of the different definitions of AI is most beneficial to IS research. IS traditionally utilizes ML in predictive analytics tasks within (intelligent) decision support systems (DSS) (Arnott & Pervan, 2005 ; Müller et al., 2016 ) where the goal is to generate the best possible outcome (Arnott, 2006 ; Hunke et al., 2022 ; Power et al., 2019 ). As Phillips-Wren et al. ( 2019 , p.63) emphasize, DSS “should help the decision-maker think rationally”. The perspective of rationality is also endorsed by other researchers in the field (Bakos & Treacy, 1986 ; Dellermann, 2019 ; Kloör et al., 2018 ; Power et al., 2019 ; Schuetz & Venkatesh, 2020 ). Thus, in the following we will explore the relationship between ML and AI in IS through the lens of the rational agent stream as discussed above. Furthermore, we will focus on supervised ML as it is the most common type of ML (Jordan & Mitchell, 2015 ). In the remainder of this section, we will first distinguish different types of (rational) agents and then use the insights to differentiate between the necessary layers when designing them as part of information systems.

Types of rational agents

According to the selected research stream, intelligence manifests itself in how rational agents act. Five features characterize agents in general: they “operate autonomously, perceive their environment, persist over a prolonged time period, adapt to change, and create and pursue goals” (Russell & Norvig, 2020 , p.4). An agent defines its action, not for itself, but within the environment it operates in and interacts with. It recognizes the environment through its sensors, relies on an agent program to handle and digest input data, and performs actions via actuators. A rational agent aims to achieve the highest expected outcome according to one or multiple objective performance measures—which are based on current and past knowledge of the environment and possible actions. For example, a rational agent within a medical diagnosis system aims to maximize the health of a patient measured via blood pressure, heart rate, and blood oxygen (potentially while minimizing the financial costs of a treatment as a secondary condition) (Grosu, 2022 ).

The agent’s conceptualization and surroundings are summarized in the agent-environment framework. It consists of three components: an agent, an environment, and a goal. Intelligence is the measurement of the “agent’s ability to achieve goals in a wide range of environments” (Legg & Hutter, 2007 , p. 12). The agent obtains input through perceptions that the environment generates. Observations of the environment are one type of perception, while others are reward signals that indicate how well the agents’ goals have been achieved. Based on these input signals, the agent decides to perform actions, which are subsequently communicated back as signals to the environment.
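A minimal sketch of this loop, assuming a toy thermostat-like environment and a simple proportional decision rule, could look as follows; all names and numbers are illustrative assumptions rather than a prescribed implementation.

```python
# Minimal sketch of the agent-environment framework: the environment emits
# perceptions (an observation plus a reward signal), the agent chooses an action,
# and the action is fed back to the environment (illustrative example).
class Environment:
    def __init__(self):
        self.temperature = 30.0                 # state observed by the agent

    def perceive(self):
        reward = -abs(self.temperature - 21.0)  # goal: keep temperature near 21 degrees
        return self.temperature, reward

    def apply(self, action):                    # action: heating/cooling adjustment
        self.temperature += action

class RationalAgent:
    def act(self, observation, reward):
        # Choose the action that maximizes the expected performance measure.
        return 21.0 - observation               # simple proportional correction

env, agent = Environment(), RationalAgent()
for step in range(3):
    obs, reward = env.perceive()
    env.apply(agent.act(obs, reward))
    print(f"step {step}: temperature={env.temperature:.1f}, reward={reward:.1f}")
```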

Rational agents in information system architectures

As we investigate the role of ML in AI for IS research, we also need — apart from the theoretical and definitional aspects of agents — to consider how the functionality of a rational agent is reflected in an IS architecture. The implementation of agents is a key step to embed their functionality into practical, real-world (intelligent) information systems in general or into DSS specifically (Gao & Xu, 2009 ; Zhai et al., 2020 ). Any rational agent needs to be capable of at least two tasks: cognition (Lieto et al., 2018 ) and (inter)action with the environment (Russell & Norvig, 2020 ). If we map these capabilities to system design terms, then acting capabilities are the ones built into a frontend , while the cognitive capabilities are embedded in a backend .

The frontend as the interface to the environment may take various forms; it may be designed as a very abstract, machine-readable web interface (Kühl et al., 2020 ), a human-readable application (Engel et al., 2022 ; Hirt et al., 2019 ), or even a humanoid template with elaborated expression capabilities (Guizzo, 2014 ). For the frontend to interact with the environment, two technical components are required: sensors and actuators. Sensors detect events or changes in the environment and forward the information via the frontend to the backend. They can, for instance, read the signals within an industrial process network (Hein et al., 2019 ), read visuals of an interaction with a human (Geller, 2014 ), but also perceive a keystroke input (Russell & Norvig, 2020 ). Actuators , on the other hand, are components responsible for moving, controlling, or displaying content. While sensors merely process information, actuators act, for instance, by automatically making bookings (Neuhofer et al., 2021 ) or changing a humanoid’s facial expressions (Berns & Hirth, 2006 ). One could argue that the Turing test (Turing, 1950 ) takes place at the environment’s interaction with the frontend, or, more precisely, when sensors and actuators are combined in a way to test the agent’s AI for acting humanly .

The backend provides the required functionalities to depict an intelligent agent’s cognitive capabilities. More precisely, this executing backend allows the agent to draw on its built-in knowledge. The backend translates signals from the frontend and transforms them into signals sent back to the frontend as a response by executing actions. In some cases, there is an additional component modifying this response function over time, and thus modifying the execution part of the backend. We call this the learning part of the backend as depicted in Fig.  2 . Within the next subsections, we will further elaborate this framework and its components.

Fig. 2 Conceptual framework describing the general architecture for intelligent agents in AI-based information systems
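To make the framework tangible, the following sketch maps its components (sensors and actuators in the frontend, an executing backend, and an optional learning backend) onto hypothetical classes; the scenario and names are assumptions for illustration only, not a reference implementation.

```python
# Minimal sketch of the conceptual framework: a frontend with sensors and actuators,
# an executing backend that maps percepts to actions, and a learning backend that
# modifies the response function over time (all names are hypothetical).
class Frontend:
    def sense(self):
        return {"keystroke": "order status?"}   # sensor: perceives an event

    def act(self, response):
        print("Actuator output:", response)     # actuator: displays/executes content

class ExecutingBackend:
    def __init__(self):
        self.knowledge = {"order status?": "Your order is on its way."}

    def execute(self, percept):
        return self.knowledge.get(percept["keystroke"], "I do not understand yet.")

class LearningBackend:
    def update(self, executing_backend, percept, feedback):
        # Modifies the executing backend's response function over time.
        executing_backend.knowledge[percept["keystroke"]] = feedback

frontend, executor, learner = Frontend(), ExecutingBackend(), LearningBackend()
percept = frontend.sense()
frontend.act(executor.execute(percept))
learner.update(executor, percept, "Delivery expected tomorrow.")   # learning step
frontend.act(executor.execute(percept))                            # adapted response
```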

The role of machine learning in rational agents

In terms of supervised ML, we need to further differentiate between the process task of building (training) adequate ML models (Witten et al., 2011 ) and the one of executing the deployed models (Chapman et al., 2000 ). To further understand ML’s role in intelligent agents, we partition the agent’s cognition layer into a learning sublayer (model building) and an executing sublayer (model execution). Footnote 4 We, therefore, regard the implementation required by the learning sublayer as the learning backend , while the executing backend denotes the executing sublayer.

The learning backend first dictates if the intelligent agent is able to learn, and, second, how it does so — with respect to the algorithms it actually uses, the type of data processing it applies, and the handling of concept drift (Gama et al., 2014 ). Using the terminology of Russell and Norvig ( 2020 ), we distinguish two different types of intelligent agents: simple-reflex agents and learning agents . This differentiation holds explicitly in terms of an ML perspective on AI because it considers whether the underlying models in the cognition layer are trained just once and after that never touched (simple-reflex), or whether they are continuously updated to be adaptive (learning). Related work provides suitable examples of both. Kitts and Leblanc ( 2004 ) build a bidding agent for digital auctions as a simple-reflex agent: While building and testing the model for the agent may show convincing results, the system’s lack of adaptive learning after deployment could become critical. Other examples of agents with models trained just once are common in different areas, for example, pneumonia warnings for hospitals (Oroszi & Ruhland, 2010 ), the (re)identification of pedestrians (Z. Zheng et al., 2017 ), and object annotation (Jorge et al., 2014 ). On the other hand, recent literature also provides examples of learning agents. Mitchell et al. ( 2015 ) present the concept of “never-ending learning” agents that strongly focus on continuously building and updating models in agents. Neuhofer et al. ( 2015 ) suggest an agent capable of personalization through a continuous learning process of guest information for digital platforms, which is an example of such an agent. Other examples include agents capable of making recommendations on music platforms (Liebman et al., 2015 ), regulating heat pump thermostats (Ruelens et al., 2015 ), acquiring collective knowledge across different tasks (Rostami et al., 2017 ), and learning the meanings of words (Yu et al., 2017 ). The choice of the learning type in agents (simple-reflex vs. learning agent) influences the agent’s overall design and the contribution of ML.
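The following sketch contrasts the two agent types from an ML perspective, assuming synthetic data and scikit-learn's SGDClassifier as a stand-in model: the simple-reflex agent keeps its once-trained model, while the learning agent updates it incrementally with new observations arriving after deployment.

```python
# Minimal sketch of the two agent types from an ML perspective (synthetic data):
# a simple-reflex agent executes a model trained once, while a learning agent
# keeps updating its model with new observations from the environment.
import numpy as np
from sklearn.linear_model import SGDClassifier

X_initial = np.array([[0.1], [0.2], [0.8], [0.9]])
y_initial = np.array([0, 0, 1, 1])

# Simple-reflex agent: the model is trained once and then left untouched.
reflex_model = SGDClassifier(random_state=0).fit(X_initial, y_initial)

# Learning agent: the learning backend feeds new labeled percepts back into the model.
learning_model = SGDClassifier(random_state=0).fit(X_initial, y_initial)
for x_new, y_new in [([0.55], 1), ([0.45], 0)]:          # data arriving after deployment
    learning_model.partial_fit(np.array([x_new]), np.array([y_new]))

print(reflex_model.predict([[0.5]]), learning_model.predict([[0.5]]))
```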

Combining the layers of agents and the types of learning, our resulting conceptual framework is shown in Fig. 2 . Regarding the previously mentioned ML methods, supervised ML can be the basis for either simple-reflex or learning agents, depending on whether the learning backend exists and on its feedback to the agent’s knowledge base. In terms of reinforcement learning, the agent, by definition, is a learning agent. However, there are also examples where an agent functions without the utilization of ML—because the execution is based on rules (H. Wang et al., 2005 ), formulas (Billings et al., 2002 ) or other methods (Abasolo & Gomez, 2000 ). From this perspective, this means there can be AI without ML.

Towards a typology for machine learning in AI systems

Based on the differentiation between simple-reflex and learning agents, we can now derive a typology for IS research. We refer to IS systems as static AI-based systems if they employ simple-reflex agents that may be based on a model trained with ML. Adaptive AI-based systems , though, use learning agents, i.e., do have a learning backend—that may be based on ML, but alternatively also could be based, e.g., on rule-based knowledge representation. We, thus, propose the typology (as depicted in Table  2 ) for AI-based IS along two dimensions: the existence of an ML basis for the executing backend and the existence of a learning backend.

We illustrate these findings with concrete IS research examples: Static AI systems are characterized by an executing backend which is based on algorithms not classified as ML, and they lack a learning backend, i.e., they have a fixed response model (Chuang & Yadav, 1997 ). The executing backend of such systems is based on rules (like nested if-else statements), formulas (like mathematical equations describing a phenomenon) or algorithms (like individual formal solution descriptions for specific problems). As an example of such systems, Hegazy et al. ( 2005 ) build a static AI system based on a self-developed algorithm and evaluate its performance within a cybersecurity context by simulating multiple attacks. Another example is provided by Ritchie ( 1990 ), who developed an architecture and an instantiation of a static AI system for a traffic management platform.

In contrast, a static ML-based AI system has an executing backend which is based on ML. An example is provided in S. He et al. ( 2018 ). The authors develop an artifact to classify marketing on Twitter as either defensive or offensive marketing and show convincing prediction results. While their work did not aim at designing a productive artifact and was rather focused on showing the general feasibility of the approach, they chose a static ML-based AI system—which, however, might not be sufficient for permanent use: After the release of the article in 2018, Twitter changed its tweet size from 140 to 280 characters, thus changing the environment. It would be interesting to see how the developed model would need to adapt to this change. As another example, Samtani et al. ( 2017 ) build a model to identify harmful code snippets, typically utilized by hackers. They show how to design an artifact that can detect these code assets accurately for proactive cyber threat intelligence. However, in this case, too, the environment and the hackers’ assets could and will change over time.

Adaptive AI systems that are not based on ML comprise an executing backend that can be dynamically adapted to changing environments. This type of system is oftentimes enabled through the interaction between humans and AI systems. Most of the time, the system provides means and triggers for updates, while the human provides “manually encoded” knowledge updates. For example, Zhou et al. ( 2009 ) implement an adaptive AI system for pipeline leak detection which is based on a rule-based expert system and offers means to update the system online. In another example, Hatzilygeroudis and Prentzas ( 2004 ) develop an adaptive AI system to support the teaching process which has a specific component for knowledge updates. Both examples are inherently knowledge-based, but are explicitly designed to allow and force updates—although not on the basis of ML.

Finally, adaptive ML-based AI systems implement learning in both sublayers of the cognition layer. For example, Q. Zheng et al. ( 2013 ) design a reinforcement-learning-based artifact to obtain information from hidden parts (“deep web”) of the internet. As their developed system perceives its current state and selects an action to submit to the environment (the deep web), the system continuously learns and builds up experience. In another example, Ghavamipoor and Hashemi Golpayegani ( 2020 ) build an adaptive ML-based AI system to predict the necessary service quality level and adapt an e-commerce system accordingly. As their system is continuously learning, their results show that total profits improve through effective cost reduction and revenue enhancement.

Conclusion

In this article, we clarify the role of machine learning (ML) in artificial intelligence (AI), particularly in intelligent agents, for the field of information systems research. Based on a rational agent view, we differentiate between AI agents capable of continuously improving and those that are static. Within these agents as instantiations of artificial intelligence, (supervised) ML can provide support in different ways: either by contributing a once-trained model to define a static response pattern or by providing an adaptive model to realize dynamic behavior. As we point out, both could also be realized without the application of ML. Thus, “ML” and “AI” are not terms that should be used interchangeably—using one or the other should be a conscious choice. Without question, ML is an important driver of AI, and the majority of modern AI cases will utilize ML. However, as we illustrate, there can be cases of AI without ML (e.g., based on rules or formulas).

This distinction enables our proposed framework to apply an intelligent agent’s perspective on AI-based information systems, enabling researchers to differentiate the existence and function of ML in them. Interestingly, as of today, many AI-based information systems remain static, i.e. employ once-trained ML models (Kühl et al., 2021 ). With increasing focus on deployment and life cycle management, we will see more adaptive AI-based systems that sense changes in the environment and use ML to learn continuously (Baier et al., 2019 ). Our framework and the resulting typology should allow IS researchers and practitioners to be more precise when referring to ML and AI, as it highlights the importance of not using the terms interchangeably but clarifying the role ML plays in AI’s system design.

Notes

As of March 2022, see https://aisnet.org/page/SeniorScholarBasket , last accessed 16.05.2022.

We start with the year 2000, as it was the last point in time when a journal (JAIS) was added to AIS Senior Scholars’ Basket.

In this case, the terms rational and intelligent are used interchangeably in related work (Gama et al., 2014 ; Koza et al., 1996 ; Russell & Norvig, 2020 ).

Russel and Norvig indicate a related relationship by differentiating between learning elements and performance elements (Russell & Norvig, 2020 ).

References

Abasolo, J. M., & Gomez, M. (2000). MELISA: An ontology-based agent for information retrieval in medicine. Proceedings of the 1st international workshop on the semantic web (SemWeb2000) , 73–82.

Abdel-Karim, B. M., Pfeuffer, N., & Hinz, O. (2021). Machine learning in information systems - a bibliographic review and open research issues. Electronic Markets, 31 (3), 643–670. https://doi.org/10.1007/s12525-021-00459-2


Ågerfalk, P. J. (2020). Artificial intelligence as digital agency. European Journal of Information Systems, 29 (1), 1–8. https://doi.org/10.1080/0960085X.2020.1721947

Alt, R. (2018). Electronic markets and current general research. Electronic Markets, 28 (2), 123–128. https://doi.org/10.1007/s12525-018-0299-0

Alt, R. (2021). Electronic markets on the next convergence. Electronic Markets, 31 (1), 1–9. https://doi.org/10.1007/s12525-021-00471-6

Arnott, D. (2006). Cognitive biases and decision support systems development: a design science approach. Information Systems Journal, 16 (1), 55–78. https://doi.org/10.1111/j.1365-2575.2006.00208.x

Arnott, D., & Pervan, G. (2005). A critical analysis of decision support systems research. Journal of Information Technology, 20 (2), 67–87. https://doi.org/10.1057/palgrave.jit.2000035

Baier, L., Kühl, N., & Satzger, G. (2019). How to cope with change? Preserving validity of predictive services over time. Hawaii International Conference on System Sciences (HICSS-52) . https://doi.org/10.5445/IR/1000085769

Bakos, J. Y., & Treacy, M. E. (1986). Information technology and corporate strategy: a research perspective. MIS Quarterly, 107–119 . https://doi.org/10.2307/249029

Bellman, R. (1978). An introduction to artificial intelligence: Can computers think? Boyd & Fraser.


Berns, K., & Hirth, J. (2006). Control of facial expressions of the humanoid robot head ROMAN. IEEE International Conference on Intelligent Robots and Systems, 3119–3124 . https://doi.org/10.1109/IROS.2006.282331

Billings, D., Davidson, A., Schaeffer, J., & Szafron, D. (2002). The challenge of poker. Artificial Intelligence, 134 (1–2), 201–240. https://doi.org/10.1016/S0004-3702(01)00130-8

Brady, M. (1984). Robotics and artificial intelligence. In M. Brady, L. A. Gerhardt, & H. F. Davidson (Eds.), Artificial intelligence (Vol. 26, Issue 1). Springer. https://doi.org/10.1007/978-3-642-82153-0


Brink, J. A. (2017). Big data management, access, and protection. Journal of the American College of Radiology, 14 (5), 579–580. https://doi.org/10.1016/j.jacr.2017.03.024

Brynjolfsson, E., & Mcafee, A. (2017). The business of artificial intelligence. Harvard Business Review , 1–20.

Camerer, C. F. (2017). Artificial intelligence and behavioral economics. In Economics of Artificial Intelligence . University of Chicago Press.

Campesato, O. (2020). Artificial intelligence, machine learning, and deep learning . Mercury Learning & Information.

Carnap, R. (1955). Meaning and synonymy in natural languages. Philosophical Studies, 6 (3), 33–47.  https://doi.org/10.1007/BF02330951

Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., & Wirth, R. (2000). CRISP-DM 1.0. CRISP-DM Consortium , 76. https://doi.org/10.1109/ICETET.2008.239

Chuang, T.-T., & Yadav, S. B. (1997). An agent-based architecture of an adaptive decision support system. Americas Conference on Information Systems, Indianapolis, IN .

Clark, A., Fox, C., & Lappin, S. (2010). The handbook of computational linguistics and natural language processing (a. Clark, C. Fox, & S. Lappin (eds.)). Wiley-Blackwell. https://doi.org/10.1002/9781444324044

Collins, C., Dennehy, D., Conboy, K., & Mikalef, P. (2021). Artificial intelligence in information systems research: A systematic literature review and research agenda. International Journal of Information Management, 60 , 102383. https://doi.org/10.1016/j.ijinfomgt.2021.102383

Copeland, M. (2016). What’s the difference between artificial intelligence, machine learning, and deep learning?  https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/ . Accessed 3 May 2022.

D’Atri, A., Marco, M., & Casalino, N. (2008). Interdisciplinary aspects of information systems studies. The Italian association for information systems . Physica Heidelberg. https://link.springer.com/book/10.1007/978-3-7908-2010-2

Davey, B., & Cope, C. (2008). Requirements elicitation – What’s missing? Issues in Informing Science and Information Technology, 5 , 543–551. https://doi.org/10.28945/1027

Dejoux, C., & Léon, E. (2018). Métamorphose des managers à l’ère du numérique et de l’intelligence artificielle . Pearson.

Dellermann, D., Lipusch, N., Ebel, P., & Leimeister, J. M. (2019). Design principles for a hybrid intelligence decision support system for business model validation. Electronic Markets, 29 (3), 423–441. https://doi.org/10.1007/s12525-018-0309-2

Dunin-Barkowski, W. (2020). Editorial: Toward and beyond human-level AI. Frontiers in Neurorobotics, 14 . https://doi.org/10.3389/fnbot.2020.617446

Engel, C., Ebel, P., & Leimeister, J. M. (2022). Cognitive automation. Electronic Markets, 32 (1), 339–350. https://doi.org/10.1007/s12525-021-00519-7

Fujii, H., & Managi, S. (2018). Trends and priority shifts in artificial intelligence technology invention: A global patent analysis. Economic Analysis and Policy, 58 , 60–69. https://doi.org/10.1016/j.eap.2017.12.006

Fukuda, T., Michelini, R., Potkonjak, V., Tzafestas, S., Valavanis, K., & Vukobratovic, M. (2001). How far away is “artificial man.” IEEE Robotics & Automation Magazine, 8 (1), 66–73. https://doi.org/10.1109/100.924367

Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014). A survey on concept drift adaptation. ACM Computing Surveys, 46 (4), 1–37. https://doi.org/10.1145/2523813

Gao, S., & Xu, D. (2009). Conceptual modeling and development of an intelligent agent-assisted decision support system for anti-money laundering. Expert Systems with Applications, 36 (2), 1493–1504. https://doi.org/10.1016/j.eswa.2007.11.059

Geller, T. (2014). How do you feel? Your computer knows. Communications of the ACM, 6 (8), 24–26. https://doi.org/10.1016/S1364-6613(02)01946-0

Ghavamipoor, H., & Hashemi Golpayegani, S. A. (2020). A reinforcement learning based model for adaptive service quality management in E-commerce websites. Business & Information Systems Engineering, 62 (2), 159–177. https://doi.org/10.1007/s12599-019-00583-6

Grosu, R. (2022). Can artificial intelligence improve our health? In Strategies for sustainability of the earth system (pp. 273–281). Springer. https://doi.org/10.1007/978-3-030-74458-8_17

Guizzo, E. (2014). How Aldebaran robotics built its friendly humanoid robot, pepper . IEEE Spectrum.  https://www.spectrum.ieee.org/how-aldebaran-robotics-built-its-friendly-humanoid-robot-pepper

Hastie, T., Tibshirani, R., & Friedman, J. (2017). The elements of statistical learning: Data mining, inference and prediction (Vol. 9). Springer.

Hatzilygeroudis, I., & Prentzas, J. (2004). Using a hybrid rule-based approach in developing an intelligent tutoring system with knowledge acquisition and update capabilities. Expert Systems with Applications, 26 (4), 477–492. https://doi.org/10.1016/j.eswa.2003.10.007

Haugeland, J. (1989). Artificial intelligence: The very idea . MIT Press.


He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . https://doi.org/10.1109/CVPR.2016.90

He, S., Rui, H., & Whinston, A. B. (2018). Social media strategies in product-harm crises. Information Systems Research, 29 (2), 362–380. https://doi.org/10.1287/isre.2017.0707

Hegazy, I. M., Faheem, H. M., Al-Arif, T., & Ahmed, T. (2005). Performance evaluation of agent-based IDS. Proceedings of the 2nd international conference on intelligent computing and information systems (ICICIS 2005) (pp. 314–319).

Hein, A., Weking, J., Schreieck, M., Wiesche, M., Böhm, M., & Krcmar, H. (2019). Value co-creation practices in business-to-business platform ecosystems. Electronic Markets, 29 (3), 503–518. https://doi.org/10.1007/s12525-019-00337-y

Hemmer, P., Schemmer, M., Vössing, M., & Kühl, N. (2021). Human-AI complementarity in hybrid intelligence systems: A structured literature review. PACIS 2021 Proceedings .

Hirt, R., Kühl, N., & Satzger, G. (2019). Cognitive computing for customer profiling: meta classification for gender prediction. Electronic Markets, 29 (1), 93–106. https://doi.org/10.1007/s12525-019-00336-z

Hunke, F., Heinz, D., & Satzger, G. (2022). Creating customer value from data: Foundations and archetypes of analytics-based services. Electronic Markets, 32 (2), 1–19. https://doi.org/10.1007/s12525-021-00506-y

ICO. (2017). Big data, artificial intelligence, machine learning and data protection .  https://www.ico.org.uk/media/for-organisations/documents/2013559/big-data-ai-ml-and-data-protection.pdf

James, H., & Paul, R. (2018). Collaborative intelligence: Humans and AI are joining forces (pp. 114–123). Harvard Business Review.

Janiesch, C., Zschech, P., & Heinrich, K. (2021). Machine learning and deep learning. Electronic Markets, 31 (3), 685–695. https://doi.org/10.1007/s12525-021-00475-2

Jordan, M. I., & Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science, 349 (6245), 255–260. https://doi.org/10.1126/science.aaa8415

Jorge, A. M., Leal, J. P., Anand, S. S., & Dias, H. (2014). A study of machine learning methods for detecting user interest during web sessions. Proceedings of the 18th International Database Engineering & Applications Symposium on - IDEAS ‘14 , 149–157. https://doi.org/10.1145/2628194.2628239

Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4 , 237–285. https://doi.org/10.1613/jair.301

Kitts, B., & Leblanc, B. (2004). Optimal bidding on keyword auctions. Electronic Markets, 14 (3), 186–201. https://doi.org/10.1080/1019678042000245119

Kloör, B., Monhof, M., Beverungen, D., & Braäer, S. (2018). Design and evaluation of a model-driven decision support system for repurposing electric vehicle batteries. European Journal of Information Systems, 27 (2), 171–188. https://doi.org/10.1057/s41303-017-0044-3

Koza, J. R., Bennett, F. H., Andre, D., & Keane, M. A. (1996). Automated design of both the topology and sizing of analog electrical circuits using genetic programming. In J. S. Gero, & F.  Sudweeks (Eds.),  Artificial Intelligence in Design ’96 . Springer. https://doi.org/10.1007/978-94-009-0279-4_9

Kühl, N., Hirt, R., Baier, L., Schmitz, B., & Satzger, G. (2021). How to conduct rigorous supervised machine learning in information systems research: The supervised machine learning report card. Communications of the Association for Information Systems, 48 (1), 589–615. https://doi.org/10.17705/1CAIS.04845

Kühl, N., Mühlthaler, M., & Goutier, M. (2020). Supporting customer-oriented marketing with artificial intelligence: Automatically quantifying customer needs from social media. Electronic Markets, 30 (2), 351–367. https://doi.org/10.1007/s12525-019-00351-0

Lange, P. G. (2008). Terminological obfuscation in online research. In Handbook of Research on Computer Mediated Communication (pp. 436–450). IGI Global. https://doi.org/10.4018/978-1-59904-863-5.ch033

Legg, S., & Hutter, M. (2007). Universal intelligence: A definition of machine intelligence. Minds and Machines, 17 (4), 391–444. https://doi.org/10.1007/s11023-007-9079-x

Liebman, E., Saar-Tsechansky, M., & Stone, P. (2015). Dj-mc: A reinforcement-learning agent for music playlist recommendation. Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems , 591–599.

Lieto, A., Bhatt, M., Oltramari, A., & Vernon, D. (2018). The role of cognitive architectures in general artificial intelligence. Cognitive Systems Research, 48 , 1–3. https://doi.org/10.1016/j.cogsys.2017.08.003

McCarthy, J., Minsky, M. L., Rochester, N., & Shannon, C. E. (1956). A proposal for the Dartmouth summer research project on artificial intelligence. Dartmouth Conference. https://doi.org/10.1609/aimag.v27i4.1904

McDermott, D., & Charniak, E. (1985). Introduction to artificial intelligence. International Journal of Adaptive Control and Signal Processing, 2 (2), 148–149.

Mitchell, T. M. (1997). Machine learning . McGraw-Hill.

Mitchell, T. M., Cohen, W., Hruschka, E., Talukdar, P., Betteridge, J., Carlson, A., Mishra, B. D., Gardner, M., Kisiel, B., Krishnamurthy, J., Lao, N., Mazaitis, K., Mohamed, T., Nakashole, N., Platanios, E. A., Ritter, A., Samadi, M., Settles, B., Wang, R., Wijaya, D., Gupta, A., Chen, X., Saparov, A., Greaves, M., & Welling, J. (2015). Never-ending learning. AAAI Conference on Artificial Intelligence , 2302–2310.

Müller, O., Junglas, I., Brocke, J. V., & Debortoli, S. (2016). Utilizing big data analytics for information systems research: Challenges, promises and guidelines. European Journal of Information Systems, 25 (4), 289–302. https://doi.org/10.1057/ejis.2016.2

Nawrocki, T., Maldjian, P. D., Slasky, S. E., & Contractor, S. G. (2018). Artificial intelligence and radiology: Have rumors of the radiologist’s demise been greatly exaggerated? Academic Radiology. https://doi.org/10.1016/j.acra.2017.12.027

Neisser, U. (1967). Cognitive psychology .

Neuhofer, B., Buhalis, D., & Ladkin, A. (2015). Smart technologies for personalized experiences: a case study in the hospitality domain. Electronic Markets, 25 (3), 243–254.  https://doi.org/10.1007/s12525-015-0182-1

Neuhofer, B., Magnus, B., & Celuch, K. (2021). The impact of artificial intelligence on event experiences: A scenario technique approach. Electronic Markets, 31 (3), 601–617.  https://doi.org/10.1007/s12525-020-00433-4

Newell, A., & Simon, H. A. (1961). GPS, a program that simulates human thought . (Report of the Defense Technical Information Center).  https://www.apps.dtic.mil/sti/citations/AD0294731

Ongsulee, P. (2017). Artificial intelligence, machine learning and deep learning. 2017 15th International Conference on ICT and Knowledge Engineering (ICT\&KE) , 1–6. https://doi.org/10.1109/ICTKE.2017.8259629

Oroszi, F., & Ruhland, J. (2010). An early warning system for hospital acquired. 18th European Conference on Information Systems (ECIS) .  https://www.aisel.aisnet.org/ecis2010/93

Phillips-Wren, G., Power, D. J., & Mora, M. (2019). Cognitive bias, decision styles, and risk attitudes in decision making and DSS . Taylor & Francis. https://doi.org/10.1080/12460125.2019.1646509

Poole, D. L., Mackworth, A., & Goebel, R. G. (1998). Computational intelligence and knowledge. Computational Intelligence: A Logical Approach, Ci , 1–22.

Power, D. J., Cyphert, D., & Roth, R. M. (2019). Analytics, bias, and evidence: The quest for rational decision making. Journal of Decision Systems, 28 (2), 120–137. https://doi.org/10.1080/12460125.2019.1623534

Rai, A., Constantinides, P., & Sarker, S. (2019). Next generation digital platforms: Toward human-AI hybrids. MIS Quarterly, 43 (1), iii–ix.

Rich, E., & Knight, K. (1991). Artificial intelligence . McGraw-Hill.

Ritchie, S. G. (1990). A knowledge-based decision support architecture for advanced traffic management. Transportation Research Part A: General, 24 (1), 27–37. https://doi.org/10.1016/0191-2607(90)90068-H

Rostami, M., Kolouri, S., Kim, K., & Eaton, E. (2017). Multi-agent distributed lifelong learning for collective knowledge acquisition. ArXiv preprint ArXiv:1709.05412 . https://doi.org/10.48550/arXiv.1709.05412

Ruelens, F., Iacovella, S., Claessens, B. J., & Belmans, R. (2015). Learning agent for a heat-pump thermostat with a set-back strategy using model-free reinforcement learning. Energies, 8 (8), 8300–8318. https://doi.org/10.3390/en8088300

Russell, S. J., & Norvig, P. (2020). Artificial intelligence: A modern approach. In Artificial Intelligence (3rd ed.). https://doi.org/10.1017/S0269888900007724

Samtani, S., Chinn, R., Chen, H., & Nunamaker Jr., J. F. (2017). Exploring emerging hacker assets and key hackers for proactive cyber threat intelligence. Journal of Management Information Systems, 34 (4), 1023–1053. https://doi.org/10.1080/07421222.2017.1394049

Schleiffer, R. (2005). An intelligent agent model. European Journal of Operational Research, 166 (3), 666–693. https://doi.org/10.1016/j.ejor.2004.03.039

Schuetz, S., & Venkatesh, V. (2020). Research perspectives: The rise of human machines: How cognitive computing systems challenge assumptions of user-system interaction. Journal of the Association for Information Systems, 21 (2), 460–482. https://doi.org/10.17705/1jais.00608

The Washington Post. (2018, April 10). Transcript of Mark Zuckerberg’s senate hearing. https://www.washingtonpost.com/news/the-switch/wp/2018/04/10/transcript-of-mark-zuckerbergs-senate-hearing/

Thiebes, S., Lins, S., & Sunyaev, A. (2021). Trustworthy artificial intelligence. Electronic Markets, 31 (2), 1–18. https://doi.org/10.1007/s12525-020-00441-4

Turing, A. M. (1950). Computing machinery and intelligence. Mind, LIX (236), 433–460. https://doi.org/10.1093/mind/LIX.236.433

Ullman, S. (2019). Using neuroscience to develop artificial intelligence. Science, 363 (6428), 692–693. https://doi.org/10.1126/science.aau6595

Wang, H., Kwong, S., Jin, Y., Wei, W., & Man, K.-F. (2005). Agent-based evolutionary approach for interpretable rule-based knowledge extraction. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 35 (2), 143–155. https://doi.org/10.1109/TSMCC.2004.841910

Wang, K., Wang, B., & Peng, L. (2009). CVAP: Validation for cluster analyses. Data Science Journal, 904220071 . https://doi.org/10.2481/dsj.007-020

Waseem, Z., & Hovy, D. (2016). Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. Proceedings of the NAACL Student Research Workshop , 88–93. https://doi.org/10.18653/v1/N16-2013

Witten, I. H., Frank, E., & Hall, M. A. (2011). Data mining: Practical machine learning tools and techniques (3rd ed.). Morgan Kaufmann.

Yu, Y., Eshghi, A., & Lemon, O. (2017). VOILA : An optimised dialogue system for interactively learning visually-grounded word meanings (demonstration system). Proceedings of the SIGDIAL 2017 Conference , 197–200.

Zhai, Z., Martínez, J. F., Beltran, V., & Martínez, N. L. (2020). Decision support systems for agriculture 4.0: Survey and challenges. Computers and Electronics in Agriculture, 170 , 105256. https://doi.org/10.1016/j.compag.2020.105256

Zheng, Q., Wu, Z., Cheng, X., Jiang, L., & Liu, J. (2013). Learning to crawl deep web. Information Systems, 38 (6), 801–819.  https://doi.org/10.1016/j.is.2013.02.001

Zheng, Z., Zheng, L., & Yang, Y. (2017). Pedestrian alignment network for large-scale person re-identification. ArXiv Preprint ArXiv:1707.00408 .

Zhou, Z.-J., Hu, C.-H., Yang, J.-B., Xu, D.-L., & Zhou, D.-H. (2009). Online updating belief rule based system for pipeline leak detection under expert intervention. Expert Systems with Applications, 36 (4), 7700–7709.  https://doi.org/10.1016/j.eswa.2008.09.032

Zhu, X. J. (2005). Semi-supervised learning literature survey . University of Wisconsin-Madison, Department of Computer Sciences. https://www.digital.library.wisc.edu/1793/60444


Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and affiliations.

Karlsruhe Institute of Technology (KIT), Kaiserstr. 89, 76133, Karlsruhe, Germany

Niklas Kühl, Max Schemmer & Gerhard Satzger

Technical University of Darmstadt (TU Darmstadt), Hochschulstr. 1, 64289, Darmstadt, Germany

Marc Goutier


Corresponding author

Correspondence to Niklas Kühl .

Additional information

Responsible Editor: Ioanna Constantiou

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .


About this article

Kühl, N., Schemmer, M., Goutier, M. et al. Artificial intelligence and machine learning. Electron Markets 32 , 2235–2244 (2022). https://doi.org/10.1007/s12525-022-00598-0

Download citation

Received : 16 May 2022

Accepted : 23 September 2022

Published : 09 November 2022

Issue Date : December 2022

DOI : https://doi.org/10.1007/s12525-022-00598-0



Computer Science > Machine Learning

Title: MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

Abstract: Multimodal emotion recognition is an important research topic in artificial intelligence. Over the past few decades, researchers have made remarkable progress by increasing dataset size and building more effective architectures. However, due to various reasons (such as complex environments and inaccurate labels), current systems still cannot meet the demands of practical applications. Therefore, we plan to organize a series of challenges around emotion recognition to further promote the development of this field. Last year, we launched MER2023, focusing on three topics: multi-label learning, noise robustness, and semi-supervised learning. This year, we continue to organize MER2024. In addition to expanding the dataset size, we introduce a new track around open-vocabulary emotion recognition. The main consideration for this track is that existing datasets often fix the label space and use majority voting to enhance annotator consistency, but this process may limit the model's ability to describe subtle emotions. In this track, we encourage participants to generate any number of labels in any category, aiming to describe the emotional state as accurately as possible. Our baseline is based on MERTools and the code is available at: this https URL .


Artificial intelligence and machine learning in spine research

Affiliation.

  • 1 Laboratory of Biological Structures Mechanics IRCCS Istituto Ortopedico Galeazzi Milan Italy.
  • PMID: 31463458
  • PMCID: PMC6686793
  • DOI: 10.1002/jsp2.1044

Artificial intelligence (AI) and machine learning (ML) techniques are revolutionizing several industrial and research fields like computer vision, autonomous driving, natural language processing, and speech recognition. These novel tools are already having a major impact in radiology, diagnostics, and many other fields in which the availability of automated solutions may benefit the accuracy and repeatability of the execution of critical tasks. In this narrative review, we first present a brief description of the various techniques that are being developed nowadays, with special focus on those used in spine research. Then, we describe the applications of AI and ML to problems related to the spine which have been published so far, including the localization of vertebrae and discs in radiological images, image segmentation, computer-aided diagnosis, prediction of clinical outcomes and complications, decision support systems, content-based image retrieval, biomechanics, and motion analysis. Finally, we briefly discuss major ethical issues related to the use of AI in healthcare, namely, accountability, risk of biased decisions as well as data privacy and security, which are nowadays being debated in the scientific community and by regulatory agencies.

Keywords: artificial neural networks; deep learning; ethical implications; outcome prediction; segmentation.




Published on 29.4.2024 in Vol 26 (2024)

The Applications of Artificial Intelligence for Assessing Fall Risk: Systematic Review

Authors of this article:


  • Ana González-Castro 1 , PT, MSc   ; 
  • Raquel Leirós-Rodríguez 2 , PT, PhD   ; 
  • Camino Prada-García 3 , MD, PhD   ; 
  • José Alberto Benítez-Andrades 4 , PhD  

1 Nursing and Physical Therapy Department, Universidad de León, Ponferrada, Spain

2 SALBIS Research Group, Nursing and Physical Therapy Department, Universidad de León, Ponferrada, Spain

3 Department of Preventive Medicine and Public Health, Universidad de Valladolid, Valladolid, Spain

4 SALBIS Research Group, Department of Electric, Systems and Automatics Engineering, Universidad de León, León, Spain

Corresponding Author:

Ana González-Castro, PT, MSc

Nursing and Physical Therapy Department

Universidad de León

Astorga Ave

Ponferrada, 24401

Phone: 34 987442000

Email: [email protected]

Background: Falls and their consequences are a serious public health problem worldwide. Each year, 37.3 million falls requiring medical attention occur. Therefore, the analysis of fall risk is of great importance for prevention. Artificial intelligence (AI) represents an innovative tool for creating predictive statistical models of fall risk through data analysis.

Objective: The aim of this review was to analyze the available evidence on the applications of AI in the analysis of data related to postural control and fall risk.

Methods: A literature search was conducted in 6 databases with the following inclusion criteria: the articles had to be published within the last 5 years (from 2018 to 2024), they had to apply some method of AI, AI analyses had to be applied to data from samples consisting of humans, and the analyzed sample had to consist of individuals with independent walking with or without the assistance of external orthopedic devices.

Results: We obtained a total of 3858 articles, of which 22 were finally selected. Data extraction for subsequent analysis varied in the different studies: 82% (18/22) of them extracted data through tests or functional assessments, and the remaining 18% (4/22) extracted data from existing medical records. Different AI techniques were used throughout the articles. All the research included in the review obtained accuracy values of >70% in the predictive models obtained through AI.

Conclusions: The use of AI proves to be a valuable tool for creating predictive models of fall risk. The use of this tool could have a significant socioeconomic impact as it enables the development of low-cost predictive models with a high level of accuracy.

Trial Registration: PROSPERO CRD42023443277; https://tinyurl.com/4sb72ssv

Introduction

According to alarming figures reported by the World Health Organization in 2021, falls cause 37.3 million injuries annually that require medical attention and result in 684,000 deaths [ 1 ]. These figures indicate a significant impact of falls on the health care system and on society, both directly and indirectly [ 2 , 3 ].

Life expectancy has progressively increased over the years, leading to an aging population [ 4 ]. By 2050, it is estimated that 16% of the population will be >65 years of age. In this group, the incidence of falls has steadily risen, becoming the leading cause of accidental injury and death (accounting for 55.8% of such deaths, according to some research) [ 5 , 6 ]. It is estimated that 30% of this population falls at least once a year, negatively impacting their physical and psychological well-being [ 7 , 8 ].

Physically, falls are often associated with severe complications that can lead to extended hospitalizations [ 9 ]. These hospitalizations are usually due to serious injuries, often cranioencephalic trauma, fractures, or soft tissue injuries [ 10 , 11 ]. Psychologically, falls among the older adult population tend to result in self-imposed limitations due to the fear of falling again [ 10 , 12 ]. These limitations lead to social isolation as individuals avoid participating in activities or even individual mobility [ 13 ]. Consequently, falls can lead to psychological conditions such as anxiety and depression [ 14 , 15 ]. Numerous research studies on the risk of falls are currently underway, with ongoing investigations into various innovations and intervention ideas [ 16 - 19 ]. These studies encompass the identification of fall risk factors [ 20 , 21 ], strategies for prevention [ 22 , 23 ], and the outcomes following rehabilitation [ 23 , 24 ].

In the health care field, artificial intelligence (AI) is characterized by data management and processing, offering new possibilities to the health care paradigm [ 24 ]. Some applications of AI in the health care domain include assessing tumor interaction processes [ 25 ], serving as a tool for image-based diagnostics [ 26 , 27 ], participating in virus detection [ 28 ], and, most importantly, as a statistical and predictive method [ 29 - 32 ].

Several publications have combined AI techniques to address health care issues [ 33 - 35 ]. Within the field of predictive models, it is important to understand certain differentiations. In AI, we have machine learning and deep learning [ 36 - 38 ]. Machine learning encompasses a set of techniques applied to data and can be done in a supervised or unsupervised manner [ 39 , 40 ]. On the other hand, deep learning is typically used to work with larger data sets compared to machine learning, and its computational cost is higher [ 41 , 42 ].

Some examples of AI techniques include the gradient boosting machine [ 43 ], a machine learning method, as well as the long short-term memory (LSTM) network [ 44 ] and the convolutional neural network (CNN) [ 45 ], both deep learning methods.
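As an illustration of the first of these techniques, the following sketch trains a gradient boosting machine on made-up fall-risk data; the features, labels, and resulting accuracy are purely synthetic assumptions and do not reflect any study included in this review.

```python
# Minimal sketch of a gradient boosting machine applied to an invented fall-risk
# data set (features and labels are purely illustrative).
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
# Hypothetical features: [age, gait speed (m/s), balance test score]
X = rng.normal(loc=[75, 1.0, 25], scale=[8, 0.2, 5], size=(200, 3))
y = ((X[:, 1] < 1.0) ^ (rng.random(200) < 0.05)).astype(int)  # noisy "faller" label

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = GradientBoostingClassifier(random_state=0).fit(X_train, y_train)
print("accuracy:", accuracy_score(y_test, model.predict(X_test)))
```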

For all the reasons mentioned above, it was considered necessary to conduct a systematic review to analyze the scientific evidence on AI applications in the analysis of data related to postural control and the risk of falls.

Data Sources and Searches

This systematic review and meta-analysis was prospectively registered on PROSPERO (ID CRD42023443277) and followed the Meta-Analyses of Observational Studies in Epidemiology checklist [ 46 ] and the recommendations of the Cochrane Collaboration [ 47 ].

The search was conducted in January 2024 on the following databases: PubMed, Scopus, ScienceDirect, Web of Science, CINAHL, and Cochrane Library. The Medical Subject Headings (MeSH) terms used for the search included machine learning, artificial intelligent, accidental falls, rehabilitation, and physical therapy specialty. The terms “predictive model” and “algorithms” were also used. These terms were combined using the Boolean operators AND and OR ( Textbox 1 ); a brief programmatic sketch of one of these queries is provided after the textbox.

PubMed

  • (“machine learning”[MeSH Terms] OR “artificial intelligent”[MeSH Terms]) AND “accidental falls”[MeSH Terms]
  • (“machine learning”[MeSH Terms] OR “artificial intelligent”) AND (“rehabilitation”[MeSH Terms] OR “physical therapy specialty”[MeSH Terms])
  • “accidental falls” [Title/Abstract] AND “algorithms” [Title/Abstract]
  • “accidental falls”[Title/Abstract] AND “predictive model” [Title/Abstract]

Scopus

  • TITLE-ABS-KEY (“machine learning” OR “artificial intelligent”) AND TITLE-ABS-KEY (“accidental falls”)
  • TITLE-ABS-KEY (“machine learning” OR “artificial intelligent”) AND TITLE-ABS-KEY (“rehabilitation” OR “physical therapy specialty”)
  • TITLE-ABS-KEY (“accidental falls” AND “algorithms”)
  • TITLE-ABS-KEY (“accidental falls” AND “predictive model”)

ScienceDirect

  • Title, abstract, keywords: (“machine learning” OR “artificial intelligent”) AND “accidental falls”
  • Title, abstract, keywords: (“machine learning” OR “artificial intelligent”) AND (“rehabilitation” OR “physical therapy specialty”)
  • Title, abstract, keywords: (“accidental falls” AND “algorithms”)
  • Title, abstract, keywords: (“accidental falls” AND “predictive model”)

Web of Science

  • TS=(“machine learning” OR “artificial intelligent”) AND TS=“accidental falls”
  • TS=(“machine learning” OR “artificial intelligent”) AND TS= (“rehabilitation” OR “physical therapy specialty”)
  • AB= (“accidental falls” AND “algorithms”)
  • AB= (“accidental falls” AND “predictive model”)

CINAHL

  • (MH “machine learning” OR MH “artificial intelligent”) AND MH “accidental falls”
  • (MH “machine learning” OR MH “artificial intelligent”) AND (MH “rehabilitation” OR MH “physical therapy specialty”)
  • (AB “accidental falls”) AND (AB “algorithms”)
  • (AB “accidental falls”) AND (AB “predictive model”)

Cochrane Library

  • (“machine learning” OR “artificial intelligent”) in Title Abstract Keyword AND “accidental falls” in Title Abstract Keyword
  • (“machine learning” OR “artificial intelligent”) in Title Abstract Keyword AND (“rehabilitation” OR “physical therapy specialty”) in Title Abstract Keyword
  • “accidental falls” in Title Abstract Keyword AND “algorithms” in Title Abstract Keyword
  • “accidental falls” in Title Abstract Keyword AND “predictive model” in Title Abstract Keyword
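
As an illustration of how one of the search strings in Textbox 1 could be executed programmatically, the sketch below sends the first PubMed query to the public NCBI E-utilities esearch endpoint. The request parameters follow the standard E-utilities interface; the retmax value and the result handling are illustrative assumptions and were not part of the review protocol.

```python
# Minimal sketch: running one of the PubMed search strings above through the
# public NCBI E-utilities "esearch" endpoint. The query text mirrors Textbox 1;
# the result handling here is illustrative only.
import requests

ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

query = ('("machine learning"[MeSH Terms] OR "artificial intelligent"[MeSH Terms]) '
         'AND "accidental falls"[MeSH Terms]')

params = {
    "db": "pubmed",
    "term": query,
    "retmode": "json",
    "retmax": 200,          # number of PMIDs to return per request (assumed value)
}

response = requests.get(ESEARCH_URL, params=params, timeout=30)
response.raise_for_status()
result = response.json()["esearchresult"]

print("Records found:", result["count"])
print("First PMIDs:", result["idlist"][:10])
```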

Study Selection

After removing duplicates, 2 reviewers (AGC and RLR) independently screened articles for eligibility. In the case of disagreement, a third reviewer (JABA) made the final decision on whether the study should be included. We calculated the κ coefficient and percentage agreement to assess interrater reliability before any consensus discussion, with κ>0.7 indicating a high level of agreement between the reviewers, κ of 0.5 to 0.7 indicating a moderate level of agreement, and κ<0.5 indicating a low level of agreement [ 48 ].
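
The sketch below shows how the interrater agreement described above can be computed in practice, using Cohen's κ on two reviewers' include/exclude decisions. The ratings are hypothetical and serve only to illustrate the calculation.

```python
# Minimal sketch: computing Cohen's kappa for two reviewers' include/exclude
# decisions (hypothetical ratings), as used to quantify interrater agreement.
from sklearn.metrics import cohen_kappa_score

# 1 = include, 0 = exclude; one entry per screened record (illustrative data)
reviewer_1 = [1, 0, 0, 1, 1, 0, 1, 0, 0, 1]
reviewer_2 = [1, 0, 0, 1, 0, 0, 1, 0, 0, 1]

kappa = cohen_kappa_score(reviewer_1, reviewer_2)
agreement = sum(a == b for a, b in zip(reviewer_1, reviewer_2)) / len(reviewer_1)

print(f"Percentage agreement: {agreement:.0%}")
print(f"Cohen's kappa: {kappa:.2f}")   # values >0.7 would indicate high agreement
```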

For the selection of results, the inclusion criteria were established as follows: (1) articles should have been published in the last 5 years (from 2018 onward); (2) they must apply some AI method; (3) the AI analyses should be applied to data from human samples; and (4) the sample analyzed should consist of people able to walk independently, with or without the use of external orthopedic devices.

After the titles and abstracts were screened against the inclusion criteria, the full texts of the selected records were retrieved. Titles and abstracts lacking sufficient information regarding the inclusion criteria were also retrieved as full texts. Full-text articles were selected when the 2 reviewers, using a data extraction form, confirmed that they complied with the inclusion criteria.

Data Extraction and Quality Assessment

The 2 reviewers mentioned above independently extracted data from the included studies using a customized data extraction table in Excel (Microsoft Corporation). In case of disagreement, both reviewers debated until an agreement was reached.

The data extracted from the included articles for further analysis were: demographic information (title, authors, journal, and year), characteristics of the sample (age, inclusion and exclusion criteria, and number of participants), study-specific parameters (study type, AI techniques applied, and data analyzed), and the results obtained. Tables were used to describe both the studies’ characteristics and the extracted data.

Assessment of Risk of Bias

The methodological quality of the selected articles was evaluated using the Critical Review Form for Quantitative Studies [ 49 ]. The ROBINS-E (Risk of Bias in Nonrandomized Studies of Exposures) tool was used to evaluate the risk of bias [ 50 ].

Characteristics of the Selected Studies

A total of 3858 articles were initially retrieved, with 1563 duplicates removed. From the remaining 2295 articles, 2271 were excluded based on the initial selection criteria, leaving 24 articles for subsequent analysis. In this second analysis, 2 articles were removed because they were systematic reviews, and 22 articles were finally selected [ 51 - 72 ] ( Figure 1 ). After the first reading of all candidate full texts, the κ score for inclusion between reviewers 1 and 2 was 0.98, indicating a very high level of agreement.

The methodological quality of the 22 analyzed studies (Table S1 in Multimedia Appendix 1 [ 51 , 52 , 54 , 56 , 58 , 59 , 61 , 63 , 64 , 69 , 70 , 72 ]) ranged from 11 points in 2 (9.1%) studies [ 52 , 65 ] to 16 points in 7 (32%) studies [ 53 , 54 , 56 , 63 , 69 - 71 ].

Study Characteristics and Risk of Bias

All the selected articles were cross-sectional observational studies ( Table 1 ).

In total, 34 characteristics affecting the risk of falls were extracted and used to classify participants into high and low fall-risk groups. Sample sizes differed markedly across studies. Studies based on data collected from various health care systems had larger sample sizes, ranging from 22,515 to 265,225 participants [ 60 , 65 , 67 ]. In contrast, studies that applied some form of evaluation test had sample sizes ranging from 8 participants [ 56 ] to 746 participants [ 55 ].

It is worth noting the studies conducted by Dubois et al [ 54 , 72 ], whose publications on fall risk and machine learning span from 2018 to 2021. In total, 9.1% (2/22) of the articles included in the final selection were authored by this group [ 54 , 72 ]. Both articles used samples with the same characteristics, although the first comprised 43 participants [ 54 ] and the second 30 participants [ 72 ]. Overall, 86.4% (19/22) of the articles used samples of individuals aged ≥65 years [ 51 - 60 , 62 - 65 , 68 - 72 ]. In the remaining 13.6% (3/22) of the articles, the ages ranged between 16 and 62 years [ 61 , 66 , 67 ].

Althobaiti et al [ 61 ] used a sample of participants between the ages of 19 and 35 years for their research, where these participants had to reproduce examples of falls for subsequent analysis. In 2022, Ladios-Martin et al [ 67 ] extracted medical data from participants aged >16 years for their research. Finally, in 2023, the study by Maray et al [ 66 ] used 3 types of samples, with ages ranging from 21 to 62 years. Among the 22 selected articles, only 1 (4.5%) of them did not describe the characteristics of its sample [ 52 ].

Finally, regarding the sex of the samples, 13.6% (3/22) of the articles specified in the characteristics of their samples that only female individuals were included among their participants [ 53 , 59 , 70 ].

a AI: artificial intelligence.

b ML: machine learning.

c nd: none described.

d ADL: activities of daily living.

e TUG: Timed Up and Go.

f BBS: Berg Balance Scale.

g ASM: associative skill memories.

h CNN: convolutional neural network.

i FP: fall prevention.

j IMU: inertial measurement unit.

k AUROC: area under the receiver operating characteristic curve.

l AUPR: area under the precision-recall curve.

m MFS: Morse Fall Scale.

n XGB: extreme gradient boosting.

o MCT: motor control test.

p GBM: gradient boosting machine.

q RF: random forest.

r LOOCV: leave-one-out cross-validation.

s LSTM: long short-term memory.

Applied Assessment Procedures

All articles initially analyzed the characteristics of their samples to subsequently create a predictive model of the risk of falls. However, they did not all follow the same evaluation process.

Regarding the applied assessment procedures, 3 main options stood out: studies with tests or assessments accompanied by sensors or accelerometers [ 51 - 57 , 59 , 61 - 63 , 66 , 70 - 72 ], studies with tests or assessments accompanied by cameras [ 68 , 69 ], and studies based on medical records [ 58 , 60 , 65 , 67 ] ( Figure 2 ). Gillain et al [ 64 ] performed a physical and functional evaluation of the participants. In their study, they evaluated parameters such as walking speed, stride frequency and length, and the minimum space between the toes. Afterward, they asked the participants to record their fall events over a 2-year period in a personal diary.

In total, 22.7% (5/22) of the studies used the Timed Up and Go test [ 53 , 54 , 69 , 71 , 72 ]. In 18.2% (4/22) of them, the participants performed the test while wearing a sensor to collect data [ 53 , 54 , 71 , 72 ]. In 1 (4.5%) study, the test was recorded with a camera for later analysis [ 69 ]. Another commonly used method in studies was to ask participants to perform everyday tasks or activities of daily living while a sensor collected data for subsequent analysis. Specifically, 18.2% (4/22) of the studies used this method to gather data [ 51 , 56 , 61 , 62 ].

A total of 22.7% (5/22) of the studies asked participants to simulate falls and nonfalls while a sensor collected data [ 52 , 61 - 63 , 66 ]. In this way, the data obtained were used to create the predictive model of falls. As for the tests used, Eichler et al [ 68 ] asked participants to perform the Berg Balance Scale while a camera recorded their performance.

Finally, other authors created their own battery of tests for data extraction [ 55 , 59 , 64 , 70 ]. Gillain et al [ 64 ] used gait records to analyze speed, stride length, frequency, symmetry, regularity, and foot separation. Hu et al [ 59 ] asked their participants to perform normal walking, the postural reflexive response test, and the motor control test. In the study by Noh et al [ 55 ], gait tests were conducted, involving walking 20 m at different speeds. Finally, Greene et al [ 70 ] created a 12-question questionnaire and asked their participants to maintain balance while holding a mobile phone in their hand.
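
As a rough illustration of what such sensor-based assessments yield, the sketch below derives a few simple descriptive features (signal variability and the dominant movement frequency) from a synthetic triaxial accelerometer recording. The sampling rate, signal model, and feature choices are assumptions for illustration and do not reproduce any study's processing pipeline.

```python
# Minimal sketch: deriving simple descriptive features from a triaxial
# accelerometer recording (e.g. collected during a TUG or walking test).
# The signal here is synthetic; feature choices are illustrative only.
import numpy as np

fs = 100                                   # sampling rate in Hz (assumed)
t = np.arange(0, 20, 1 / fs)               # 20 s recording
rng = np.random.default_rng(1)

# Synthetic tri-axial signal: gravity + a ~1.8 Hz gait oscillation + noise
acc = np.stack([
    0.3 * np.sin(2 * np.pi * 1.8 * t),
    0.2 * np.sin(2 * np.pi * 1.8 * t + 0.5),
    9.81 + 0.4 * np.sin(2 * np.pi * 1.8 * t),
], axis=1) + rng.normal(scale=0.05, size=(t.size, 3))

magnitude = np.linalg.norm(acc, axis=1)

# Simple features often used in fall-risk work: signal variability and the
# dominant frequency of the movement (a rough proxy for cadence).
features = {
    "mean_magnitude": magnitude.mean(),
    "std_magnitude": magnitude.std(),
}
spectrum = np.abs(np.fft.rfft(magnitude - magnitude.mean()))
freqs = np.fft.rfftfreq(magnitude.size, d=1 / fs)
features["dominant_freq_hz"] = freqs[spectrum.argmax()]

print(features)
```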

AI Techniques

The selected articles used various techniques within AI. They all had the same objective in applying these techniques, which was to achieve a predictive and classification model for the risk of falls [ 51 - 72 ].

In chronological order, in 2018, Nait Aicha et al [ 51 ] compared single-task learning models with multitask learning, obtaining better evaluation results through multitask learning. In the same year, Dubois et al [ 54 ] applied AI techniques that analyzed multiple parameters to classify the risk of falls in their sample. Qiu et al [ 53 ], also in the same year, used 6 machine learning models (logistic regression, naïve Bayes, decision tree, random forest [RF], boosted tree, and support vector machine) in their research.

In contrast, in 2019, Ribeiro et al [ 52 ] compared the applicability of 2 different deep learning models: a classifier based on associative skill memories and a CNN classifier. In the same year, after confirming the applicability of AI as a predictive method for the risk of falls, various authors used methods such as RF to identify factors that can predict and quantify the risk of falls [ 63 , 65 ].

Among the selected articles, 5 (22.7%) were published in 2020 [ 58 - 62 ]. The research conducted by Tunca et al [ 62 ] compared the applicability of deep learning LSTM networks with traditional machine learning for fall risk assessment. Hu et al [ 59 ] first applied cross-validation, in which the algorithms were trained on randomly partitioned data, and then used the gradient boosting machine algorithm to classify participants as high or low risk. Ye et al [ 60 ] and Hsu et al [ 58 ] both used the machine learning-based extreme gradient boosting (XGBoost) algorithm to create their predictive models. In the same year, Althobaiti et al [ 61 ] trained machine learning models for their research.
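
The sketch below outlines, in generic form, the kind of workflow described in this paragraph: an XGBoost classifier evaluated with stratified cross-validation and the area under the receiver operating characteristic curve. The features, labels, and hyperparameters are synthetic and illustrative, not those of the cited studies.

```python
# Minimal sketch: an XGBoost classifier evaluated with stratified
# cross-validation and AUROC, loosely mirroring the workflow described above.
# Data are synthetic; hyperparameters are illustrative, not from any study.
import numpy as np
from xgboost import XGBClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

rng = np.random.default_rng(0)
n = 500
X = rng.normal(size=(n, 8))          # stand-in for clinical or record-based features
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=1.0, size=n) > 0).astype(int)

model = XGBClassifier(n_estimators=200, max_depth=4, learning_rate=0.05)

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
auroc = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
print("Cross-validated AUROC: mean =", auroc.mean().round(2), "std =", auroc.std().round(2))
```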

In 2021, Lockhart et al [ 57 ] used 3 AI techniques simultaneously with the same goal as before: to create a predictive model for the risk of falls. Specifically, they used RF, RF with feature engineering, and RF with feature engineering plus linear and nonlinear variables. Noh et al [ 55 ], in the same year, used the XGBoost algorithm, while Roshdibenam et al [ 71 ] used a CNN algorithm for each location of the wearable sensors used in their research. Hauth et al [ 56 ] used various machine learning techniques, namely regularized logistic regression and bidirectional LSTM networks, to classify fall risk and balance loss events. Dubois et al [ 72 ] used the following algorithms: decision tree, adaptive boosting, neural net, naïve Bayes, k-nearest neighbors, linear support vector machine, radial basis function support vector machine, RF, and quadratic discriminant analysis. In the research conducted by Greene et al [ 70 ], AI was used, but the specific procedure followed is not described.
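
For readers unfamiliar with the deep learning side of these methods, the sketch below defines a small bidirectional LSTM that classifies fixed-length windows of inertial sensor data as balance loss versus normal movement. The window length, channel count, and architecture are assumptions chosen for illustration; they are not taken from the cited work.

```python
# Minimal sketch: a small bidirectional LSTM that classifies fixed-length
# windows of inertial sensor data as "balance loss" vs "normal". Architecture,
# window length, and channel count are assumptions, not from the cited study.
import numpy as np
import tensorflow as tf

window_len, n_channels = 200, 6      # e.g. 2 s at 100 Hz, accelerometer + gyroscope (assumed)

model = tf.keras.Sequential([
    tf.keras.Input(shape=(window_len, n_channels)),
    tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(32)),
    tf.keras.layers.Dropout(0.3),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Synthetic windows and labels, purely to show the expected shapes
X = np.random.randn(64, window_len, n_channels).astype("float32")
y = np.random.randint(0, 2, size=(64, 1))
model.fit(X, y, epochs=1, batch_size=16, verbose=0)
```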

In 2022, Tang et al [ 69 ] published research that was innovative up to that point: they used a vision-based smart gait analyzer, supported by deep learning techniques, to assess the diagnostic accuracy of fall risk screening. Months later, in August 2022, Ladios-Martin et al [ 67 ] published their research, in which they compared 2 deep learning models to achieve the best results in terms of specificity and sensitivity in detecting fall risk. The first model used the Bayesian Point Machine algorithm with a fall prevention variable, and the second did not use this variable. They obtained better results when using the variable, a mitigating factor defined as a set of care interventions carried out by professionals to prevent the patient from experiencing a fall during hospitalization. This variable is particularly controversial, as its exclusion could obscure the model's performance. Eichler et al [ 68 ], on the other hand, used machine learning-based classifier training and later tested the performance of RFs in score predictions.

Finally, in January 2023, Maray et al [ 66 ] published their research, linking the previously mentioned terms (AI and fall risk) with 3 wearable devices that are commonly used today. They collected data through these devices and applied transfer learning to generalize the model across heterogeneous devices.
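
Transfer learning across heterogeneous devices can take several forms; one common pattern is to pretrain a model on data from a source device and then fine-tune only the classification head on a small amount of data from the target device. The sketch below illustrates that generic pattern with synthetic data; it is not the specific method used by the cited authors.

```python
# Minimal sketch of one common transfer-learning pattern for heterogeneous
# wearables: pretrain on data from a source device, then freeze the feature
# extractor and fine-tune only the classification head on a small set of
# windows from a new device. This is a generic pattern, not the cited method.
import numpy as np
import tensorflow as tf

def make_model(window_len=128, n_channels=3):
    inputs = tf.keras.Input(shape=(window_len, n_channels))
    x = tf.keras.layers.Conv1D(32, 5, activation="relu")(inputs)
    x = tf.keras.layers.GlobalAveragePooling1D(name="features")(x)
    outputs = tf.keras.layers.Dense(1, activation="sigmoid")(x)
    return tf.keras.Model(inputs, outputs)

model = make_model()
model.compile(optimizer="adam", loss="binary_crossentropy")

# 1) Pretrain on (synthetic) source-device data
X_src = np.random.randn(256, 128, 3).astype("float32")
y_src = np.random.randint(0, 2, size=(256, 1))
model.fit(X_src, y_src, epochs=1, verbose=0)

# 2) Freeze everything except the final classification layer
for layer in model.layers[:-1]:
    layer.trainable = False
model.compile(optimizer="adam", loss="binary_crossentropy")  # re-compile after freezing

# 3) Fine-tune on a small (synthetic) target-device set
X_tgt = np.random.randn(32, 128, 3).astype("float32")
y_tgt = np.random.randint(0, 2, size=(32, 1))
model.fit(X_tgt, y_tgt, epochs=1, verbose=0)
```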

The results of the 22 articles provided promising data, and all of them agreed on the feasibility of applying various AI techniques as a method for predicting and classifying the risk of falls. Specifically, the accuracy values obtained in the studies exceeded 70%. Noh et al [ 55 ] achieved the "lowest" accuracy among the studies conducted, with a 70% accuracy rate. Ribeiro et al [ 52 ] obtained an accuracy of 92.7% when using a CNN to differentiate between normal gait and fall events. Hsu et al [ 58 ] further demonstrated that the XGBoost model is more sensitive than the Morse Fall Scale. Similarly, in their comparative study, Nait Aicha et al [ 51 ] showed that a predictive model created from accelerometer data with AI is comparable to conventional models for assessing the risk of falls. More specifically, Dubois et al [ 54 ] concluded that using 1 gait-related parameter (excluding velocity) in combination with another parameter related to the seated position allowed for the correct classification of individuals according to their risk of falls.

Principal Findings

The aim of this research was to analyze the scientific evidence regarding the applications of AI in the analysis of data related to postural control and the risk of falls. On the basis of the analysis of the results, the following risk factors were identified in the analyzed studies: age [ 65 ], daily habits [ 65 ], clinical diagnoses [ 65 ], environmental and hygiene factors [ 65 ], sex [ 64 ], stride length [ 55 , 72 ], gait speed [ 55 ], and posture [ 55 ]. This aligns with other research that also identifies sex [ 73 , 74 ], age [ 73 ], and gait speed [ 75 ] as risk factors.

On the other hand, the “fear of falling” has been identified in various studies as a risk factor and a predictor of falls [ 73 , 76 ], but it was not identified in any of the studies included in this review.

As for the characteristics of the analyzed samples, only 9.1% (2/22) of the articles used a sample composed exclusively of women [ 53 , 59 ], and no article used a sample composed exclusively of men. This contrasts with the epidemiological reality: women have a longer life expectancy than men, and therefore, the number of women aged >65 years is greater than the number of men of the same age [ 77 ]. Furthermore, women experience more falls than men [ 78 ]. The connection between menopause and its consequences, including osteopenia, suggests a higher risk of falls among older women than among men of the same age [ 79 , 80 ].

Within the realm of analysis tools, the most frequently used devices to analyze participants were accelerometers [ 51 - 57 , 59 , 61 - 63 , 66 , 70 - 72 ]. However, only 36.4% (8/22) of the studies provided all the information regarding the characteristics of these devices [ 51 , 53 , 59 , 61 , 63 , 66 , 70 , 72 ]. On the other hand, 18.2% (4/22) of the studies used the term “inertial measurement unit” as the sole description of the devices used [ 55 - 57 , 71 ].

The fact that most of the analyzed procedures involved the use of inertial sensors reflects the current widespread use of these devices for postural control analysis. These sensors, in general (and triaxial accelerometers in particular), have demonstrated great diagnostic capacity for balance [ 81 ]. In addition, they exhibit good sensitivity and reliability, combined with their portability and low economic cost [ 82 ]. Another advantage of triaxial accelerometers is their versatility in both adult and pediatric populations [ 83 - 86 ], although the studies included in this review did not address the pediatric population.

The remaining studies extracted data from cameras [ 68 , 69 ], medical records [ 58 , 60 , 65 , 67 ], and other functional and clinical tests [ 59 , 64 , 70 ]. Regarding the AI techniques used, of the 18.2% (4/22) of articles that used deep learning techniques [ 52 , 57 , 62 , 71 ], only 4.5% (1/22) did not provide a description of the sample characteristics [ 52 ]. In this case, the authors focused on the AI landscape, while the rest of the articles struck a balance between AI and the health sciences.

Regarding the validity of the generated models, only 40.9% (9/22) of the articles assessed this characteristic [ 52 , 53 , 55 , 61 - 64 , 68 , 69 ]. The authors of these 9 (N=22, 40.9%) articles evaluated the validity of the models through accuracy. All the results obtained reflected accuracies exceeding 70%, with Ribeiro et al [ 52 ] achieving notable accuracies of 92.7% and 100%. Specifically, they obtained 92.7% accuracy with the CNN model for distinguishing normal gait, the prefall condition, and the falling situation when considering the step before the fall, and 100% when not considering it [ 52 ].

The positive results for sensitivity and specificity can only be compared between the studies by Qiu et al [ 53 ] and Gillain et al [ 64 ], as they were the only ones to take these measures into account; in both investigations, the values were very high. Similarly, in the case of the F1-score, only Althobaiti et al [ 61 ] examined this validity measure. The F1-score combines precision and recall into a single figure, and the outcome obtained by these researchers was promising.
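
For reference, the sketch below computes the validity measures discussed here (accuracy, sensitivity, specificity, and F1-score) from a hypothetical set of true and predicted fall-risk labels.

```python
# Minimal sketch: computing the validity measures discussed above (accuracy,
# sensitivity, specificity, and F1-score) from hypothetical model predictions.
from sklearn.metrics import confusion_matrix, accuracy_score, f1_score

y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]   # 1 = high fall risk (illustrative labels)
y_pred = [1, 0, 1, 0, 0, 0, 1, 1, 1, 0]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

sensitivity = tp / (tp + fn)               # recall for the "high risk" class
specificity = tn / (tn + fp)

print("Accuracy:   ", accuracy_score(y_true, y_pred))
print("Sensitivity:", sensitivity)
print("Specificity:", specificity)
print("F1-score:   ", f1_score(y_true, y_pred))
```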

Despite these differences, the 22 studies obtained promising results in the health care field [ 51 - 72 ]. Specifically, their outcomes highlight the potential of AI integration into clinical settings. However, further research is necessary to explore how health care professionals can effectively use these predictive models. Consequently, future research should focus on studying the application and integration of the already-developed models. In this context, fall prevention plans could be implemented for the target populations identified by the predictive models. This approach would allow for a retrospective analysis to determine if the combination of predictive models with prevention programs effectively reduces the prevalence of falls in the population.

Limitations

Regarding limitations, the articles showed significant variation in the sample sizes selected. Moreover, even in the study with the largest sample size (with 265,225 participants [ 60 ]), the amount of data analyzed was relatively small. In addition, several of the databases used were not generated specifically for the published research but rather derived from existing medical records [ 58 , 60 , 65 , 67 ]. This could explain the significant variability in the variables analyzed across different studies.

Despite the limitations, this research has strengths, such as being the first systematic review on the use of AI as a tool to analyze postural control and the risk of falls. Furthermore, a total of 6 databases were used for the literature search, and a comprehensive article selection process was carried out by 3 researchers. Finally, only cross-sectional observational studies were selected, and they shared the same objective.

Conclusions

The use of AI in the analysis of data related to postural control and the risk of falls proves to be a valuable tool for creating predictive models of fall risk. It has been identified that most AI studies analyze accelerometer data from sensors, with triaxial accelerometers being the most frequently used.

For future research, it would be beneficial to provide more detailed descriptions of the measurement procedures and the AI techniques used. In addition, exploring larger databases could lead to the development of more robust models.

Conflicts of Interest

None declared.

Multimedia Appendix 1: Quality scores of reviewed studies (Critical Review Form for Quantitative Studies tool results).

Multimedia Appendix 2: PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist.

  • Step safely: strategies for preventing and managing falls across the life-course. World Health Organization. 2021. URL: https://www.who.int/publications/i/item/978924002191-4 [accessed 2024-04-02]
  • Keall MD, Pierse N, Howden-Chapman P, Guria J, Cunningham CW, Baker MG. Cost-benefit analysis of fall injuries prevented by a programme of home modifications: a cluster randomised controlled trial. Inj Prev. Feb 2017;23(1):22-26. [ CrossRef ] [ Medline ]
  • Almada M, Brochado P, Portela D, Midão L, Costa E. Prevalence of falls and associated factors among community-dwelling older adults: a cross-sectional study. J Frailty Aging. 2021;10(1):10-16. [ CrossRef ] [ Medline ]
  • Menéndez-González L, Izaguirre-Riesgo A, Tranche-Iparraguirre S, Montero-Rodríguez Á, Orts-Cortés MI. [Prevalence and associated factors of frailty in adults over 70 years in the community]. Aten Primaria. Dec 2021;53(10):102128. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Guirguis-Blake JM, Michael YL, Perdue LA, Coppola EL, Beil TL. Interventions to prevent falls in older adults: updated evidence report and systematic review for the US preventive services task force. JAMA. Apr 24, 2018;319(16):1705-1716. [ CrossRef ] [ Medline ]
  • Pereira CB, Kanashiro AM. Falls in older adults: a practical approach. Arq Neuropsiquiatr. May 2022;80(5 Suppl 1):313-323. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Byun M, Kim J, Kim M. Physical and psychological factors affecting falls in older patients with arthritis. Int J Environ Res Public Health. Feb 09, 2020;17(3):1098. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Goh HT, Nadarajah M, Hamzah NB, Varadan P, Tan MP. Falls and fear of falling after stroke: a case-control study. PM R. Dec 04, 2016;8(12):1173-1180. [ CrossRef ] [ Medline ]
  • Alanazi FK, Lapkin S, Molloy L, Sim J. The impact of safety culture, quality of care, missed care and nurse staffing on patient falls: a multisource association study. J Clin Nurs. Oct 12, 2023;32(19-20):7260-7272. [ CrossRef ] [ Medline ]
  • Hossain A, Lall R, Ji C, Bruce J, Underwood M, Lamb SE. Comparison of different statistical models for the analysis of fracture events: findings from the Prevention of Falls Injury Trial (PreFIT). BMC Med Res Methodol. Oct 02, 2023;23(1):216. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Williams CT, Whyman J, Loewenthal J, Chahal K. Managing geriatric patients with falls and fractures. Orthop Clin North Am. Jul 2023;54(3S):e1-12. [ CrossRef ] [ Medline ]
  • Gadhvi C, Bean D, Rice D. A systematic review of fear of falling and related constructs after hip fracture: prevalence, measurement, associations with physical function, and interventions. BMC Geriatr. Jun 23, 2023;23(1):385. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Lohman MC, Fallahi A, Mishio Bawa E, Wei J, Merchant AT. Social mediators of the association between depression and falls among older adults. J Aging Health. Aug 12, 2023;35(7-8):593-603. [ CrossRef ] [ Medline ]
  • Smith AD, Silva AO, Rodrigues RA, Moreira MA, Nogueira JD, Tura LF. Assessment of risk of falls in elderly living at home. Rev Lat Am Enfermagem. Apr 06, 2017;25:e2754. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Koh V, Matchar DB, Chan A. Physical strength and mental health mediate the association between pain and falls (recurrent and/or injurious) among community-dwelling older adults in Singapore. Arch Gerontol Geriatr. Sep 2023;112:105015. [ CrossRef ] [ Medline ]
  • Soh SE, Morgan PE, Hopmans R, Barker AL, Ackerman IN. The feasibility and acceptability of a falls prevention e-learning program for physiotherapists. Physiother Theory Pract. Mar 18, 2023;39(3):631-640. [ CrossRef ] [ Medline ]
  • Morat T, Snyders M, Kroeber P, De Luca A, Squeri V, Hochheim M, et al. Evaluation of a novel technology-supported fall prevention intervention - study protocol of a multi-centre randomised controlled trial in older adults at increased risk of falls. BMC Geriatr. Feb 18, 2023;23(1):103. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • You T, Koren Y, Butts WJ, Moraes CA, Yeh GY, Wayne PM, et al. Pilot studies of recruitment and feasibility of remote Tai Chi in racially diverse older adults with multisite pain. Contemp Clin Trials. May 2023;128:107164. [ CrossRef ] [ Medline ]
  • Aldana-Benítez D, Caicedo-Pareja MJ, Sánchez DP, Ordoñez-Mora LT. Dance as a neurorehabilitation strategy: a systematic review. J Bodyw Mov Ther. Jul 2023;35:348-363. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Jawad A, Baattaiah BA, Alharbi MD, Chevidikunnan MF, Khan F. Factors contributing to falls in people with multiple sclerosis: the exploration of the moderation and mediation effects. Mult Scler Relat Disord. Aug 2023;76:104838. [ CrossRef ] [ Medline ]
  • Warren C, Rizo E, Decker E, Hasse A. A comprehensive analysis of risk factors associated with inpatient falls. J Patient Saf. Oct 01, 2023;19(6):396-402. [ CrossRef ] [ Medline ]
  • Gross M, Roigk P, Schoene D, Ritter Y, Pauly P, Becker C, et al. Bundesinitiative Sturzprävention. [Update of the recommendations of the federal falls prevention initiative-identification and prevention of the risk of falling in older people living at home]. Z Gerontol Geriatr. Oct 11, 2023;56(6):448-457. [ CrossRef ] [ Medline ]
  • Li S, Li Y, Liang Q, Yang WJ, Zi R, Wu X, et al. Effects of tele-exercise rehabilitation intervention on women at high risk of osteoporotic fractures: study protocol for a randomised controlled trial. BMJ Open. Nov 07, 2022;12(11):e064328. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: past, present and future. Stroke Vasc Neurol. Dec 2017;2(4):230-243. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Ye Y, Wu X, Wang H, Ye H, Zhao K, Yao S, et al. Artificial intelligence-assisted analysis for tumor-immune interaction within the invasive margin of colorectal cancer. Ann Med. Dec 2023;55(1):2215541. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Kuwahara T, Hara K, Mizuno N, Haba S, Okuno N, Fukui T, et al. Current status of artificial intelligence analysis for the treatment of pancreaticobiliary diseases using endoscopic ultrasonography and endoscopic retrograde cholangiopancreatography. DEN Open. Apr 30, 2024;4(1):e267. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Yokote A, Umeno J, Kawasaki K, Fujioka S, Fuyuno Y, Matsuno Y, et al. Small bowel capsule endoscopy examination and open access database with artificial intelligence: the SEE-artificial intelligence project. DEN Open. Apr 22, 2024;4(1):e258. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Ramalingam M, Jaisankar A, Cheng L, Krishnan S, Lan L, Hassan A, et al. Impact of nanotechnology on conventional and artificial intelligence-based biosensing strategies for the detection of viruses. Discov Nano. Dec 01, 2023;18(1):58. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Yerukala Sathipati S, Tsai MJ, Shukla SK, Ho SY. Artificial intelligence-driven pan-cancer analysis reveals miRNA signatures for cancer stage prediction. HGG Adv. Jul 13, 2023;4(3):100190. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Liu J, Dan W, Liu X, Zhong X, Chen C, He Q, et al. Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine. Health Inf Sci Syst. Dec 06, 2023;11(1):21. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Carou-Senra P, Ong JJ, Castro BM, Seoane-Viaño I, Rodríguez-Pombo L, Cabalar P, et al. Predicting pharmaceutical inkjet printing outcomes using machine learning. Int J Pharm X. Dec 2023;5:100181. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Li X, Zhu Y, Zhao W, Shi R, Wang Z, Pan H, et al. Machine learning algorithm to predict the in-hospital mortality in critically ill patients with chronic kidney disease. Ren Fail. Dec 2023;45(1):2212790. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Bonnin M, Müller-Fouarge F, Estienne T, Bekadar S, Pouchy C, Ait Si Selmi T. Artificial intelligence radiographic analysis tool for total knee arthroplasty. J Arthroplasty. Jul 2023;38(7 Suppl 2):S199-207.e2. [ CrossRef ] [ Medline ]
  • Kao DP. Intelligent artificial intelligence: present considerations and future implications of machine learning applied to electrocardiogram interpretation. Circ Cardiovasc Qual Outcomes. Sep 2019;12(9):e006021. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • van der Stigchel B, van den Bosch K, van Diggelen J, Haselager P. Intelligent decision support in medical triage: are people robust to biased advice? J Public Health (Oxf). Aug 28, 2023;45(3):689-696. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Jakhar D, Kaur I. Artificial intelligence, machine learning and deep learning: definitions and differences. Clin Exp Dermatol. Jan 09, 2020;45(1):131-132. [ CrossRef ] [ Medline ]
  • Ghosh M, Thirugnanam A. Introduction to artificial intelligence. In: Srinivasa KG, Siddesh GM, Sekhar SR, editors. Artificial Intelligence for Information Management: A Healthcare Perspective. Cham, Switzerland. Springer; 2021;88-44.
  • Taulli T. Artificial Intelligence Basics: A Non-Technical Introduction. Berkeley, CA. Apress Berkeley; 2019.
  • Patil S, Joda T, Soffe B, Awan KH, Fageeh HN, Tovani-Palone MR, et al. Efficacy of artificial intelligence in the detection of periodontal bone loss and classification of periodontal diseases: a systematic review. J Am Dent Assoc. Sep 2023;154(9):795-804.e1. [ CrossRef ] [ Medline ]
  • Quek LJ, Heikkonen MR, Lau Y. Use of artificial intelligence techniques for detection of mild cognitive impairment: a systematic scoping review. J Clin Nurs. Sep 10, 2023;32(17-18):5752-5762. [ CrossRef ] [ Medline ]
  • Tan D, Mohd Nasir NF, Abdul Manan H, Yahya N. Prediction of toxicity outcomes following radiotherapy using deep learning-based models: a systematic review. Cancer Radiother. Sep 2023;27(5):398-406. [ CrossRef ] [ Medline ]
  • Rabilloud N, Allaume P, Acosta O, De Crevoisier R, Bourgade R, Loussouarn D, et al. Deep learning methodologies applied to digital pathology in prostate cancer: a systematic review. Diagnostics (Basel). Aug 14, 2023;13(16):2676. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Li K, Yao S, Zhang Z, Cao B, Wilson C, Kalos D, et al. Efficient gradient boosting for prognostic biomarker discovery. Bioinformatics. Mar 04, 2022;38(6):1631-1638. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Chen T, Chen Y, Li H, Gao T, Tu H, Li S. Driver intent-based intersection autonomous driving collision avoidance reinforcement learning algorithm. Sensors (Basel). Dec 16, 2022;22(24):9943. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Huynh QT, Nguyen PH, Le HX, Ngo LT, Trinh NT, Tran MT, et al. Automatic acne object detection and acne severity grading using smartphone images and artificial intelligence. Diagnostics (Basel). Aug 03, 2022;12(8):1879. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Brooke BS, Schwartz TA, Pawlik TM. MOOSE reporting guidelines for meta-analyses of observational studies. JAMA Surg. Aug 01, 2021;156(8):787-788. [ CrossRef ] [ Medline ]
  • Scholten RJ, Clarke M, Hetherington J. The Cochrane collaboration. Eur J Clin Nutr. Aug 28, 2005;59 Suppl 1(S1):S147-S196. [ CrossRef ] [ Medline ]
  • Warrens MJ. Kappa coefficients for dichotomous-nominal classifications. Adv Data Anal Classif. Apr 07, 2020;15(1):193-208. [ CrossRef ]
  • Law M, Stewart D, Letts L, Pollock N, Bosch J. Guidelines for critical review of qualitative studies. McMaster University Occupational Therapy Evidence-Based Practice Research Group. URL: https://www.canchild.ca/system/tenon/assets/attachments/000/000/360/original/qualguide.pdf [accessed 2024-04-05]
  • Higgins JP, Morgan RL, Rooney AA, Taylor KW, Thayer KA, Silva RA, et al. Risk of bias in non-randomized studies - of exposure (ROBINS-E). ROBINS-E tool. URL: https://www.riskofbias.info/welcome/robins-e-tool [accessed 2024-04-02]
  • Nait Aicha A, Englebienne G, van Schooten KS, Pijnappels M, Kröse B. Deep learning to predict falls in older adults based on daily-life trunk accelerometry. Sensors (Basel). May 22, 2018;18(5):1654. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Ribeiro NF, André J, Costa L, Santos CP. Development of a strategy to predict and detect falls using wearable sensors. J Med Syst. Apr 04, 2019;43(5):134. [ CrossRef ] [ Medline ]
  • Qiu H, Rehman RZ, Yu X, Xiong S. Application of wearable inertial sensors and a new test battery for distinguishing retrospective fallers from non-fallers among community-dwelling older people. Sci Rep. Nov 05, 2018;8(1):16349. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Dubois A, Bihl T, Bresciani JP. Automatic measurement of fall risk indicators in timed up and go test. Inform Health Soc Care. Sep 13, 2019;44(3):237-245. [ CrossRef ] [ Medline ]
  • Noh B, Youm C, Goh E, Lee M, Park H, Jeon H, et al. XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes. Sci Rep. Jun 09, 2021;11(1):12183. [ CrossRef ] [ Medline ]
  • Hauth J, Jabri S, Kamran F, Feleke EW, Nigusie K, Ojeda LV, et al. Automated loss-of-balance event identification in older adults at risk of falls during real-world walking using wearable inertial measurement units. Sensors (Basel). Jul 07, 2021;21(14):4661. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Lockhart TE, Soangra R, Yoon H, Wu T, Frames CW, Weaver R. Prediction of fall risk among community-dwelling older adults using a wearable system. Sci Rep. 2021;11(1):20976. [ CrossRef ]
  • Hsu YC, Weng HH, Kuo CY, Chu TP, Tsai YH. Prediction of fall events during admission using eXtreme gradient boosting: a comparative validation study. Sci Rep. Oct 08, 2020;10(1):16777. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Hu Y, Bishnoi A, Kaur R, Sowers R, Hernandez ME. Exploration of machine learning to identify community dwelling older adults with balance dysfunction using short duration accelerometer data. In: Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society. 2020. Presented at: EMBC '20; July 20-24, 2020;812-815; Montreal, QC. URL: https://ieeexplore.ieee.org/document/9175871 [ CrossRef ]
  • Ye C, Li J, Hao S, Liu M, Jin H, Zheng L, et al. Identification of elders at higher risk for fall with statewide electronic health records and a machine learning algorithm. Int J Med Inform. May 2020;137:104105. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Althobaiti T, Katsigiannis S, Ramzan N. Triaxial accelerometer-based falls and activities of daily life detection using machine learning. Sensors (Basel). Jul 06, 2020;20(13):3777. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Tunca C, Salur G, Ersoy C. Deep learning for fall risk assessment with inertial sensors: utilizing domain knowledge in spatio-temporal gait parameters. IEEE J Biomed Health Inform. Jul 2020;24(7):1994-2005. [ CrossRef ]
  • Kim K, Yun G, Park SK, Kim DH. Fall detection for the elderly based on 3-axis accelerometer and depth sensor fusion with random forest classifier. In: Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 2019. Presented at: EMBC '19; July 23-27, 2019;4611-4614; Berlin, Germany. URL: https://ieeexplore.ieee.org/document/8856698 [ CrossRef ]
  • Gillain S, Boutaayamou M, Schwartz C, Brüls O, Bruyère O, Croisier JL, et al. Using supervised learning machine algorithm to identify future fallers based on gait patterns: a two-year longitudinal study. Exp Gerontol. Nov 2019;127:110730. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Lo Y, Lynch SF, Urbanowicz RJ, Olson RS, Ritter AZ, Whitehouse CR, et al. Using machine learning on home health care assessments to predict fall risk. Stud Health Technol Inform. Aug 21, 2019;264:684-688. [ CrossRef ] [ Medline ]
  • Maray N, Ngu AH, Ni J, Debnath M, Wang L. Transfer learning on small datasets for improved fall detection. Sensors (Basel). Jan 18, 2023;23(3):1105. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Ladios-Martin M, Cabañero-Martínez MJ, Fernández-de-Maya J, Ballesta-López FJ, Belso-Garzas A, Zamora-Aznar FM, et al. Development of a predictive inpatient falls risk model using machine learning. J Nurs Manag. Nov 30, 2022;30(8):3777-3786. [ CrossRef ] [ Medline ]
  • Eichler N, Raz S, Toledano-Shubi A, Livne D, Shimshoni I, Hel-Or H. Automatic and efficient fall risk assessment based on machine learning. Sensors (Basel). Feb 17, 2022;22(4):1557. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Tang YM, Wang YH, Feng XY, Zou QS, Wang Q, Ding J, et al. Diagnostic value of a vision-based intelligent gait analyzer in screening for gait abnormalities. Gait Posture. Jan 2022;91:205-211. [ CrossRef ] [ Medline ]
  • Greene BR, McManus K, Ader LG, Caulfield B. Unsupervised assessment of balance and falls risk using a smartphone and machine learning. Sensors (Basel). Jul 13, 2021;21(14):4770. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Roshdibenam V, Jogerst GJ, Butler NR, Baek S. Machine learning prediction of fall risk in older adults using timed up and go test kinematics. Sensors (Basel). May 17, 2021;21(10):3481. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Dubois A, Bihl T, Bresciani JP. Identifying fall risk predictors by monitoring daily activities at home using a depth sensor coupled to machine learning algorithms. Sensors (Basel). Mar 11, 2021;21(6):1957. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Vo MT, Thonglor R, Moncatar TJ, Han TD, Tejativaddhana P, Nakamura K. Fear of falling and associated factors among older adults in Southeast Asia: a systematic review. Public Health. Sep 2023;222:215-228. [ CrossRef ] [ Medline ]
  • Torun E, Az A, Akdemir T, Solakoğlu GA, Açiksari K, Güngörer B. Evaluation of the risk factors for falls in the geriatric population presenting to the emergency department. Ulus Travma Acil Cerrahi Derg. Aug 2023;29(8):897-903. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Son NK, Ryu YU, Jeong HW, Jang YH, Kim HD. Comparison of 2 different exercise approaches: Tai Chi versus Otago, in community-dwelling older women. J Geriatr Phys Ther. 2016;39(2):51-57. [ CrossRef ] [ Medline ]
  • Sawa R, Doi T, Tsutsumimoto K, Nakakubo S, Kurita S, Kiuchi Y, et al. Overlapping status of frailty and fear of falling: an elevated risk of incident disability in community-dwelling older adults. Aging Clin Exp Res. Sep 11, 2023;35(9):1937-1944. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Calazans JA, Permanyer I. Levels, trends, and determinants of cause-of-death diversity in a global perspective: 1990-2019. BMC Public Health. Apr 05, 2023;23(1):650. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Kakara R, Bergen G, Burns E, Stevens M. Nonfatal and fatal falls among adults aged ≥65 years - United States, 2020-2021. MMWR Morb Mortal Wkly Rep. Sep 01, 2023;72(35):938-943. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Dostan A, Dobson CA, Vanicek N. Relationship between stair ascent gait speed, bone density and gait characteristics of postmenopausal women. PLoS One. Mar 22, 2023;18(3):e0283333. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Zheng Y, Wang X, Zhang ZK, Guo B, Dang L, He B, et al. Bushen Yijing Fang reduces fall risk in late postmenopausal women with osteopenia: a randomized double-blind and placebo-controlled trial. Sci Rep. Feb 14, 2019;9(1):2089. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Woelfle T, Bourguignon L, Lorscheider J, Kappos L, Naegelin Y, Jutzeler CR. Wearable sensor technologies to assess motor functions in people with multiple sclerosis: systematic scoping review and perspective. J Med Internet Res. Jul 27, 2023;25:e44428. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Abdollah V, Dief TN, Ralston J, Ho C, Rouhani H. Investigating the validity of a single tri-axial accelerometer mounted on the head for monitoring the activities of daily living and the timed-up and go test. Gait Posture. Oct 2021;90:137-140. [ CrossRef ] [ Medline ]
  • Mielke GI, de Almeida Mendes M, Ekelund U, Rowlands AV, Reichert FF, Crochemore-Silva I. Absolute intensity thresholds for tri-axial wrist and waist accelerometer-measured movement behaviors in adults. Scand J Med Sci Sports. Sep 12, 2023;33(9):1752-1764. [ CrossRef ] [ Medline ]
  • Löppönen A, Delecluse C, Suorsa K, Karavirta L, Leskinen T, Meulemans L, et al. Association of sit-to-stand capacity and free-living performance using Thigh-Worn accelerometers among 60- to 90-yr-old adults. Med Sci Sports Exerc. Sep 01, 2023;55(9):1525-1532. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • García-Soidán JL, Leirós-Rodríguez R, Romo-Pérez V, García-Liñeira J. Accelerometric assessment of postural balance in children: a systematic review. Diagnostics (Basel). Dec 22, 2020;11(1):8. [ FREE Full text ] [ CrossRef ] [ Medline ]
  • Leirós-Rodríguez R, García-Soidán JL, Romo-Pérez V. Analyzing the use of accelerometers as a method of early diagnosis of alterations in balance in elderly people: a systematic review. Sensors (Basel). Sep 09, 2019;19(18):3883. [ FREE Full text ] [ CrossRef ] [ Medline ]

Abbreviations

Edited by A Mavragani; submitted 28.11.23; peer-reviewed by E Andrade, M Behzadifar, A Suárez; comments to author 09.01.24; revised version received 30.01.24; accepted 13.02.24; published 29.04.24.

©Ana González-Castro, Raquel Leirós-Rodríguez, Camino Prada-García, José Alberto Benítez-Andrades. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 29.04.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

EASA publishes Artificial Intelligence Concept Paper Issue 2 ‘Guidance for Level 1 & 2 machine learning applications’

In a significant next step on its Artificial Intelligence (AI) Roadmap , the European Union Aviation Safety Agency (EASA) has published Issue 2 of its Concept Paper on Artificial Intelligence (AI) and Machine Learning (ML) . This second issue offers the potential to enhance four aviation pillars – safety, efficiency, sustainability, and passenger experience – and positions ML at the forefront of aviation innovation. At the same time, the path to ML deployment is bringing unique challenges, particularly in safeguarding operational safety.

This issue of the EASA AI Concept Paper refines the guidance for Level 1 AI applications (those enhancing human capabilities) and deepens the exploration of 'learning assurance', 'AI explainability' and 'ethics-based assessment'. These foundation concepts are crucial for the safe and trustworthy development and implementation of AI technologies in aviation.

Going one step further, this new issue provides comprehensive guidance for the development and deployment of Level 2 AI-based systems. Level 2 AI introduces the groundbreaking concept of 'human-AI teaming' (HAT), setting the stage for AI systems that automatically take decisions under human oversight. This advancement in the authority level of AI-based systems shows the need for human guidance and design principles to ensure safe 'human-AI interaction' (HAII).

Issue 2 of the EASA AI Concept Paper marks the entry of the EASA AI Roadmap into its second phase (framework consolidation), where Rulemaking Task (RMT).0742 will facilitate the integration of the anticipated guidance from the AI Concept Paper into a comprehensive framework of generic rules and acceptable means of compliance (AMC). These rules and AMC are precisely tailored to accommodate the unique requirements of each aviation domain impacted by these new technologies.

Overall, this new AI Roadmap deliverable underscores EASA's commitment towards a future where AI and ML are integrated in aviation's successes. This vision is not just about technological advancement, but mainly about building trust in AI applications, ensuring they complement human expertise and enhance overall aviation safety and sustainability.

EASA would like to thank all the stakeholders who participated in the public consultation phase and, in so doing, contributed to the maturity of this new publication.

EASA Artificial Intelligence Concept Paper Issue 2

COMMENTS

  1. Artificial intelligence and machine learning research: towards digital

    A variety of innovative topics are included in the agenda of the published papers in this special issue including topics such as: Stock market Prediction using Machine learning. Detection of Apple Diseases and Pests based on Multi-Model LSTM-based Convolutional Neural Networks. ML for Searching. Machine Learning for Learning Automata

  2. Machine Learning: Algorithms, Real-World Applications and Research

    Artificial intelligence (AI), particularly, machine learning (ML) have grown rapidly in recent years in the context of data analysis and computing that typically allows the applications to function in an intelligent manner [].ML usually provides systems with the ability to learn and enhance from experience automatically without being specifically programmed and is generally referred to as the ...

  3. Scientific discovery in the age of artificial intelligence

    Fig. 1: Science in the age of artificial intelligence. Scientific discovery is a multifaceted process that involves several interconnected stages, including hypothesis formation, experimental ...

  4. (PDF) Artificial intelligence and machine learning

    human intelligence concept to machines in its entirety as "the. ability of a machine to perform cognitive functions that we. associate with human minds, such as perceiving, reasoning, learning ...

  5. Machine Learning and Artificial Intelligence in Pharmaceutical Research

    Machine Learning and Artificial Intelligence in Pharmaceutical Research and Development: a Review ... In the remainder of this paper, we use "R&D" to generally describe the research, science, and processes associated with drug development, starting with drug discovery to clinical development and conduct, and finally the life-cycle ...

  6. Forecasting the future of artificial intelligence with machine learning

    The corpus of scientific literature grows at an ever-increasing speed. Specifically, in the field of artificial intelligence (AI) and machine learning (ML), the number of papers every month is ...

  7. Artificial intelligence, machine learning and health systems

    In these cases, deep learning algorithms have been able to uncover associations of predictive value, typically for a single use case, with large amounts of data and human expertise to curate the data and tune the algorithms involved [].These advances in machine learning are not a prototype for "artificial general intelligence": a broad general-purpose intelligence that can, like the human ...

  8. Artificial Intelligence and Machine Learning in Sport Research: An

    In the last two decades, artificial intelligence (AI) has transformed the way in which we consume and analyse sports. The role of AI in improving decision-making and forecasting in sports, amongst many other advantages, is rapidly expanding and gaining more attention in both the academic sector and the industry. Nonetheless, for many sports audiences, professionals and policy makers, who are ...

  9. Review of Artificial Intelligence and Machine Learning ...

    Artificial intelligence (AI) is an evolving set of technologies used for solving a wide range of applied issues. The core of AI is machine learning (ML)—a complex of algorithms and methods that address the problems of classification, clustering, and forecasting. The practical application of AI&ML holds promising prospects. Therefore, the researches in this area are intensive.

  10. Artificial Intelligence and Machine Learning

    The papers are clustered in parts on: Artificial Intelligence and Machine Learning; Data Security and information Security; Computer Networks and IoT. The papers present recent research and developments in artificial intelligence and its applications in machine learning, natural language processing, computer vision, robotics, and ethical ...

  11. AI and Machine Learning

    By: Christopher Stanton and Matt Higgins. Generative AI seemed poised to reshape the world of work, including the higher-wage, white-collar jobs typically pursued by MBA graduates. Informed by the latest research, this case explores generative AI's potential impacts on work, productivity, value creation, and...

  12. Artificial Intelligence, Machine Learning and Deep Learning (Literature

    The academicians and noble research scholars continu- ously penning/penned down on many interesting books comprising including Artificial Intelligence for Dummies (John Paul Mueller and Luca Massaron); Artificial Intelligence: A Modern Approach (Stuart Russell and Peter Norvig); Life 3.0: Being Human in the Age of Artificial Intelligence (Max Tegmark) and Machine Learning For Absolute ...

  13. Machine Learning: Algorithms, Real-World Applications and Research

    The learning algorithms can be categorized into four major types, such as supervised, unsupervised, semi-supervised, and reinforcement learning in the area [ 75 ], discussed briefly in Sect. " Types of Real-World Data and Machine Learning Techniques ". The popularity of these approaches to learning is increasing day-by-day, which is shown ...

  14. AI in learning: Preparing grounds for future learning

    In education and learning, applying AI to education is not new. The history of AI in learning research and educational applications goes back over 50 years (Minsky & Papert, 1968).In 2016, the researchers' panel report summarized (Stone et al., 2016) that there have been several common AI-related themes worldwide over the past several years, such as teaching robots, intelligent tutoring ...

  15. [2104.05314] Machine learning and deep learning

    Today, intelligent systems that offer artificial intelligence capabilities often rely on machine learning. Machine learning describes the capacity of systems to learn from problem-specific training data to automate the process of analytical model building and solve associated tasks. Deep learning is a machine learning concept based on artificial neural networks. For many applications, deep ...

  16. Artificial Intelligence And Machine Learning

    In the evolution of artificial Intelligence (AI) and machine learning (ML), reasoning, knowledge representation, planning, learning, natural language processing, perception, and the ability to move and manipulate objects have been widely used. These features enable the creation of intelligent mechanisms for decision support to overcome the limits of human knowledge processing. In addition, ML ...

  17. (PDF) Artificial Intelligence and Machine Learning: Improvements

    Artificial Intelligence and Machine Learning: Improvements, Encounters, and Future Predictions. By: Arthur Muhwezi, Dafiewhare Emmanuel, Able, Richard Twebaze. Acronyms. AI - Artificial ...

  18. Current trends in AI and ML for cybersecurity: A state-of-the-art survey

    This paper provides a comprehensive survey of the state-of-the-art use of Artificial Intelligence (AI) and Machine Learning (ML) in the field of cybersecurity. The paper illuminates key applications of AI and ML in cybersecurity, while also addressing existing challenges and posing unresolved questions for future research.

  19. Artificial intelligence, machine learning and deep learning

    It is increasingly recognized that artificial intelligence has been touted as a new mobile. Because of the high volume of data being generated by devices, sensors and social media users, machines can learn to distinguish patterns and make reasonably good predictions. This article will explore the use of machine learning and its methodologies. Furthermore, the field of deep ...

  20. Artificial intelligence and machine learning

    Within the last decade, the application of "artificial intelligence" and "machine learning" has become popular across multiple disciplines, especially in information systems. The two terms are still used inconsistently in academia and industry—sometimes as synonyms, sometimes with different meanings. With this work, we try to clarify the relationship between these concepts. We review ...

  21. Artificial Intelligence and Machine Learning Applications in Smart

    In this regard, thanks to intensive research efforts in the field of artificial intelligence (AI), a number of AI-based techniques, such as machine learning, have already been established in the ...

  22. MER 2024: Semi-Supervised Learning, Noise Robustness, and Open

    Multimodal emotion recognition is an important research topic in artificial intelligence. Over the past few decades, researchers have made remarkable progress by increasing dataset size and building more effective architectures. However, due to various reasons (such as complex environments and inaccurate labels), current systems still cannot meet the demands of practical applications ...

  23. Machine Learning and Artificial Intelligence: Definitions, Applications

    Introduction. Originally coined in the 1950s, the term "artificial intelligence" began as the simple theory of human intelligence being exhibited by machines [1•]. In 1976, Jerrold S. Maxmen foretold that artificial intelligence (AI) would bring about the "post-physician era" in the twenty-first century [2, 3]. In today's era of rapid technological advancement and ...

  24. Artificial intelligence and machine learning in spine research

    Abstract. Artificial intelligence (AI) and machine learning (ML) techniques are revolutionizing several industrial and research fields like computer vision, autonomous driving, natural language processing, and speech recognition. These novel tools are already having a major impact in radiology, diagnostics, and many other fields in which the ...

  25. Applications of artificial intelligence in the AEC

    This paper aims to achieve the following objectives: (1) Identify the publication trends over the past two decades in research related to artificial intelligence in the AEC industry, including the volume of publications, key research authors, their affiliations, countries of origin, collaborative relationships, and prominent publishing journals.

  26. The Application of Artificial Intelligence Deep Learning to Visually

    Recent advances in Artificial Intelligence (AI) are changing the world. Novel approaches to training AI systems have led to dramatic reductions in the time required. With traditional methods, training an AI system could take years and teams of people, but with advances in Deep Learning (DL) models this training can now be accomplished by an individual in a matter of minutes.

  27. Journal of Medical Internet Research

    Background: Falls and their consequences are a serious public health problem worldwide. Each year, 37.3 million falls requiring medical attention occur. Therefore, the analysis of fall risk is of great importance for prevention. Artificial intelligence (AI) represents an innovative tool for creating predictive statistical models of fall risk through data analysis.
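
    As a hypothetical illustration of such a predictive model (not the model from the study above; the feature names and data are invented for illustration, and scikit-learn is assumed), a fall-risk classifier on tabular data could look like this:

```python
# Hypothetical sketch: a logistic-regression fall-risk model on synthetic tabular data.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(1)
n = 1000
age = rng.normal(75, 8, n)              # years
gait_speed = rng.normal(1.0, 0.2, n)    # metres per second
prior_falls = rng.integers(0, 3, n)     # falls in the previous year

# Synthetic outcome: risk rises with age and prior falls, drops with gait speed.
logit = 0.05 * (age - 75) - 2.0 * (gait_speed - 1.0) + 0.8 * prior_falls - 1.0
fell = rng.random(n) < 1.0 / (1.0 + np.exp(-logit))

X = np.column_stack([age, gait_speed, prior_falls])
X_train, X_test, y_train, y_test = train_test_split(X, fell, random_state=0)

model = LogisticRegression().fit(X_train, y_train)
example_risk = model.predict_proba(X_test[:1])[0, 1]   # estimated fall probability
print(f"held-out accuracy: {model.score(X_test, y_test):.2f}, example risk: {example_risk:.2f}")
```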

  28. V12-01 Use of Artificial Intelligence and Machine Learning to Develop

    Nelson N. Stone, Steven H. Griffith, Eric Delmonico, Michael Wilson, Laura Kim and Jonathan J. Stone, "V12-01 Use of Artificial Intelligence and Machine Learning to Develop an Education System for Instructorless Surgical Training", Journal of Urology, 2024.

  29. EASA publishes Artificial Intelligence Concept Paper Issue 2 'Guidance

    In a significant next step on its Artificial Intelligence (AI) Roadmap, the European Union Aviation Safety Agency (EASA) has published Issue 2 of its Concept Paper on Artificial Intelligence (AI) and Machine Learning (ML). This second issue covers the potential of these technologies to enhance four aviation pillars - safety, efficiency, sustainability, and passenger experience - and positions ML at the ...

  30. Artificial Intelligence and Machine Learning in Sport Research: An

    We provide a summary of some relevant research literature on the areas in which artificial intelligence and machine learning have been applied in the sports industry and in sport research. Finally, we present some hypothetical scenarios of how AI and ML could shape the future of sports. ... There are numerous research papers in which AI and ML ...