Social network link prediction techniques for smart urban infrastructure management: Factors,  parameters,  and applications

Mukesh Kumar; Kulvinder Singh; Sanjeev Dhawan

doi:10.22712/susb.20260009

Preview

General Article

International Journal of Sustainable Building Technology and Urban Development. 31 March 2026. 138-153
https://doi.org/10.22712/susb.20260009

Social network link prediction techniques for smart urban infrastructure management: Factors, parameters, and applications

Mukesh Kumar¹^*

Kulvinder Singh²

Sanjeev Dhawan²

¹Research Scholar, Department of Computer Science and Engineering, University Institute of Engineering and Technology (UIET), Kurukshetra University, Kurukshetra, India

²Associate Professor, Department of Computer Science and Engineering, University Institute of Engineering and Technology (UIET), Kurukshetra University, Kurukshetra, India

^{*Corresponding Author}

License (open-access, https://creativecommons.org/licenses/by-nc/4.0/):

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non- commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

ABSTRACT

A social network represents a dynamic system of interpersonal interactions among individuals, communities, or organizations. These networks support diverse activities such as information sharing, public communication, service coordination, and citizen engagement. In smart urban environments, social networks play a crucial role in understanding human behavior, optimizing resource usage, and enhancing service delivery. However, one of the major challenges is to identify and strengthen meaningful connections among users that facilitate efficient information flow and improved system performance. Link prediction addresses this challenge by estimating the likelihood of future or missing connections between currently unconnected nodes. It contributes to several smart city applications, including community-based decision making and intelligent transportation systems. The objective of this work is to review state-of-the-art techniques used for link prediction in social networks ranging from similarity-based heuristics to advanced machine learning methods and to analyze the key factors and parameters influencing the prediction of missing links within smart urban systems.

Keywords

community detection

urban connectivity patterns

missing references

similarity score

smart urban sustainability

MAIN

Introduction
Link Prediction Problem
Applications of Link Prediction Techniques
Literature Review
Similarity Score based Link Prediction Techniques
Supervised Machine Learning Based Link prediction Methods
Other Link Prediction Approaches
Research Gaps
Factors and Parameters Affecting the Missing Link Prediction in Social Network
Conclusion

Introduction

A social network is a structure composed of social actors, dyadic ties, and interactions among these actors [1]. It serves as a representation of the social hierarchy among the many social units. An individual or an organization might be considered a social entity. Social entities, also known as nodes, are linked to further nodes through associations, likewise recognized as edges, in online social networks [2]. The presence of an association or communication among the nodes is shown by these linkages [3]. A model social set-up is shown in Figure 1, with nodes standing in for social entities and connections for the interactions between them.

https://cdn.apub.kr/journalsite/sites/durabi/2026-017-01/N0300170108/images/Figure_susb_17_01_08_F1.jpg

Figure 1.

Illustration of a sample social network where nodes represent social entities and edges denote interactions between them [1].

The use of online social systems has increased dramatically in the modern era. Millions of people have used it as a platform to communicate with one another about their ideas, opinions, and views [4]. In addition, political awareness initiatives, blogging, reviews, and marketing are conducted on this site [5]. Social networking platforms such as Facebook, Instagram, Twitter, and WhatsApp have become indispensable in daily life [6]. Social networks are inherently quite dynamic since they continue to expand (or change) throughout time [7]. A social system’s primary objective is to maximize the number of connections since this promotes efficient use of the services it offers and guarantees that information spreads quickly throughout the network [8].

Examining the relationships between two particular nodes will help to simplify this problem. The link prediction difficulty is the task of determining which relations among unconnected nodes in a social system are most likely to occur. It calculates the likelihood that two currently separated nodes may establish a link. Based on the nodes and connections that are currently in place, the chance is estimated. It is conceivable that a user is not linked to someone on a social network that he knows in real life. Therefore, in this case, the objective of the link prediction approach is to suggest a association among two people in the specified social network who are currently unconnected.

An essential component of all social networking sites is link prediction. The aim of link prediction techniques is to create as many associations as they can. Even if a completely connected network is unattainable, there are always techniques to increase the likelihood that users who are not connected can connect. The several domains in which link prediction algorithms are applied are revealed in Table 1.

Table 1.

Major application areas of link prediction techniques across different domains

System Type	Interaction Prediction
Social	Friendships Collaborations (or Co-authorship) Collusion Community Detection Privacy Control Expert Detection Influence Detection
Biological	Protein-protein interactions in biological processes Food webs - how different organisms interact with each other and their environment Medical Referral Systems Disease Prediction
Information Systems	User-Item interactions in recommender systems

This paper’s remaining sections are arranged as follows. The mathematical formulation of the link prediction issue is described in section 2. A variety of link prediction application domains are shown in section 3. The literature review is presented in section 4. Factors and parameters affecting the absent link prediction in social system are discussed in section 5, and the article is concluded in section 6.

Link Prediction Problem

An enormous graph can be used to depict an online social network, with each node (also known as a vertex) representing an individual and each edge (also known as a link) representing the relationship between two individuals.

A symmetric adjacency matrix A ∈ {0,1} of dimension N × N can be used to depict an undirected network with N nodes, where A_ij = A_ji = 1 specifies the presence of an edge among nodes i and j. Associations in online social systems are often constructed on shared interests between participants.

The connection estimate difficulty in online social systems may be described mathematically as follows: Let G is a graph with V denoting the group of persons and E the set of edges. The link among users x and y at any time instance, t₀, is represented by an edge, e (x,y) ∈ E. G[t₀] is the name given to the graph at time t₀. Link prediction aims to identify edges that are absent at time t₀ but are likely to appear at a later time t₁ (t₁ > t₀). An illustration of a link prediction issue for a specific set-up is revealed in Figure 2. Six vertices and their connections (shown by solid lines) at time instance t₀ are shown in Figure 2(a).

https://cdn.apub.kr/journalsite/sites/durabi/2026-017-01/N0300170108/images/Figure_susb_17_01_08_F2.jpg

Figure 2.

Example illustrating the link prediction problem by showing existing and potential links between nodes at two different time instances [9].

Finding highly likely linkages between the unconnected node pairs (shown by dotted lines in Figure 2(b)) at time instance t₁ is the goal of the link prediction approach. In an online social system, the link prediction difficulty is commonly considered as a twofold classification difficulty [9, 10]. One way to conceptualize the linkage forecast concern is as a challenge of classifying pairs of unconnected nodes as either “connected” or “not connected.” Usually, it is based on a resemblance score that is considered for every disconnected node duos in the specified network [11].

There is a significant coincidental that detached nodes with equal similarity scores may eventually link. Given the high grade of dynamic nature of social systems, link prediction, also known as forecasting, is a computationally difficult problem. The social network is constantly adding new connections and members.

Two categories can be applied to categorize link prediction issues centred on data availability:

a) future (or temporal) link prediction, or

b) disappeared link prediction

Future link prediction is the procedure of forecasting network structure at timestamp t_k based on data available at timestamp t_i if temporal data is available for two separate timestamps, t_i and t_k (where t_k > t_i). The link prediction concern, on the other side, may be treated as missing link prediction if temporal data is not provided. In this case, one can anticipate the present network topology by constructing random missing links. Since there are millions of nodes in an online social network, managing data becomes challenging when attempting to anticipate links. The adding or removal of nodes and edges on a regular basis causes a network’s topological features to change. These modifications suggest that, as a result of new link formation in online social systems, it is essential to investigate the dynamics of system topological features. The existence of class skewness (or imbalance) in the link prediction job is another problem. On Facebook, every user may connect to 1.35 billion other users [12], yet each person only links to a few hundred nodes on average in the social network. There is a class disparity in the number of connections that are made and those that are not because of how social networks link people. Due to the computational impossibility of predicting every lost link in a network this size, the number of node pairs that should be employed for experimental assessment must be reduced. Because social networks are self-similar by nature, research done on a subset of the system may be applied to the whole system.

Applications of Link Prediction Techniques

Numerous application domains, including protein- protein interaction prediction, automatic hyperlink prediction, recommendation system construction, inferring complete networks from partial networks [13], detecting anomalous communications [14], predicting possible collaborators [15], and so on, are closely associated with link prediction.

Recommender Systems: A recommender scheme, also recognised as a commendation arrangement, is a sort of data mining arrangement that recommends a product to the consumer centred on the user’s past ratings or purchasing habits. Amazon, Flipkart, Myntra, and other commercial apps are the main uses for these platforms. Different link prediction algorithms are available for customized recommendations [16].

Medical Referral System: The procedure of sending patients to doctors with particular specializations is known as the medical referral system [17]. When a patient becomes unwell, they consult a general practitioner in different nations. The patient is sent to a specialist practitioner who specializes in the patient’s ailment type if the general practitioner is unable to resolve the condition. Finding the correct expert has gotten more difficult, though, as healthcare systems have grown. Finding the right specialist might take a long time at times, and this can have detrimental effects. Furthermore, patients could be directed to experts who are fully booked and unable to accept new patients. Link prediction has been used to forecast which specialists would likely receive recommendations in the future, which has helped to tackle the problem of medical referrals. Using SVM classifier for link prediction in medical referral systems, Almansoori et al. [17] achieved prediction accuracy of up to 92%.

Spam Mail Detection: Spam, often known as junk email, is any unsolicited email communication that is typically delivered in large quantities. Emails that are unsolicited by the recipient are known as spam. It’s critical to monitor traffic in a variety of security applications in order to spot any odd communication. Techniques for link prediction have been used to identify unusual vertices [18] and spam emails [19].

Community Discovery: The aim of community identification is to divide a whole social network into a collection of modules. These modules consist of clusters with extremely dense node connections [20][21]. Applications for community detection may be found in a number of areas, including product suggestion and review gathering. Many publications have employed link prediction algorithms for network evolution prediction and community discovery [22].

Privacy Control in Social Systems: The increasing usage of social media spots has made it more important than ever to discover reliable people. On these social networking sites, users frequently disclose a lot of personal and private information, including phone numbers, email addresses, and images. Making ensuring that data is protected from being exploited by any untrustworthy people is crucial. Link prediction tools are useful in identifying reliable individuals. This will protect user privacy on social media platforms. Aloufi [23] identified a local group of reliable individuals in a network by using link prediction algorithms.

Identifying Missing References in a Publication: Academic dishonesty and ethical violations are caused by plagiarism. Typically, an article has connections to a number of other articles. Some allusions, nevertheless, could be overlooked. Through the use of author and keyword matching, link prediction systems can assist in locating missing references within a publication [24].

Routing in Networks: Link prediction may be used to monitor traffic over many communication channels in a variety of security and network applications. In order to quickly discover communications those are odd (or aberrant) and warrant more examination, monitoring is necessary. In order to enhance routing efficiency, network applications can also utilize link prediction to find the best routes. Regular network outages have a negative impact on mobile communication quality. Weiss et al. [25] and Yadav et al. [26] suggested approaches for utilizing link prediction for routing in order to estimate signal strength. Hu and Hou [27] suggested traffic prediction methods to enhance packet routing in wireless networks.

Expert Detection: Research partnerships frequently provide high-quality outcomes. It might be challenging to locate a specialist researcher in one’s own field of expertise. Link prediction approaches may be employed in co-authorship networks to classify domain specialists given a network including several researchers and their relationships (in terms of co-authorship) working in distinct areas [28, 29]. Liu & Ning [30] ranked the applicants for high-level government positions using the link prediction approach.

Influence Detection: Analysing which users in the system have the greatest persuasive power is often crucial. For example, powerful customers can have a big impact on product sales. Link prediction algorithms were employed by Cervantes et al. [31] to classify the prominent nodes inside a specific system. In a comparable vein, Nguyen et al. [32] employ link prediction algorithms to determine a user’s impact and personality attributes.

Disease Prediction: Patients’ illnesses can be predicted via link prediction. Patients and disorders were arranged in a social network by Folino et al. [33]. The diseases are characterized as nodes in the system, and the simultaneous presence of two sicknesses in a patient is shown by an edge connecting them. The illnesses that might strike a patient given his present state of health are predicted via link prediction algorithms.

Literature Review

The first research discussed in this review is the major research aimed at similarity score based heuristics applied to link prediction in social networks. It then looks at machine learning-based methods which have gained immense traction to enhance accuracy of predictions. Other methods of link prediction used in the literature are also described to have a wider outlook of the field. Given that the majority of the available literature can be classified into the similarity- based and machine learning-based approaches, they are highlighted in the context of the review. Lastly, some areas of research gaps that have been noted amongst the existing literature are revealed to steer future developments in predicting social network links.

Similarity Score based Link Prediction Techniques

Some of the most important elements in the territory of online social systems are link prediction and suggestion. In recent decades, it has been a relevant area of study [34, 35, 36]. “Individuals You May Need to Employ” on LinkedIn, “You May Recognize” on Google+, and “Individuals You May Recognize” (PYMK) on Facebook are a few well-known instances of link prediction. Finding the similarity (or proximity) score between disconnected pairs of nodes is an essential first step in connection prediction. The recommendation of whether or not to recommend a connection to disconnected nodes is based on this score. The similarity score determines whether u and v are connected or not. A variety of link prediction techniques are available that govern the resemblance score among a given pair of nodes. These methods can be divided into groups according on various heuristics. Similarity- based link prediction techniques were divided into three groups by Martinez et al. [37]: local methods [38, 39, 40], global methods [41] and quasi-local methods [42]. To compute the resemblance mark among two nodes, local similarity score approaches rely on structural features based on the neighborhoods of the nodes [43]. When calculating similarity, these methods only consider neighbors that are directly shared. Common Neighbor, Jaccard Index, Salton Index, Adamic Adar Index, Preferential Attachment Index, Resource Allocation Index, and Sorensen Index are a few frequently used node neighborhood based local similarity score computation approaches [44, 45, 46]. Conversely, global approaches compute similarity by utilizing the chains of neighbors that exist between two nodes. This kind of similarity measurement technique has a high computing complexity and is very noisy. The Katz Index [47], SimRank [48], Blondel Index [49], and Global Leicht Holme Newman Index [50] are a few often used global similarity score heuristics. When calculating similarity, quasi-local approaches employ neighbors-of-neighbors or neighbors with limited lengths. Several frequently employed techniques for quasi-local similarity scores include local path index, friendlink and local random walks [51].

Local similarity score grounded approaches are the best extensively used of the three similarity calculation methods for online social network link prediction. These are straightforward in that they find the similarity score between disconnected node pairs using common shared properties, which are typically common shared nodes. Since these are the quickest link prediction methods, they can be used in large parallel applications. These methods also achieve low computational cost and good prediction accuracy [52]. An online community system is frequently treated as a graph for the purposes of experimental measurement and analysis. An edge (or link) in the network denotes the associations between the consumers, whereas a vertex in the graph represents each individual user. The presence and lack of linkages between the vertices are displayed visually via the adjacency matrix [53]. Increasing the total of relations between disconnected vertices in the given system is the primary goal of the link prediction technique. To determine a connection value among unconnected nodes, a variety of heuristics are employed by each link prediction technique. Local similarity score heuristics use node degree or the quantity of shared nodes as a criterion when calculating similarity scores. The network statistics serve as the main basis for selecting prediction methods.

The following is a summary of the prominent node neighborhood-based similarity score heuristics for link prediction:

1. Common Neighbors (CN): This metric calculates the closeness of two nodes in a given network by counting all of their shared neighbors. For collaboration networks, Newman [54] calculated the resemblance mark metric among two nodes by means of mutual neighbors. Equation 1 can be used to calculate the similarity score using Common Neighbors in the following way:

(1)

Sim (u; v) = |T_{u} \cap T_{v}|

Where T_u and T_v stand for the group of nodes that a paired group of nodes, u and v, respectively, has as neighbors. When disconnected users, u and v, have a high similarity score at time instance t₀, there is a strong likelihood that they will likely reconnect at time instance t₁, where t₀ <= t₁.

2. Jaccard Index (JI): The Jaccard established the Jaccard Index (also recognized as the Jaccard Constant (JC)) more than a century ago, in 1901 [55]. The similarity over diversity is measured by this metric. In information retrieval, it is also known as a similarity metric. Equation 2 can be used to define the Jaccard Index as follows:

(2)

Sim (u; v) = |T_{u} \cap T_{v}| / |T_{u} \underset{\cdot}{\cup} T_{v}|

Based on the likelihood that a neighbor of node u or node v has a high chance of becoming a neighbor of nodes u and v, the resemblance mark is planned.

Researchers have used heuristics based on similarity scores in the past to identify likely relationships in a variety of applications. A link prediction algorithm constructed on keyword matching was suggested by Bhattacharya et al. [56]. The suggested method used text similarity to calculate the resemblance among node pairs. They discovered that the average similarity score drops as the amount of frequent common neighbors and keywords rises. They also discovered that, independent of the topological measures, user similarity ratings are comparable, with the exception of direct links. Zhou et al. [57] employed a variety of heuristics based on similarity scores to anticipate links across six distinct networks. Using various similarity score algorithms, they were able to achieve average performance up to 93.3% in footings of region below the ROC curve.

A summary of works on link prediction approaches based on similarity scores may be found in Table 2. Based on previous research on different similarity score heuristics for link prediction, we discovered that while a range of network topological property indices and link prediction approaches have been studied in the works; no study has looked at the relationship between topological property indices and link prediction approaches. Furthermore, we discovered that the methods currently in use for predicting links based on similarity scores either use node construction or node attribute information. No link prediction method that combines the use of node construction and profile data for link prediction is currently known to exist in the literature. Additionally, we found that threshold values are required for similarity score-based link prediction techniques in order to make decisions. It is challenging to determine the threshold value, nevertheless, at which link prediction can be carried out because huge social networks have a very dynamic structure. Every customer in an online social system has a unique combination of traits; therefore choosing a common cutoff point to determine if two detached users will reconnect is difficult for the entire social network (which has thousands of egos).

Table 2.

Different Techniques used for Similarity Score-based Link Prediction

Author(s)	Year	Heuristic	Features	Dataset	Best Performing Similarity Score Heuristic
Chen et al. [58]	2005	Common Neighbor, Jaccard Index, Node Book Sales Preferential Attachment Adamic Adar Index, Neighborhood Index Preferential Attachment Index, Katz Index, Local Path Index	Node Neighborhood	Book Sales	Resource Allocation Index
Murata & Moriyasu [59]	2007	Common Neighbor, Adamic Adar Index, Preferential Attachment Index	Node Neighborhood	Yahoo! Chiebukuro	Adamic Adar Index
Song et al. [60]	2009	Common Neighbor, Adamic Adar Index, Preferential Attachment Index, Katz Index	Node Neighborhood Graph Distance	Flickr, Digg YouTube, MySpace Wikipedia LivJournal	Different heuristics for each dataset
Izudheen & Mathew [61]	2016	Common Neighbor, Jaccard Index, Adamic Adar Index, Preferential Attachment Index	Node Neighborhood	PPI (MINT)	Preferential Attachment Index
Lu et al. [62]	2017	Common Neighbor, Adamic Adar Index, Preferential Attachment Index, Katz Index	Node Neighborhood	MATADAOR	Common Neighbor
Tariq et al. [63]	2019	Common Neighbor, Jaccard Index, Adamic Adar Index	Node Neighborhood	Facebook	Jaccard Index, Adamic Adar Index
Hao Tian & Reza Zafarani [64]	2020	Common Neighbor variants, Jaccard Index	Proposed a γ-decay model generalizing neighborhood- based heuristics for improved link prediction in large networks.	Social and collaboration networks	γ-decay enhanced Common Neighbor
Tillman et al. [65]	2020	Extended CN, Jaccard Index, Preferential Attachment	Designed a unified heuristic framework for multiplex networks (multiple types of edges).	Scientific collaboration, trade, transportation	Proposed multiplex heuristic
Wang et al. [66]	2020	Motif-based Heuristic, CN, Adamic Adar	Combined local motifs with classical heuristics to capture higher-order structure.	Social, biological, academic networks	Motif-based heuristic
Govind Sharma et al. [67]	2021	CN, Adamic Adar	Studied higher-order relations affecting heuristic accuracy; analyzed bias in CN and AA.	Synthetic and real-world graphs	CN (noted to overestimate links)
Haji Gul et al. [68]	2022	Matrix-Forest Metric, CN, AA	Integrated local similarity and matrix-forest index to enhance prediction reliability.	Complex real-world networks	Matrix-Forest Metric
Yun et al. [69]	2022	Neo-GNN (heuristic-integrated)	Incorporated structural overlap heuristics (CN, Jaccard) inside GNN architecture for robustness.	Open Graph Benchmark (OGB)	Neo-GNN (heuristic-enhanced model)
Nirmaljit Singh & Ikvinderpal Singh [70]	2023	CN, AA, Resource Allocation, Preferential Attachment	Compared classical heuristics on wireless multiplex networks.	Wireless network datasets	Resource Allocation
Zhang et al. [71]	2023	Heuristic Learning GNN (HL-GNN)	Unified local/global heuristics through matrix-based GNN learning.	Planetoid, Amazon, OGB datasets	HL-GNN (learned heuristic model)
Y.V. Nandini et al. [72]	2024	Average Centrality Similarity	Combined degree, betweenness, closeness, and clustering centralities into a new similarity metric.	Real-world complex networks	Average Centrality Similarity
Puneet Kapoor et al. [73]	2024	Heuristic-based Feature Learning	Integrated heuristic and embedding features for heterogeneous networks.	Social, citation, biological networks	Learned hybrid heuristic features
Zhou, Wan & Du [74]	2025	Information Entropy Common Neighbor (IECNC)	Extended CN by including entropy to measure information uncertainty in neighborhoods.	Social and biological networks	IECNC (entropy- enhanced CN)
Pandey S.D. et al. [75]	2025	Strength Prominence (SP) Index	New similarity index combining tie strength and node prominence, works without common neighbors.	Fuzzy social networks	SP Index
La Cava et al. [76]	2025	Mixture of Experts (MoE-ML-LP)	Integrated multiple heuristic-informed experts for multilayer networks.	Real-world multilayer networks	MoE-ML-LP (ensemble heuristic model)

Supervised Machine Learning Based Link prediction Methods

A cumulative similarity mark is calculated by similarity score established link prediction approaches using either communal shared attributes or the neighborhood of the node. When predicting missing links, it ignores the connectivity pattern between linked pairs. On the other side, machine learning centered link prediction approaches predict a class label based on the connection pattern among linked nodes (number of common neighbors or shared profile traits). Link prediction in online social systems can be viewed as a twofold machine learning classification challenge.

Hasan et al. [77] suggested using machine learning classification for absent link prediction in co-authorship systems as a resolution to the link prediction difficulty. Across two distinct co-authorship systems, a collection of supervised machine learning classification methods is applied to predict whether or not the two authors would collaborate on a paper in the future. This study examines two co-authorship networks: BioBase and DBLP. In a comparison of machine learning classifiers for link prediction, Hasan et al. [77] examined Decision Tree, SVM, KNN, Naïve Bayes, and Multilayer Perceptron. They discovered that all of these classifiers could solve the link prediction problem with a reasonable degree of accuracy. They did not, however, assess how well machine learning classifiers performed in comparison to several traditional similarity score algorithms. Benchettara et al. [78] expanded on Hasan et al.’s work for link prediction in the user-item purchase network of websites that engage in electronic commerce (E-Com). There is a bipartite graph in the E-Com network with two different kinds of nodes: users and products. If a user has bought an item, then the user and the item are connected. The use of an “indirect” feature for link prediction was demonstrated in the article. A feature for all users who bought a common item is the use of a similar item purchase count. It has been demonstrated that link prediction performance was enhanced by adding indirect features to the training set. O’ Madadhain et al. [79] utilized the logistic regression technique to forecast the likelihood of connections between various node pairings. They made use of network data gathered via emails from Enron, calls to AT&T, and articles from CiteSeer. Gong et al. [80] implemented link prediction utilizing support vector machine (SVM) technology on the Google+ website. Liu et al. [81] Using deep belief networks (DBN) for signed network missing link prediction. Every link in a signed network has a sign, either positive or negative. There is a positive indication on the edge between two users if they support or agree with each other. Conversely, there is a negative symbol on the edge between two users if they oppose or disagree with one another. They discovered that the effective prediction of link development in signed networks may be achieved through the application of deep belief networks. Scellato et al. [82] employed a variety of supervised learning methods for link prediction in location-based networks, including Naïve Bayes (NV), Random Forest (RF), Decision Tree (DT), and J48. For link prediction, they looked into place-based features like the overall number of common check-in locations, the percentage of common places between two users from all the places they have checked in, the total number of check-ins at a location, etc. If there are few check-ins, a common place is deemed to have high importance. The utilization of place information enhances the machine learning technique’s prediction ability, according to the findings of the scientists. Tasnádi & Berend [83] assessed the application of supervised machine learning algorithm for Yelp.com, a restaurant review platform. In the restaurant review network, they predicted links using greatest entropy.

References

Wikipedia Contributors, Wikipedia, The Free Encyclopedia: Social Network [Online], 2004. Available at: https://en.wikipedia.org/wiki/Social_network [Accessed 28/11/2025].

D. Singla, D. Gupta, and N. Goyal, Sustainable basil leaf disease classification: Benchmarking seven deep learning models using transfer learning for urban and rural farming. International Journal of Sustainable Building Technology and Urban Development. 16(1) (2025), pp. 141-157.

P. Gupta, K.K. Bhatia, and N. Duhan, A Socio-economic cost-effective budget allocation framework for real-time bidding in online advertisement for urban development. International Journal of Sustainable Building Technology and Urban Development. 16(2) (2025), pp. 234-250.

S. Ressler, Social network analysis as an approach to combat terrorism: Past, present, and future research. Homeland Security Affairs. 2(2) (2006), pp. 1-10.

E.M. Airoldi, D.M. Blei, S.E. Fienberg, E.P. Xing, and T. Jaakkola, Mixed membership stochastic block models for relational data with application to protein-protein interactions. Proc. International Biometrics Society Annual Meeting. 15 (2006), pp. 1-34.

A. Sharma, N. Kumar, C. Diwaker, B. Sharma, R. Baniwal, S.B. Bhattacharjee, and S. Rani, A Machine learning-based framework for energy-efficient load balancing in sustainable urban infrastructure and smart buildings. International Journal of Sustainable Building Technology and Urban Development. 15(4) (2024), pp. 498-512.

X. Li and H. Chen, Recommendation as link prediction: A graph kernel-based machine learning approach. Proc. 9th ACM/ IEEE-CS Joint Conference on Digital libraries, ACM. (2009), pp. 213-216.

10.1145/1555400.1555433

R. Sharma and A. Kamra, Enhancing diagnosis of breast cancer through mammographic image segmentation using Fuzzy C-Means. International Journal of Sustainable Building Technology and Urban Development. 14(4) (2023), pp. 488-499.

M. Hasan Al and M.J. Zaki, A survey of link prediction in social networks, Social Network Data Analytics. 2011, Springer, pp. 243-275.

10.1007/978-1-4419-8462-3_9

D.D. Lee and H.S. Seung, Algorithms for non-negative matrix factorization. Proc. Advances in Neural Information Processing Systems. (2001). pp. 556-562.

Y.-J. Wu, E. Levina, and J. Zhu, Link prediction for egocentrically sampled networks. arXiv preprint arXiv: 1803.04084. (2018).

M. Eck, Y. Zemlyanskiy, J. Zhang, and A. Waibel, Extracting translation pairs from social network content. Proc. International Workshop on Spoken Language Translation (IWSLT). (2014). pp. 200-205.

M. Kim and J. Leskovec, The network completion problem: Inferring missing nodes and edges in networks. Proc. SIAM International Conference on Data Mining, SIAM. (2011), pp. 47-58.

10.1137/1.9781611972818.5

K. Jahanbakhsh, V. King, and G.C. Shoja, Predicting missing contacts in mobile social networks. Pervasive and Mobile Computing. 8(5) (2012), pp. 698-716.

10.1016/j.pmcj.2012.07.007

J. Mori, Y. Kajikawa, H. Kashima, and I. Sakata, Machine learning approach for finding business partners and building reciprocal relationships. Expert Systems with Applications. 39(12) (2012), pp. 10402-10407.

10.1016/j.eswa.2012.01.202

N. Talasu, A. Jonnalagadda, S.S.A. Pillai, and J. Rahul, A link prediction based approach for recommendation systems. Proc. International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE. (2017). pp. 2059-2062.

10.1109/ICACCI.2017.8126148

W. Almansoori, S. Gao, T.N. Jarada, A.M. Elsheikh, A.N. Murshed, J. Jida, R. Alhajj, and J. Rokne, Link prediction and classification in social networks and its application in healthcare and systems biology. Network Modeling Analysis in Health Informatics and Bioinformatics. 1(1-2) (2012), pp. 27-36.

10.1007/s13721-012-0005-7

D. Kagan, Y. Elovichi, and M. Fire, Generic anomalous vertices detection utilizing a link prediction algorithm. Social Network Analysis and Mining. 8(1) (2018), pp. 1-13.

10.1007/s13278-018-0503-4

Z. Huang and D.D. Zeng, A link prediction approach to anomalous email detection. Proc. IEEE International Conference on Systems, Man and Cybernetics, IEEE. (2) (2006), pp. 1131-1136.

10.1109/ICSMC.2006.384552

A.P. Appel, R.L. Cunha, C.C. Aggarwal, and M.M. Terakado, Temporally evolving community detection and prediction in content-centric networks. Proc. Joint European Conference on Machine Learning and Knowledge Discovery in Databases. (2018), pp. 3-18.

10.1007/978-3-030-10928-8_1

H.-M. Cheng, Y.-Z. Ning, Z. Yin, C. Yan, X. Liu, and Z.-Y. Zhang, Community detection in complex networks using link prediction. Modern Physics Letters B. 32(1) (2018), 1850004.

10.1142/S0217984918500045

W. Yu, C.C. Aggarwal, and W. Wang, Temporally factorized network modeling for evolutionary network analysis. Proc. 10th ACM International Conference on Web Search and Data Mining, ACM. (2017), pp. 455-464.

10.1145/3018661.301866928626845PMC5470848

S. Aloufi, Trust-aware Link Prediction in Online Social Networks. Ph.D. Thesis, University of Ottawa/ University of Ottawa, 2012.

M. Kc, R. Chau, M. Hagenbuchner, A.C. Tsoi, and V. Lee, A machine learning approach to link prediction for interlinked documents. Proc. International Workshop of the Initiative for the Evaluation of XML Retrieval. (2009), pp. 342-354.

10.1007/978-3-642-14556-8_34

E. Weiss, K. Kurowski, S. Hischke, and B. Xu, Avoiding route breakage in ad hoc networks using link prediction. Proc. 9th IEEE Symposium on Computers and Communications (ISCC), IEEE. (2003), pp. 57-62.

10.1109/ISCC.2003.1214101

A. Yadav, Y.N. Singh, and R. Singh, Improving routing performance in AODV with link prediction in mobile ad hoc networks. Wireless Personal Communications. 83(1) (2015), pp. 603-618.

10.1007/s11277-015-2411-5

C. Hu and J.C. Hou, A link-indexed statistical traffic prediction approach to improving IEEE 802.11 PSM. Ad Hoc Networks. 3(5) (2005), pp. 529-545.

10.1016/j.adhoc.2004.08.003

M. Pavlov and R. Ichise, Finding experts by link prediction in co-authorship networks. Proc. 2nd International Conference on Finding Experts on the Web with Semantics (FEWS), CEUR-WS. 290 (2007), pp. 42-55.

T. Wohlfarth and R. Ichise, Semantic and event-based approach for link prediction. Proc. International Conference on Practical Aspects of Knowledge Management. (2008), pp. 50-61.

10.1007/978-3-540-89447-6_7

J.-S. Liu and K.-C. Ning, Applying link prediction to ranking candidates for high-level government post. Proc. International Conference on Advances in Social Networks Analysis and Mining, IEEE. (2011), pp. 145-152.

10.1109/ASONAM.2011.54

E. Perez-Cervantes, J.P. Mena-Chalco, M.C.F. de Oliveira, and R.M. Cesar, Using link prediction to estimate the collaborative influence of researchers. Proc. 9th IEEE International Conference on e-Science, IEEE. (2013). pp. 293-300.

10.1109/eScience.2013.32

T. Nguyen, D. Phung, B. Adams, and S. Venkatesh, Towards discovery of influence and personality traits through social link prediction. Proc. 5th International AAAI Conference on Weblogs and Social Media. (2011). pp. 566-569.

10.1609/icwsm.v5i1.14151

F. Folino and C. Pizzuti, Link prediction approaches for disease networks. Proc. International Conference on Information Technology in Bio and Medical Informatics. (2012), pp. 99-108.

10.1007/978-3-642-32395-9_8

D. Liben-Nowell and J. Kleinberg, The link-prediction problem for social networks. Journal of the American Society for Information Science and Technology. 58(7) (2007), pp. 1019-1031.

10.1002/asi.20591

J. Zhao, L. Miao, J. Yang, H. Fang, Q.-M. Zhang, M. Nie, P. Holme, and T. Zhou, Prediction of links and weights in networks by reliable routes. Scientific reports. 5 (2015), 12261.

10.1038/srep1226126198206PMC4510530

J. Lee and R. Tukhvatov, Evaluations of similarity measures on VK for link prediction. Data Science and Engineering. 3(3) (2018), pp. 277-289.

10.1007/s41019-018-0073-5

V. Martinez, F. Berzal, and J.-C. Cubero, Adaptive degree penalization for link prediction. Journal of Computational Science. 13 (2016), pp. 1-9.

10.1016/j.jocs.2015.12.003

L. Lü, C.-H. Jin, and T. Zhou, Similarity index based on local paths for link prediction of complex networks. Physical Review E. 80(4), 046122.

10.1103/PhysRevE.80.046122

C. Wang, V. Satuluri, and S. Parthasarathy, Local probabilistic models for link prediction. Proc. 7th IEEE international conference on data mining (ICDM), IEEE. (2007), pp. 322-331.

10.1109/ICDM.2007.108

B. Meng, H. Ke, and T. Yi, Link prediction based on a semi-local similarity index. Chinese Physics B. 20(12) (2011), 128902.

10.1088/1674-1056/20/12/128902

A. Papadimitriou, P. Symeonidis, and Y. Manolopoulos, Fast and accurate link prediction in social networking systems. Journal of Systems and Software (JSS). 85(9) (2012), pp. 2119-2132.

10.1016/j.jss.2012.04.019

C.A. Bliss, M.R. Frank, C.M. Danforth, and P.S. Dodds, An evolutionary algorithm approach to link prediction in dynamic social networks. Journal of Computational Science. 5(5) (2014), pp. 750-764.

10.1016/j.jocs.2014.01.003

P. Wang, B. Xu, Y. Wu, and X. Zhou, Link prediction in social networks: The state-of the-art. Science China Information Sciences. 58(1) (2015), pp. 1-38.

10.1007/s11432-014-5237-y

L. Lü and T. Zhou, prediction in complex networks: A survey. Physica A: statistical mechanics and its applications. 390(6) (2011), pp. 1150-1170.

10.1016/j.physa.2010.11.027

T. Zhou, L. Lü, and Y.-C. Zhang, Predicting missing links via local information. The European Physical Journal B. 71(4) (2009), pp. 623-630.

10.1140/epjb/e2009-00335-8

L.A. Adamic and E. Adar, Friends and neighbors on the web. Social networks. 25(3) (2000), pp. 211-230.

10.1016/S0378-8733(03)00009-1

L. Katz, A new status index derived from sociometric analysis. Psychometrika. 18(1) (1953), pp. 39-43.

10.1007/BF02289026

G. Jeh and J. Widom, Simrank: A measure of structural-context similarity. Proc. 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM. (2002), pp. 538-543.

10.1145/775047.775126

V.D. Blondel, A. Gajardo, M. Heymans, P. Senellart, and P. Van Dooren, A measure of similarity between graph vertices: Applications to synonym extraction and web searching. SIAM Review. 46(4) (2004), pp. 647-666.

10.1137/S0036144502415960

E.A. Leicht, P. Holme, and M.E. Newman, Vertex similarity in networks. Physical Review E. 73(2) (2006), 026120.

10.1103/PhysRevE.73.026120

W. Liu and L. Lü, Link prediction based on local random walk. Europhysics Letters (EPL). 89(5) (2010), 58007.

10.1209/0295-5075/89/58007

V. Martínez, F. Berzal, and J.-C. Cubero, A survey of link prediction in complex networks. ACM Computing Surveys (CSUR). 49(4) (2017), pp. 1-33.

10.1145/3012704

P. Srilatha and R. Manjula, Similarity index based link prediction algorithms in social networks: A survey. Journal of Telecommunications and Information Technology. 2 (2016), pp. 87-94.

10.26636/jtit.2016.2.725

M.E. Newman, Clustering and preferential attachment in growing networks. Physical Review E. 64(2) (2001), 025102.

10.1103/PhysRevE.64.025102

P. Jaccard, Étude comparative de la distribution florale dans une portion des alpes et des jura. Bull Soc Vaudoise Sci Nat. 37 (1901), pp. 547-579.

P. Bhattacharyya, A. Garg, and S.F. Wu, Analysis of user keyword similarity in online social networks. Social Network Analysis and Mining. 1(3) (2011), pp. 143-158.

10.1007/s13278-010-0006-4

L. Lü and T. Zhou, Role of weak ties in link prediction of complex networks. Proc. 1st ACM International Workshop on Complex Networks Meet Information and Knowledge Management, ACM. (2009), pp. 55-58.

10.1145/1651274.1651285

H. Chen, X. Li, and Z. Huang, Link prediction approach to collaborative filtering. Proc. 5th ACM/ IEEE-CS Joint Conference on Digital Libraries (JCDL), IEEE. (2005), pp. 141-142.

10.1145/1065385.1065415

T. Murata and S. Moriyasu, Link prediction of social networks based on weighted proximity measures. Proc. IEEE/WIC/ACM International Conference on Web Intelligence, IEEE Computer Society. (2007), pp. 85-88.

10.1109/WI.2007.52

H.H. Song, T.W. Cho, V. Dave, Y. Zhang, and L. Qiu, Scalable proximity estimation and link prediction in online social networks. Proc. 9th ACM SIGCOMM Conference on Internet Measurement, ACM. (2009), pp. 322-335.

10.1145/1644893.1644932

S. Izudheen and S. Mathew, Identifying negative interactions in protein-protein interaction network using weak edge-edge domination set. Procedia Technology. 24 (2016), pp. 1423-1430.

10.1016/j.protcy.2016.05.167

Y. Lu, Y. Guo, and A. Korhonen, Link prediction in drug-target interactions network using similarity indices. BMC Bioinformatics. 18(1) (2017), pp. 39-48.

10.1186/s12859-017-1460-z28095781PMC5240398

S. Tariq, M. Saleem, and M. Shahbaz, User similarity determination in social networks. Technologies. 7(2) (2019), pp. 36-51.

10.3390/technologies7020036

H. Tian and R. Zafarani, Exploiting common neighbor graph for link prediction. in Proc. 29th ACM Int. Conf. Inf. & Knowl. Manage. (CIKM ’20). (2020), pp. 3333-3336. DOI: 10.1145/3340531.3417464.

10.1145/3340531.3417464

R.E. Tillman, V.K. Potluru, J. Chen, P.P. Reddy and M.M. Veloso, Heuristics for link prediction in multiplex networks, in Proc. 24th European Conf. Artificial Intelligence (ECAI 2020). (2020). Available at: https://arxiv.org/abs/2004.04704.

L. Wang, J. Ren, B. Xu, J. Li, W. Luo, and F. Xia, MODEL: Motif-based deep feature learning for link prediction. arXiv:2008.03637, Aug. (2020). [Preprint]. Available at: https://arxiv.org/abs/2008.03637.

G. Sharma, A. Challa, P. Gupta, and M.N. Murty, Higher-order relations skew link prediction in graphs. CoRR, arXiv:2111.00271, Oct. (2021). [Preprint]. Available at: https://arxiv.org/abs/2111.00271.

H. Gul, F. Al-Obeidat, A. Amin, M. Tahir, and K. Huang, Efficient link prediction model for real-world complex networks using matrix-forest metric with local similarity features. J. Complex Networks. 10(5) (2022), cnac039. DOI: 10.1093/comnet/cnac039.

10.1093/comnet/cnac039

S. Yun, S. Kim, J. Lee, J. Kang, and H. J. Kim, Neo-GNNs: Neighborhood overlap-aware graph neural networks for link prediction. arXiv:2206.04216, Jun. (2022). [Preprint]. Available at: https://arxiv.org/abs/2206.04216.

N. Singh and I. Singh, Application of resource allocation similarity-based link prediction in wireless networks. Int. J. Wireless Security & Networks. 1(2) (2023), pp. 37-42. Available at: https://journals.stmjournals.com/ijwsn/article=2023/view=118836.

J. Zhang, L. Wei, Z. Xu, and Q. Yao, Heuristic learning with graph neural networks: A unified framework for link prediction. arXiv:2406.07979, Jun. (2024). [Preprint]. Available at: https://arxiv.org/abs/2406.07979.

10.1145/3637528.3671946

Y.V. Nandini, T.J. Lakshmi, M.K. Enduri, and H. Sharma, Link prediction in complex networks using average centrality-based similarity score. Entropy. 26(6) (2024), 433. DOI: 10.3390/e26060433.

10.3390/e2606043338920442PMC11202912

P. Kapoor, S. Kaushal, H. Kumar, and K. Kanwar, A survey on feature extraction and learning techniques for link prediction in homogeneous and heterogeneous complex networks. Artif. Intell. Rev. 57 (2024). DOI: 10.1007/s10462-024-10998-7.

10.1007/s10462-024-10998-7

Z. Zhou, G. Wan, and B. Du, Common neighbor completion with information entropy for link prediction in social networks. Data Science and Engineering. 10 (2025), pp. 40-53. DOI: 10.1007/s41019-024-00267-6.

10.1007/s41019-024-00267-6

S.D. Pandey and S. Samanta, Strength prominence (SP) index: A link prediction method in fuzzy social networks. Complex & Intelligent Systems. 11 (2025). DOI: 10.1007/s40747-025-01925-6.

10.1007/s40747-025-01925-6

L. La Cava, D. Mandaglio, L. Zangari, and A. Tagarelli, Heuristic-informed mixture of experts for link prediction in multilayer networks (MoE-ML-LP). arXiv:2501.17557, Jan. (2025). [Preprint]. Available at: https://arxiv.org/abs/2501.17557.

10.1016/j.ins.2026.123106

M. Al Hasan, V. Chaoji, S. Salem, and M. Zaki, Link prediction using supervised learning. Proc. Workshop on Link Analysis, Counter-Terrorism and Security (SDM). (2006), pp. 1-10.

N. Benchettara, R. Kanawati, and C. Rouveirol, Supervised machine learning applied to link prediction in bipartite social networks. Proc. International Conference on Advances in Social Networks Analysis and Mining, IEEE. (2010), pp. 326-330.

10.1109/ASONAM.2010.87

J. O’Madadhain, J. Hutchins, and P. Smyth, Prediction and ranking algorithms for event based network data. ACM SIGKDD Explorations Newsletter. 7(2) (2005), pp. 23-30.

10.1145/1117454.1117458

N.Z. Gong, A. Talwalkar, L. Mackey, L. Huang, E.C.R. Shin, E. Stefanov, E.R. Shi, and D. Song, Joint link prediction and attribute inference using a social-attribute network. ACM Transactions on Intelligent Systems and Technology (TIST). 5(2) (2014), pp. 1-20.

10.1145/2594455

F. Liu, B. Liu, C. Sun, M. Liu, and X. Wang, Deep belief network-based approaches for link prediction in signed social networks. Entropy. 17(4) (2015), pp. 2140-2169.

10.3390/e17042140

S. Scellato, A. Noulas, and C. Mascolo, Exploiting place features in link prediction on location-based social networks. Proc. 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM. (2011), pp. 1046-1054.

10.1145/2020408.2020575

E. Tasnádi and G. Berend, Supervised prediction of social network links using implicit sources of information. Proc. 24th International Conference on World Wide Web (WWW), ACM. (2015), pp. 1117-1122.

10.1145/2740908.2743037

J. Valverde-Rebaza and A. de Andrade Lopes, Exploiting behaviors of communities of twitter users for link prediction. Social Network Analysis and Mining. 3(4) (2013), pp. 1063-1074.

10.1007/s13278-013-0142-8

B. Qiu, K. Ivanova, J. Yen, and P. Liu, Behavior evolution and event-driven growth dynamics in social networks. Proc. 2nd IEEE International Conference on Social Computing, IEEE. (2010), pp. 217-224.

10.1109/SocialCom.2010.38

R.N. Lichtenwalter, J.T. Lussier, and N.V. Chawla, New perspectives and methods in link prediction. Proc. 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM. (2010), pp. 243-252.

10.1145/1835804.1835837

P. Symeonidis, N. Iakovidou, N. Mantas, and Y. Manolopoulos, From biological to social networks: Link prediction based on multi-way spectral clustering. Data and Knowledge Engineering. 87 (2013), pp. 226-242.

10.1016/j.datak.2013.05.008

H. Kashima and N. Abe, A parameterized probabilistic model of network evolution for supervised link prediction. Proc. 6th International Conference on Data Mining (ICDM), IEEE. (2006), pp. 340-349.

10.1109/ICDM.2006.8

T.-T. Kuo, R. Yan, Y.-Y. Huang, P.-H. Kung, and S.-D. Lin, Unsupervised link prediction using aggregative statistics on heterogeneous social networks. Proc. 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM. (2013), pp. 775-783.

10.1145/2487575.2487614

J. Zhang, J. Tang, J. Li, Y. Liu, and C. Xing, Who influenced you? Predicting retweet via social influence locality. ACM Transactions on Knowledge Discovery from Data (TKDD). 9(3) (2015), pp. 1-26.

10.1145/2700398

H. Liu, Z. Hu, H. Haddadi, and H. Tian, Hidden link prediction based on node centrality and weak ties. Europhysics Letters (EPL). 101(1) (2013), pp. 1-6.

10.1209/0295-5075/101/18004

X. Fang, P. Hu, Z. Li, and W. Tsai, Predicting Adoption Probabilities in Social Networks. Information Systems Research. 24(1) (2013), pp. 128-145.

10.1287/isre.1120.0461

X. Fang, O.R. Liu Sheng, P. Goes, When Is the Right Time to Refresh Knowledge Discovered from Data?. Operations Research. 61(1) (2013), pp. 32-44.

10.1287/opre.1120.1148

X. Fang, Inference-based Naïve Bayes: Turning Naïve Bayes Cost-sensitive. IEEE Transactions on Knowledge and Data Engineering. 25(10) (2013), pp. 2302-2313.

10.1109/TKDE.2012.196

Z. Li, X. Fang, X. Bai, and O.R. Liu Sheng, Utility-based Link Recommendation for Online Social Networks. Working Paper. 63(6) (2015).

10.1287/mnsc.2016.2446

M.A. Brandão, M.M. Moro, G.R. Lopes, and J.P.M. Oliveira, Using Link Semantics to Recommend Collaborations in Academic Social Networks. In Proceedings of the 22nd International Conference on World Wide Web Companion (WWW). (2013), pp. 833-840.

10.1145/2487788.2488058

J. Ugander, L. Backstrom, C. Marlow, and J. Kleinberg, Structural Diversity in Social Contagion. Proceedings of the National Academy of Sciences. 109(16) (2012), pp. 5962-5966.

10.1073/pnas.111650210922474360PMC3341012

S. Vargas and P. Castells, Rank and Relevance in Novelty and Diversity Metrics for Recommender Systems. In Proceedings of the 5th ACM Conference on Recommender Systems. (2011), pp. 109-116.

10.1145/2043932.2043955

International Journal of Sustainable Building Technology and Urban Development ISSN:2093-761X(Print) 2093-7628(Online)