Open Access
ARTICLE
Hypergraphs for Covering Trees and Approximating Steiner Trees
LIRMM, University of Montpellier, CNRS, Montpellier, France
* Corresponding Author: Miklós Molnár. Email:
Computers, Materials & Continua 2026, 88(1), 94 https://doi.org/10.32604/cmc.2026.077825
Received 17 December 2025; Accepted 17 April 2026; Issue published 08 May 2026
Abstract
This article presents a particular tree covering technique. To cover a set of nodes belonging to an unknown tree, a set of connected small Steiner trees is proposed. These Steiner trees can be represented as hyperedges, and a chain or tree of hyperedges provides the cover. The model allows the calculation of approximate (partial) spanning trees in graphs. The idea of covering a set of nodes by hyperedges can be used directly in Steiner heuristics. The NP-hard Steiner problem in graphs is one of the most studied graph-related problems. Several heuristics are known to give approximated solutions. The classical approximations of the Steiner problem apply shortest paths. We present a generalized metric closure that can be constructed from hyperedges of limited size, and new approximations based on the generalized metric closure are proposed. A connected hyperedge set (without loops) approximates a Steiner tree. The paper also presents a performance analysis of the proposed heuristics. The variation in hyperedge size is analyzed. An interesting result is that using larger hyperedges significantly improves the algorithm’s efficiency.Keywords
The minimal partial spanning tree problem, also known as the Steiner problem in graphs, is one of the most studied graph-related problems. The main objective of this article is to contribute fruitful ideas to the resolution of the Steiner problem. We propose a new method to construct approximate tree of the Steiner problem. The method is presented in two steps.
1) Our first proposal consists of analyzing the cover of an arbitrary tree (more precisely, the cover of its leaves) using k-limited hyperedges. We are interested in finding a set of connected hyperedges that is “good” in terms of cost (length). If a cycle exists in the hypergraph used, at least one hyperedge can be removed without loss of connectivity or degradation of coverage quality. Therefore, the coverage can take the form of a chain or a tree of hyperedges. These structures are cycle-free. We examine the ratio between the cost of the cover (of the hyperchain) and the cost of the base tree to be covered.
2) The proposed technique can then be applied to approximate minimal Steiner trees in graphs. Remember than these Steiner trees should cover a given node set with minimal cost. The problem is NP-hard and heuristics are needed to solve it in polynomial time. Hyperedges allow the construction of a hypergraph built on the nodes to span. In the same way as in the metric closure of a Steiner problem, which usually contains 2-limited hyperedges (i.e., shortest paths), we propose a generalized metric closure. It contains k-limited hyperedges on the set of nodes. Greedy and exact algorithms can then be proposed to construct approximations of the minimal Steiner tree.
The paper is composed of the following sections. After this Introduction, the most relevant related papers are presented. To facilitate the reading of the paper, Section 3 contains the used main notation and some definitions. The tree coverage using sets of size limited hyperedges is analyzed in Section 4. An exact method for constructing the best cost cover using hyperedges of limited size and a greedy construction can also be found. As a result of this discussion, in Section 5, possible approximation algorithms for solving the Steiner problem are presented. Some conclusions close the paper.
The Steiner problem in graphs is one of the most analyzed problems related to graphs [1]. The problem is NP-hard; its decision variant is NP-complete [2]. Intensive research has been conducted to obtain approximate solutions. The classical approximations of the Steiner problem apply 2-limited hyperedges (shortest paths) and give 2-approximations [3]. Well-known algorithms are Kruskal’s algorithm and Takahashi-Matsuyama’s algorithm. Triplets, added Steiner nodes of small Steiner trees, have been proposed to improve the approximation ratio (firstly in [4,5]). An LP-based approximation algorithm based on an iterative randomized rounding technique has been proposed in [6]. The solution is a tree whose cost is at most
The article [7] provides an overview of the rich developments from the last three decades for the STP in graphs and highlights the most recent computational studies for some of its closely related variants. As it is indicated in [8], studying various variations of Steiner trees became an exciting activity recently. The paper gives a review on important developments for the Steiner problem. A combination of three concepts: implications, conflicts, and reductions are analyzed to advance exact solutions of the Steiner tree problem and various new techniques are conceived [9,10]. By integrating the new methods into a branch-and-cut framework, an exact Steiner problem solver has been developed. In some particular cases, the problem is polynomial-time solvable is it is the case on strongly chordal graphs [11]. A Compendium on the Steiner tree problem is also available online at [12].
In recent years, the dynamic version of the Steiner problem has also attracted an attention. The effect of a sequence of edge insertions and deletions on the Steiner trees has been analyzed in [13]. To maintain a Steiner tree in the updated graph, a fully dynamic algorithm has been proposed. A later version can be found in [14]. A detailed discussion of the extension of the Steiner tree problem to dynamic graphs, an exact algorithm that computes all Steiner sets of a given size, and the application on a varying Steiner set are shown in [15,16]. An interesting result is that the dynamic Steiner problem is NP-hard even for shortest paths.
Steiner problems in metric spaces (in Euclidean or graph-defined spaces) are intimately related to hypergraphs. Earlier, in his PhD dissertation, Warme analyzed the geometric Steiner tree problem that can be viewed as a minimum-cost spanning tree in a special hypergraph, in which nodes are terminals (leaves of full Steiner trees), and hyperedges are full Steiner trees [17]. In [18], a Steiner tree representation of the hyperedges is considered as the edge representation of a hypergraph. To solve the Steiner tree problem, the metric closure (the distance graph) can be used, as it is also indicated in the chapter [19]. This chapter gives an analysis and some relations between the Steiner tree problem in graphs and the Minimum Spanning Tree problem in hypergraphs. A branch-and-cut method based on the linear relaxation of an integer program of the problem has been proposed.
Paper [20] compares several relaxations of the problem based on hypergraphs as well as relaxations based on the formulation of the Steiner problem in graphs. It indicates that the union of trees linked by the hyperedges is a graph, and that the concatenation must be reduced by solving the classical Steiner problem in this graph.
Let
Definition 1 (Diameter of a tree): The diameter D of the tree is the path with maximum cost among all the paths in the tree:
Definition 2 (Raceme): We call raceme a connected subgraph
t is the base/attachment/articulation node of the raceme.
The longest path
Using these terms, a tree can be decomposed into a diameter and attached racemes. Fig. 1 illustrates the decomposition. The diameters of the two highlighted racemes are indicated near the racemes.

Figure 1: The decomposition of a tree into its diameter and racemes.
Property 1: Let
Proof: Trivial. Let us suppose that the path
For any raceme, an important consequence is that:
Remember that G is connected and simple.
Definition 3 (Hyperedge): Let
Sub-graphs of G can be associated with a hyperedge. Trivially, only sub-graphs spanning the nodes in S are interesting. Among spanning sub-graphs at most one with minimal cost exists: it is the minimum Steiner tree of S.
Definition 4 (Cost of a hyperedge): We propose the association of the minimum Steiner tree with the hyperedge (if there are several trees, one of them can be selected arbitrarily.) We consider the cost of the minimum Steiner tree of S as the cost
Definition 5 (k-limited hyperedge): It is a hyperedge of size (cardinality) at most k.
Hyperedges can be used to cover node sets. They can be isolated or connected.
Connectivity. To ensure connectivity, the hyperedges must be connected; there must be a path connecting each node in S to the other nodes via the hyperedges.
Definition 6 (Coverage): A node set S is covered by a set of hyperedges, iff each node in S belongs to at most one hyperedge and the connectivity is true.
Definition 7 (Cost of a coverage): It is the sum of the costs of the hyperedges that compose it.
We are interested in a coverage using “good” sets of hyperedges: two hyperedges in the coverage share at most one leaf, and there is no redundancy (cycle) in the set of hyperedges. Consequently, the coverage is a chain of hyperedges (a path) or a hypertree.
An important parameter of the coverage is the size k of the used hyperedges. The cost of the coverage and the complexity of the construction depend on it.
On the one hand, the higher the value of k, the closer the cost of coverage gets to the optimal coverage cost. For instance, if
On the other hand, the higher the value of k, the greater the computation time and cost. Remember, that the corresponding Steiner tree computation is exponential.
For this reason, only limited size hyperedges with small k values can be used in fast computations.
Often, to solve problems of covering sets of nodes, a metric closure graph is used, which contains the distances between the nodes. The distance of two nodes is the “cost” of the shortest path between them. Trivially, a shortest path is a hyperedge of size 2. The metric closure can be generalized based on larger (but limited) size hyperedges.
Definition 8 (Generalized metric closure): Let M be a set of nodes in G. Let
The generalized metric closure hypergraph contains the usual metric closure (the shortest paths), but also all of the hyperedges based on triplets of nodes, etc. Fig. 2 illustrates some hyperedges in the generalized metric closure of a small graph.

Figure 2: Some hyperedges in a generalized 3-limited metric closure.
To make the article easier to read, we summarize the main notations in Table 1.

In our particular first case, our objective is to cover a tree using k-limited hyperedges associated with leaves of the tree. Fig. 3 illustrates a hyperedge related to a set of leaves on a tree.

Figure 3: A hyperedge defined on a subset of leaves and its cost.
Let L be the set of leaves of a tree T.
Lemma 1: Let
Proof: Each hyperedge corresponds to a sub-tree of
Consequently: if the set of hyperedges covers the leaves of T without exception, then the “sub-tree” delimited by them is T and the cost of the coverage is at least the cost of the tree T.
Our analysis is limited to cases where the intersection of hyperedges belonging to a coverage of a tree T contains only one node per peer. The cost of the coverage can be computed as follows.
Some edges of the tree T are included in two hyperedges. Their cost is computed twice (once for each hyperedge) in the cost of the coverage. We refer to “duplicated” edges for cost calculation. Fig. 4 shows an example of the inclusion of some edges of T in the computation of the cost of two connected hyperedges. The duplicated edges form a path from the shared node in T.

Figure 4: Two connected hyperedges of cardinality 4, the corresponding sub-trees and the duplicated edges of the tree.
Lemma 2: Let
Proof: Trivial.
Remember that we limit the coverage to cases where each intersection of hyperedges contains only one node. The examined hypergraphs are simple: the connected hyperedges can form a chain or a tree [21].
Lemma 3: Let us suppose that
Proof:
Adding a new connected hyperedge, the maximal number of newly covered leaves is
To cover
4.1 Construction of a Coverage
In this section, we first present a costly procedure for finding the best cost k-limited coverage. A second algorithm follows a greedy construction, without any guarantee regarding costs.
To find the coverage with minimal cost for an arbitrary value
1) For the set L of leaf nodes, construct the k-limited hyper metric closure
For the integer values i from 2 to k, compute all i-tuples. Associate the cost of corresponding sub-trees with the hyperedges. Remember that there are

2) Find the set of connected hyperedges of minimal cost covering L. For this, the set of connected hyperedges covering L must be enumerated, and the best cost set must be selected.
This second problem (often called “FST concatenation phase” in the geoSteiner problem) is known [17]. Finding a subset of the hyperedges (corresponding to Steiner trees), whose concatenation produces the final Steiner tree, corresponds to a Minimum Spanning Tree problem in the hypergraph. This problem is NP-hard; for instance, a branch-and-cut approach or an exhaustive enumeration can be proposed to solve it (cf. Algorithm 2).

The connected set of hyperedges can be a chain or a tree. It does not contain loops.
A fast solution can be based on a greedy strategy to construct a chain of hyperedges to cover the leaves of a tree T. Each hyperedge in the coverage contains (at most)
To create the chain, (for instance) the diameter D of the tree can be followed, and suitable hyperedges can be created successively. Fig. 5 illustrates a chain composed of hyperedges of cardinality 3. The algorithm is outlined as follows and detailed by Algorithm 3.

Figure 5: Illustration of a chain with
1) Starting from one of the endpoints of the diameter, a first leaf set is determined: the
2) At each step of the suite, the sub-tree corresponding to the last hyperedge is examined. To create a new connected hyperedge, a leaf from this hyperedge is selected, and the

Note that this greedy algorithm does not guarantee the best cost solution (which is not necessarily a chain, but can also be a tree). Furthermore, regarding chains, the choice of the starting point and the subsequent continuations influences the cost. Greedy decisions do not guarantee an optimal cost chain either. As indicated, the problem is NP-hard.
4.2 Performance of the Greedy Algorithm
To illustrate the performance of the greedy algorithm, we performed calculations on randomly generated trees. For each case, a tree with

Finding upper-bounds on costs is an important topic. The following lemma gives a light upper-bound related to costs.
Lemma 4: The length of the duplicated part of T (the length of duplicated edges) is
s.t.
Proof: Trivial. There is no path from a leaf to the base node
For some particular trees, more interesting upper-bounds can be found. For instance, in spiders
5 Approximation of Minimum Steiner Trees
As demonstrated in the literature, hypergraphs can be applied to model the Steiner problem. For instance, the optimal concatenation of full Steiner trees to obtain a minimum cost Steiner tree corresponds to finding a minimum spanning tree in a hypergraph (MSTH) [17,20]. The idea to cover/span the leaf nodes of a tree by k-limited hyperedges can be used to create approximations of minimum Steiner trees. In the Steiner problem, a set M of terminal nodes must be covered. That is, the set of “leaves” to be covered by the algorithm is the given terminal node set. Of course, the Steiner tree is not known, but the tree covering technique presented earlier can be applied. Subsets of M can be covered by hyperedges of limited size, and their concatenation leads to a tree spanning M.
Since the Steiner problem is NP-hard, finding the optimum for large instances is impossible within a reasonable timeframe. Feasible approximations can be obtained by constructing hypertrees or hyperchains on M.
For minimal cost Steiner trees, the following properties are true.
Property 2 (Sub-optimality): Let ST be a sub-tree of a minimal cost Steiner tree T. Let
Proof: Let us suppose that ST is not of minimal cost, and another minimal cost Steiner tree STO spanning
Inversely, a chain of PMSTs (minimum Steiner trees) is not necessarily a PMST.
Property 3 (Chain of PMSTs): Let
Proof: A simple negative example provides proof. Let
The trees
Even though a chain of PMSTs does not give a PMST, the technique can be used to obtain approximations.
5.1 An Approximate Steiner Tree Construction
Given a terminal node set M in a simple graph G, a partial spanning tree that covers M at a minimal/good cost is needed. Since the optimization is NP-hard, decomposing the set of nodes M into small subsets allows for the computation of smaller PMSTs using a reasonable computation time. In this computation, the size of the used hyperedges is limited by a value
An algorithm similar to the greedy algorithm proposed for the tree coverage can work as follows. Let
1) A first Steiner terminal set S is selected with
2) At each step of the suite, the last hyperedge is examined. A leaf from this hyperedge is selected, and the next

3) The union of the resulting PMSTs is not necessarily a tree. It is a graph covering M, but it can contain duplicated edges and loops. The last step of the construction eliminates the useless edges, preserving the connectivity of the graph.
Trivially, the loop-free connected graph is a tree spanning/covering M.
The complexity of the computation is high.
The computation can be based on the metrical closure. For the Steiner set M of
The greedy algorithm constructs a chain of hyperedges. To select the first node in the next hyperedge,
In real-world cases, the construction of the metric closure is predominant.
Furthermore, memory usage depends primarily on two factors: size
5.2 Tests, Experimental Results
The performance of the proposed algorithm has been tested in random graphs and also in a known network topology for randomly generated terminal nodes. The PMSTs have been computed using the Gurobi optimizer.
In a first series, the graphs were generated by the Albert-Barabási algorithm [22]. (Often these graphs are called BA graphs.) As it is known, this algorithm produces random scale-free networks. Several natural and human-made systems, including the Internet, the World Wide Web, citation networks, and some social networks are thought to be approximately scale-free. In these typical “network” topology, certain nodes (called hubs) have high degree.
In BA-type random graphs of 80 nodes and 238 arcs, random terminal groups of 15 nodes have been generated. The terminal node sets have been covered by hyperchains. The size of the hyperedges has been varied between

In a second case, random BA graphs with 120 nodes and 328 arcs have been created, and groups of 35 terminals have been generated. Table 4 contains the results.

A thirds series has been realized using a known, relatively small network topology: the GEANT network, illustrated in Fig. 6 [23]. In this connected graph of 40 nodes and 61 edges, random costs between 1 and 5 were assigned to the edges. For each value of

Figure 6: The GEANT network (source geant.org).

To test more important trees in a bigger topology, we used a random graph with 1500 nodes and 5968 edges (Table 6). In this graph, PMSTs of groups with 300 nodes were computed. The coverage of the PMSTs was achieved using k-limited hyperedges with k = 10, 20, 30. This test clearly shows the effect of k.

By increasing the size
An interesting result is that reducing the resulting graph (i.e., eliminating unnecessary duplication) greatly improves the algorithm’s efficiency. The experimental ratio between the chain cost and the PMST is between 1.2 and 2.34, while the ratio between the reduced tree and the PMST is between 1.11 and 1.51. Duplicate edges can always be removed after calculating the small Steiner trees corresponding to the hyperedges.
In the first part of this work, we propose to cover a set of leaves of a tree with a set of connected hyperedges (with a hyperchain). Analysis of explicitly given trees shows that the cost of hyperchains covering the nodes is greater than the tree cost itself. This is a trivial result because some edges of the graph are used several times in the set of hyperedges. The idea of covering a tree with a set of hyperedges can be applied to approximately solve the Steiner problem, where only the terminal node set is known. A simple greedy algorithm is proposed to construct a hyperchain covering the Steiner terminal sets. The union of the small Steiner trees corresponding to the hyperedges of the covers gives a multigraph that can be reduced by eliminating loops corresponding to parallel edges. Experimental results show that the partial spanning tree obtained is a good approximation of the minimal Steiner tree. Unfortunately, at the moment, there is no theoretical result on the approximation ratio varying the hyperedge sizes.
In perspective, we suggest that the greedy algorithm can be improved. During the construction of chains, node groups could be selected more advantageously (for example, based on the distance between nodes). Furthermore, the construction of hypertrees covering the terminal nodes could also be explored. In-depth studies are needed to obtain eventual theoretical results.
Acknowledgement: Not applicable.
Funding Statement: The author received no specific funding for this study.
Availability of Data and Materials: Random graphs were generated using an internal code of graph generator. The data describing the parameters are available from the author upon reasonable request.
Ethics Approval: Not applicable.
Conflicts of Interest: The author declares no conflicts of interest.
References
1. Hwang F, Richards D, Winter P. The Steiner tree problem. In: Annals of discrete mathematics. Amsterdam, The Netherland: North-Holland; 1992. [Google Scholar]
2. Karp RM. Reducibility among combinatorial problems. In: 50 years of integer programming 1958-2008. Berlin/Heidelberg, Germany: Springer; 2010. p. 219–41. [Google Scholar]
3. Vazirani VV. Approximation algorithms. Berlin/Heidelberg, Germany: Springer Publishing Company, Incorporated; 2010. [Google Scholar]
4. Zelikovsky AZ. An 11/6-approximation algorithm for the steiner problem on graphs. In: Neŝetril J, Fiedler M, editors. Annals of discrete mathematics. Amsterdam, The Netherlands: Elsevier; 1992. p. 351–4. [Google Scholar]
5. Robins G, Zelikovsky A. Tighter bounds for graph steiner tree approximation. SIAM J Discret Math. 2005;19(1):122–34. doi:10.1137/S0895480101393155. [Google Scholar] [CrossRef]
6. Byrka J, Grandoni F, Rothvoss T, Sanità L. Steiner tree approximation via iterative randomized rounding. J ACM. 2013;60(1):6. doi:10.1145/2432622.2432628. [Google Scholar] [CrossRef]
7. Ljubic I. Solving Steiner trees: recent advances, challenges, and perspectives. Networks. 2021;77(2):177–204. [Google Scholar]
8. Su K, Lu B, Ngo H, Pardalos P, Du DZ. Steiner tree problems. In: Encyclopedia of optimization. Cham, Switzerland: Springer International Publishing; 2024. p. 1–15. [Google Scholar]
9. Rehfeldt D, Koch T. Implications, conflicts, and reductions for steiner trees. In: Proceedings of the 22nd International Conference on Integer Programming and Combinatorial Optimization; 2021 May 19–21; Atlanta, GA, USA. Cham, Switzerland: Springer; 2021. p. 473–87. [Google Scholar]
10. Rehfeldt D, Koch T. Implications, conflicts, and reductions for Steiner trees. Math Program. 2023;197(2):903–66. doi:10.1007/s10107-021-01757-5. [Google Scholar] [CrossRef]
11. de Figueiredo CMH, Lopes R, de Melo AA, Silva A. Parameterized algorithms for Steiner tree and (connected) dominating set on path graphs. Networks. 2024;84(2):132–47. doi:10.1002/net.22220. [Google Scholar] [CrossRef]
12. Hauptmann M, Karpinski M. A compendium on steiner tree problems. 2015 [cited 2026 Apr 16]. Available from: https://theory.cs.uni-bonn.de/info5/steinerkompendium/netcompendium.html. [Google Scholar]
13. Raikwar H, Karmakar S. Fully dynamic algorithm for the steiner tree problem in planar graphs. In: Proceedings of the 2022 Tenth International Symposium on Computing and Networking Workshops (CANDARW); 2022 Nov 21–24; Himeji, Japan. p. 416–20. [Google Scholar]
14. Raikwar H, Sadharakiya H, Karmakar S. Dynamic algorithms for approximate steiner trees. Concurr Comput Pract Exp. 2025;37(6–8):e70040. doi:10.1002/cpe.70040. [Google Scholar] [CrossRef]
15. Balev S, Pigné Y, Sanlaville E, Vernet M. The dynamic steiner tree problem: definitions, complexity, algorithms. In: Proceedings of the 3rd Symposium on Algorithmic Foundations of Dynamic Networks (SAND 2024); 2024 Jun 5–7; Patras, Greece. doi:10.4230/LIPIcs.SAND.2024.24. [Google Scholar] [CrossRef]
16. Balev S, Sanlaville E, Schoeters J. Temporally connected components. Theor Comput Sci. 2024;1013(3):114757. doi:10.1016/j.tcs.2024.114757. [Google Scholar] [CrossRef]
17. Warme DM. Spanning trees in hypergraphs with applications to Steiner trees [dissertation]. Charlottesville, VA, USA: University of Virginia; 1998. doi:10.5555/927353. [Google Scholar] [CrossRef]
18. Kaufmann M, van Kreveld M, Speckmann B. Subdivision drawings of hypergraphs. In: Tollis IG, Patrignani M, editors. Graph drawing. Berlin/Heidelberg, Germany: Springer; 2009. p. 396–407. [Google Scholar]
19. Brazil M, Zachariasen M. Steiner trees in graphs and hypergraphs. In: Optimal interconnection trees in the plane: theory, algorithms and applications. Cham, Switzerland: Springer International Publishing; 2015. p. 301–17. [Google Scholar]
20. Polzin T, Daneshmand SV. On Steiner trees and minimum spanning trees in hypergraphs. Oper Res Lett. 2003;31(1):12–20. doi:10.1016/s0167-6377(02)00185-2. [Google Scholar] [CrossRef]
21. Ouvrard X. Hypergraphs: an introduction and review. arXiv:2002.05014. 2020. [Google Scholar]
22. Albert R, Barabási AL. Statistical mechanics of complex networks. Rev Mod Phys. 2002;74(1):47–97. doi:10.1103/revmodphys.74.47 [Google Scholar] [CrossRef]
23. GEANT network. 2012 [cited 2026 Apr 16]. Available from: https://topology-zoo.org/files/Geant2012.gml. [Google Scholar]
Cite This Article
Copyright © 2026 The Author(s). Published by Tech Science Press.This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Submit a Paper
Propose a Special lssue
View Full Text
Download PDF
Downloads
Citation Tools