Abstract
Graphical Abstract
Click image to view fullscreen
Keywords
1. Introduction
With the development of the information age, large amounts of data are being generated, gathered and disposed. Interfering based on solely on single source is no longer sufficient. Data fusion, also known as multisource data fusion, is a necessary technique to assemble various kinds of data from multiple sources to capture reliable and accurate information [1, 2, 3, 4]. Nevertheless, uncertain, imprecise, imbalance, and incomplete or even false data are inevitable on account of the impacts of the environment and the complexity of the goals [5, 6, 7]. Such kinds of problems increase difficult to multisource data fusion. To improve the performance of the fusion system, various data fusion methods have been presented [8, 9], and applied in a wide variety of areas [10, 11, 12, 13, 14], including artificial intelligence, target tracking and recognition, smart engineering management, IoT systems, financial systems, medical diagnosis, and so on [15, 16, 17].
Existing works provide an overview of the effort related to data fusion from different perspectives. Some papers [18, 19] investigate data fusion in smart Internet of Things (IoT). Some papers [20] investigate multisensor data fusion techniques. A comprehensive survey on data fusion in remote sensing is presented in [21]. Some papers [22] investigate data fusion in machine learning. A survey in [23] presents mobile agent itinerary planning for information fusion in wireless sensor networks. A survey on information fusion for edge intelligence is presented in [24]. A recently published survey conducts a comprehensive statistical analysis of the current theoretical and application achievements of multisource information fusion [25]. Dempster–Shafer evidence theory (DSET) [26, 27] is one of useful methodologies in the fusion of uncertain multi-source information [28]. To the best of our knowledge, although researchers have conducted reviews and surveys of data fusion from different perspectives, no attempt has been made to provide a comprehensive overview on the DSET for data fusion by carefully analyzing the abovementioned data fusion literature.
Therefore, in this paper, a comprehensive and systematic survey on data fusion in DSET is conducted. We first review the basic concepts and knowledge of classical DSET, then study the axioms of Dempster's rule of combination and the characteristics of DSET that are desirable for data fusion. Whereafter, we review classical DSET and its extensions, collectively referred to as classical evidence theory, for data fusion from three aspects of uncertainty modeling, fusion, and decision making. Next, we explore complex evidence theory for data fusion in both closed world and open world contexts that benefits from the frame of complex plane modelling. After that, we present classical and complex evidence theory framework-based multisource data fusion algorithms. These algorithms are applied to pattern classification to demonstrate their applicability through comparison with other related well-known methods. Finally, we discuss challenges and open directions for future research. The objectives of this work are to: 1) offer a general and synthetic review of classical and complex evidence theories; 2) provide a profound analysis and discussion of existing work in classical and complex evidence theory framework-based data fusion domains; and 3) identify remaining challenges and open directions for future research in this field.
The paper is organized as follows. Section 2 reviews classical DSET and its typical generalized theories. In Section 3, we analyze classical evidence theory for data fusion. In Section 4, we study and explore complex evidence theory for data fusion in both closed and open world contexts. In Section 5, classical and complex evidence theory framework-based multisource data fusion algorithms are presented. In Section 6, several challenges and open future research directions are discussed. Finally, Section 7 concludes this work.
2. Review of Classical DSET and Its Generalized Theories
In this section, we first review the basic concepts, knowledge and limitations of classical Dempster–Shafer evidence theory (DSET). Next, we review two typical generalizations of DSET, namely, DSmT: Dezert-Smarandache Theory, and GET: generalized evidence theory. In addition, we analyze the characteristics of the classical DSET and its generalized theories, then assess their differences.
2.1 DSET: Dempster–Shafer Evidence Theory [26, 27]
2.1.1 Basic Concepts and Knowledge of Classical DSET
The classical DSET, also called the theory of belief functions, was first presented by Dempster [26] and later developed by Shafer [27]. As a generalization of Bayesian probability theory, DSET is more flexible and effective to express and process uncertainty [29, 30, 31], which is applied in many fields, such as evidential reasoning [32, 33, 34], belief rule-base expert system [35, 36, 37, 38, 39], fault diagnosis [40, 41], software risk evaluation [42], and other aspects [43, 44]. The main concepts of DSET are introduced below [26, 27].
.
Definition 1 (Frame of discernment). Let be an arbitrary nonempty event. A frame of discernment (FOD), denoted as , is defined as:.
Definition 2 (Power set). Let be the power set of , denoted as:.
Definition 3 (Hypothesis or proposition). , is defined as a hypothesis or proposition..
Definition 4 (Mass function in DSET). In FOD , a mass function is defined as a mapping:.
Definition 5 (Focal element in DSET). Let be a BBA defined in Definition 4. , if , is defined as a focal element..
Definition 6 (Belief function). A belief function , mapping from to , is defined by.
Definition 7 (Plausibility function). A plausibility function , mapping from to , is defined byClearly, , , where and are the lower and upper limit functions to support , respectively.
.
Definition 8 (Dempster's rule of combination). Let and be two independent BBAs in FOD with propositions , respectively. Dempster's rule of combination (DRC), represented in the form , is defined byNote that Eq. (7) is feasible under the condition .
2.1.2 Axioms and Characteristics
Notably, DRC is conducive to data fusion since it has a set of attractive axioms that are illustrated below [46]:
-
Axiom A1:Compositionality.
is a function of only , , and . -
Axiom A2:Commutativity.
. -
Axiom A3:Associativity.
(). -
Axiom A4:Conditioning.
If , then -
Axiom A5:Internal Symmetry.
Let be an arbitrary permutation of hypothesis . Consider BBAs and ():Then, . -
Axiom A6:Autofunctionality.
For and , does not rely on for all .
By analyzing the above definitions and axioms, the characteristics of the classical DSET can be summarized as follows:
BBA in DSET has the ability to model partial or complete ignorance.
Compared to the Bayesian decision model, the belief function does not need experts to offer prior probabilities.
DRC satisfies the associative law and commutative law and provides flexible and facilitating reasoning to handle uncertainty in the fusion of multisource data.
The belief interval in DSET provides upper and lower probabilities by means of a belief function and plausibility function.
Consequently, as a form of nonprior generalization of Bayesian inference, DSET offers a general framework to support decision making through reasoning under uncertainty.
2.1.3 Restraints in DSET
DSET provides a mass function to express uncertainty quantitatively, and DRC for reasoning to ensure fusion. Specifically, the aim of Dempster's rule of combination is to aggregate and combine information modeled in mass functions or BBAs into a distinct function. Although DSET has many advantages that are desirable for data fusion, it suffers the following restraints that limit its application:
Restraint on the frame of discernment. In terms of the FOD of DSET, the elements are assumed to be exhaustive and exclusive. However, for a variety of fusion problems, the internal essence of hypotheses may be vague and imprecise, so the elements in the FOD may overlap. On the other hand, for dynamic fusion problems, the number of elements in the FOD changes over time, accompanied by the amendment of available knowledge. Hence, relaxing these assumptions by taking into account nonexclusivity among elements, as well as the evolution of element quantity in the FOD, is a key issue in data fusion to describe the problems in actual applications more realistically.
Constraint on independent evidence fusion. In DSET, when using DRC, the evidence to be fused assumed to be independent; however, dependency among evidence is ubiquitous in practical applications. How to overcome this limitation to make the combination rule able to handle uncertainty is another key issue in data fusion.
Counterintuitive result when fusing conflicting evidence. Because the BBAs are generated based on uncertain input variables modeled in a variety of forms from different sources, conflicts among multiple sources may exist due to the impacts of subjective and objective uncertainties. Nevertheless, counterintuitive results occur when fusing highly conflicting evidence through DRC. How to manage conflicting evidence to improve the fusion quality with a high decision level is another key issue in data fusion.
Thus far, several studies have contributed solutions to the abovementioned issues; however, no consensus has been reached about what the best approach is for the restraint relaxation of the FOD, dependent evidence fusion, and conflict management. Next, we survey two typical generalization frameworks of DSET that were presented to address these issues, namely, DSmT: Dezert-Smarandache Theory [47], and GET: generalized evidence theory [48]. The main concepts of these frameworks will be introduced below. In addition, a corresponding analysis of their characteristics, compared with those of classical DSET, will be presented.
2.2 DSmT: Dezert-Smarandache theory [47]
.
Definition 9 (Frame). Let be a finite set of exhaustive elements called a frame. If is congenitally not closed, namely, it is an open world or frame, can always be included in as a new closed world or frame: .In contrast to classical DSET, there are constraints on and in DSmT other than exhaustivity. Therefore, the frame of DSmT releases the restraints where the events must be exclusive and constant in the FOD of DSET. As a result, DSmT offers a more realistic and flexible structure of the frame model.
.
Definition 10 (Hyper-power set). The hyper-power set of is denoted asRemarkably, given a finite frame , has the following characteristics:
When , is called the hyper-power set of .
When all the elements of are known (or are assumed) to be truly exclusive, becomes the classical power set .
.
Definition 11 (Super-power set). The super-power set of is denoted as.
Definition 12 (Mass function in DSmT). In frame , a mass function in DSmT is defined as a mapping:Comparison of Definition 4 with Definition 12 indicates that compared to the classical BBA in DSET, in DSmT has the following interpretations and properties:
In DSmT, can be modeled in the power set , hyper-power set , and super-power set , while in DSET, can be modeled only in the power set .
When , where all the elements in are known and exclusive, GBBA in DSmT reduces to the classical BBA in DSET.
.
Definition 13 (Generalized belief function in DSmT). A generalized belief function in DSmT, mapping from to , is defined by.
Definition 14 (Generalized plausibility function in DSmT). A generalized plausibility function in DSmT, mapping from to , is defined byClearly, , , where and are the lower and upper limit functions of , respectively.
In DSmT, several proportional conflict redistribution (PCR) rules are presented for data fusion. The idea behind the PCR fusion rules is to proportionally shift total or partial conflict masses to nonempty sets involved in the conflicts in regard to the masses allocated by sources [49]. In particular, as discussed in [49], PCR5, which takes the conjunctive normal form of partial conflict into consideration, is regarded as the most mathematical and effective PCR fusion rule. Thus, we introduce the basic concept of PCR5 below.
.
Definition 15 (PCR5 rule) [49]. Let and be two GBBAs in frame . The PCR5 rule in DSmT is defined by:Notably, in contrast to the classical DRC, the PCR5 rule is quasi-associative and maintains the neutral influence of vacuous belief assignment. This is because the conjunctive normal form of each partial conflict does not cover , as is a neutral element for conflict. Therefore, no mass is assigned to after redistributing the conflict mass.
In summary, the generalized DSmT inherits the merits of classical DSET and has its own attractive characteristics, as follows [47]:
The frame in DSmT relaxes the assumptions on the FOD in DSET, except for the exhaustivity of . Specifically, the frame in DSmT also contains the elements with conjunctions and/or disjunctions and negations/complements of pure hypotheses.
GBBA in DSmT is capable of expressing partial or complete ignorance with not only the power set, but also the hyper- and super-power sets.
The generalized belief function in DSmT also does not need experts to provide prior probabilities, in contrast to the Bayesian decision model.
DSmT affords better fusion rule of PCR5, which can effectively cope with conflicting evidence compared to the classical DRC in DSET.
The generalized belief interval in DSmT also provides upper and lower probabilities. In addition, DSmT presents a method to work with imprecise quantitative or qualitative information without the limitation of interval-valued belief structures.
2.3 GET: Generalized Evidence Theory [48]
.
Definition 16 (Mass function in GET). In the FOD , a mass function in GET is defined as a mapping:.
Definition 17 (Focal element in GET). Let be a GBBA defined in Definition 16. , if , is defined as a focal element..
Definition 18 (Generalized belief function in GET). A generalized belief function in GET, mapping from to , is defined by.
Definition 19 (Generalized plausibility function in GET). A generalized plausibility function in GET, mapping from to , is defined by.
Definition 20 (Generalized combination rule in GET). Let and be two independent GBBAs with propositions , respectively, defined in Definition 16. The generalized combination rule (GCR), represented in the form , is defined as follows:The generalized GET inherits the merits of classical DSET and has the following attractive characteristics [48]:
The structure of GBBA in GET has the ability not only to model partial or complete ignorance but also to express the uncertainty caused by the incompleteness of FOD, so it can handle the uncertainty problem in an open world.
The generalized belief function in GET also does not need experts to provide prior probabilities, in contrast to the Bayesian decision model.
The generalized combination rule in GET not only satisfies the associative law and commutative law but also has the capability to reason with multisource data in the face of uncertainty, even under the condition that .
The generalized belief interval in GET provides upper and lower probabilities and can be applied to the open world not just the closed world.
When , GET reduces to the classical DSET.
Moreover, DSET has been extended in terms of other aspects [50, 51, 52].
3. Classical Evidence Theory for Data Fusion
Our methodology expresses a deep understanding of the surveyed papers with regards to evidence theory for data fusion. The process involves three steps, including uncertainty modeling, fusion, and decision making. Figure 1 shows a process of data fusion in the context of evidence theory.
3.1 Uncertainty Modeling
Uncertainty is ubiquitous in the real world and is found in almost all areas of scientific research [53, 54, 55]. Uncertainty in data fusion can generally be classified into the following two categories:
Aleatory uncertainty. This uncertainty originates from natural variability of the physical world and reflects its inherent randomness. Aleatory uncertainty exists naturally without connection to human knowledge. This kind of uncertainty cannot be removed or decreased by gathering more information.
Epistemic uncertainty. This uncertainty occurs because humans lack knowledge of the physical world and the ability to measure and model the physical world. In contrast to aleatory uncertainty, this kind of uncertainty can be reduced and even eliminated with the aid of more information and appropriate methods.
For example, phrases "I am 80% sure that …" and "I think there is a 85% change that …" express epistemic and aleatory uncertainty, respectively. In various areas of science and engineering, these uncertainty makes tasks more complicated and influences decision making in many adverse ways. Modeling and handling of these kinds of uncertainty plays an important role. The mass function/BBA, belief function and plausibility function in DSET offer more flexible and realistic expression and formalization of available knowledge with regards to possible values of uncertain input variables. Especially, for mass function, instead of filling in the missing value by a certain estimation, it can provide a straightforward way to quantify such a kind of ignorance state. In this way, any external operation in terms of the missing value is not required.
For evidence theory-based data fusion, BBA, as a basic unit of evidence theory framework processing, is the key issue that need to be addressed first. Therefore, how to generate appropriate BBAs from multisource information has been intensively studied in recent years. For example, the BBAs can be generated in accordance with multiple attributes of dataset. According to whether prior sample knowledge is used in the process of BBA generation, it can be mainly divided into the following three types:
Unsupervised. For instance, paper [56] investigates unsupervised segmentation of hidden Markov fields corrupted by correlated non-Gaussian noise; paper [57] presents a belief shift clustering method for dealing with object data; paper [58] investigates neural network-based evidential clustering for BBA generation.
Semi-supervised. For example, paper [59] studies a semi-supervised evidential label propagation algorithm for graph data clustering; paper [60] researches disagreement based semi-supervised learning approaches with belief functions; paper [61] investigates a fast semi-supervised evidential clustering.
Supervised. For instance, paper [62] investigates evidential calibration of binary SVM classifiers; paper [63] studies evidential classifiers, including logistic regression and its nonlinear generalizations of multilayer feedforward neural networks; paper [64] investigates an evidential classifier based on Dempster-Shafer theory and deep learning.
3.2 Fusion
After obtaining BBAs from multisource information, other key issue is about how to fuse these BBAs to better support decision-making.
In addition to the typical DRC and promotion generalized combination rules discussed in Section 2, various research has been conducted from other perspectives to improve the fusion performance. To summarise, there are three dominating classifications: evidential combination rule-based data fusion; evidence pretreatment-based data fusion; and hybrid evidential conflict models for data fusion. We will explain evidence theory-based data fusion methods from these three aspects in the following sections.
3.2.1 Evidential combination rule-based data fusion
In this section, we survey several existing evidential combination rules that have been widely applied in data fusion. Additionally, we compare existing evidential combination rules and summarized their properties.
(1) Evidential combination rules
Let and be two independent BBAs with hypotheses and in FOD , respectively.
.
Definition 21 (Smets's combination rules) [46]. Smets's conjunctive combination rule, represented in the formIn Eq. (1) and Eq. (2),
is an empty set belonging to∅ in a closed world with2 Φ .m ( ∅ ) = 0 is one or several hypotheses in an open world that does not belong to∅ .2 Φ
.
Definition 22 (Yager's combination rule) [65]. Yager's combination rule, represented in the formIn contrast to that under DRC, the conflict
.
Definition 23 (Dubois and Prade's combination rule) [66]. Dubois and Prade's hybrid combination rule is defined byThe mass satisfying
.
Definition 24 (Unified combination rule) [67, 68]. LetNote that the conflict
| Combination rules | Axioms | |||||
| A1 | A2 | A3 | A4 | A5 | A6 | |
| Dempster [26] | yes | yes | yes | yes | yes | yes |
| Smets [46] | yes | yes | yes | yes | yes | yes |
| Yager [65] | yes | yes | no | yes | yes | yes |
| Dubois and Prade [66] | yes | yes | no | yes | yes | no |
| Inagaki and Lefevre et al. [67, 68] | yes | yes | yes | yes | yes | no |
| Zhang [69] | yes | yes | Under conditions | yes | yes | yes |
| Mahler [70] | yes | yes | yes | yes | yes | yes |
| Dezert and Smarandache [49] | yes | yes | yes | yes | yes | no |
| Deng [48] | yes | yes | yes | Under conditions | yes | yes |
| Jiang and Zhan [71] | yes | yes | yes | Under conditions | yes | yes |
| Xiao [72, 73] | yes | yes | yes | yes | yes | yes |
| GCECR | yes | yes | yes | Under conditions | yes | yes |
.
Definition 25 (Weighted product combination rule) [69]. LetThe weighted product combination rule is associative if satisfying
.
Definition 26 (Mahler's weighted combination rule) [70]. Mahler's weighted combination rule is defined byUnlike DRC, the weighted product combination rule and Mahler's weighted combination rule introduce
.
Definition 27 (Jiang and Zhan's combination rule) [71]. Jiang and Zhan's combination rule, represented in the formJiang and Zhan's combination rule overcomes the shortcomings of the generalized combination rule in GET and has the following characteristics:
When
in Eq. (10), Jiang and Zhan's combination rule reduces to the classical DRC.m 1 ⊕ m 2 ( ∅ ) = 0 andm 1 ( ∅ ) are combined through the orthogonal sum operation.m 2 ( ∅ ) In Eq. (10),
is a process of normalization that is a generalization of1 1 − K in Eq. (8) of the classical DRC.1 1 − K If the sum of the GBBAs of all nonempty sets is zero or
, the whole belief is reallocated toK = 1 .∅
(2) Comparison and analysis
In accordance with the axioms A1-A6 of DRC in Section 2, we compare the combination rules described in Section 3.2. The results are summarized in Table 1.
Table 1 indicates that Smets [46], and Mahler [70] satisfy axioms A1-A6: compositionality, commutativity, associativity, conditioning, internal symmetry, and autofunctionality, as does Dempster's rule [26]. By contrast, the combination rules of Yager [65] and Dubois and Prade [66] do not satisfy axiom A3: associativity; Zhang [69]'s combination rule satisfies axiom A3 under the condition that
3.2.2 Evidence pretreatment-based data fusion
In this section, we review evidence pretreatment-based data fusion methods from several aspects, including evidential distance, Pignistic probability distance, correlation coefficient, belief divergence, belief entropy, and belief information quality.
(1) Evidential distance.
The classical evidential distance proposed by Jousselme et al. [75] is a useful tool to measure differences between different source data modeled by BBAs.
.
Definition 28 (Jousselme et al.'s distance) [75]. LetJousselme et al.'s distance has several desirable properties: nonnegativeness, symmetry, nondegeneracy, and triangle inequality. Since Jousselme et al.'s distance is a true metric [76] in favor of managing conflict in data fusion, several researchers have improved upon it [77].
(2) Pignistic probability distance.
The Pignistic probability transformation (PPT) function [78] can be used to measure conflict in data fusion, as will be detailed in Section 3.3. On the basis of the PPT function, Liu [79] presents a Pignistic probability distance to measure conflict between BBAs.
.
Definition 29 Liu's Pignistic probability distance [79] is defined byLiu's Pignistic probability distance is also called the distance between betting commitments of BBAs.
(3) Correlation coefficient.
.
Definition 30 Jiang's correlation coefficient [80] is defined by(4) Belief divergence.
.
Definition 31 Belief divergence, also called belief Jensen–Shannon (BJS), is defined byInspired by BJS divergence, various kinds of belief divergences for data fusion are exploited.
(5) Belief entropy.
.
Definition 32 Deng entropy [81] is defined byWhen
(6) Belief information quality.
As a complementary of belief entropy, the belief information quality is proposed to measure the certainty/quality of information [89].
.
Definition 33 Li et al.'s information quality of BBA3.2.3 Other evidential conflict models for data fusion
The classical discounting method proposed by Shafer [27] has been extended to manage conflicts in data fusion by taking into account the reliability of sources. Let us recall the basic definition.
.
Definition 34 (Shafer's discounting method) [27]. LetIn Eq. (19), the values of
indicates that data sourceα t = 0 is completely unreliable.t indicates that data sourceα t = 1 is completely reliable.t
Clearly, in the process of fusion, discounting is beneficial for handling the conflict from multisource data in accordance with their reliability.
.
Definition 35 Liu's two-dimensional conflict model [79] is defined byLet
.
Definition 36 Daniel's plausibility conflict [90] is defined by.
Definition 37 Lefèvre and Elouedi's combination with adapted conflict (CWAC) rule [91], which adapts the weight between Dempster's rule and the conjunctive rule by means of Jousselme et al.'s distance, is defined by:From the above discussion of hybrid evidential conflict models for data fusion, it can be learned that appropriately constructing a hybrid model by considering different aspects is a feasible and effective way to handle conflict in the fusion process.
3.3 Decision making
The outcome of evidence theory-based data fusion is related to the belief functions. Since belief functions have multiple interpretations, it is necessary to consider not only how to make a decision by defining the probabilistic transformation function for belief functions, but also that the selection of a suitable transformation function should be explained and justified. In this section, we survey several typical solutions for decision making on the basis of belief functions. Their advantages and limitations are also discussed.
(1) Pignistic probability transformation.
The classical Pignistic probability transformation (PPT) function presented by Smets and Kennes [78] can transform a BBA into a probability distribution.
.
Definition 38 Smets and Kennes's PPT [78] is defined byIn Eq. (25), the values of
This process represents a kind of average assignment and is sometimes too conservative to produce appropriate distributions [92]. As a result, many researchers have attempted to improve the model from various perspectives.
(2) Sudano and Martin's probability transformation.
.
Definition 39 Sudano and Martin's probability transformation by means of a mapping that is proportional to all plausibilities, denoted as PraPl [93], is defined byThe above definition indicates that Sudano and Martin's probability transformation is based on a belief function and plausibility function. However, when certain singletons are not included in the subsets of focal elements, the PraPl probability transformation function cannot make a reasonable assignment.
(3) Cobb and Shenoy's probability transformation.
.
Definition 40 Cobb and Shenoy's probability transformation function [78] is defined byCobb and Shenoy's probability transformation method is a kind of plausibility normalization.
(4) Cuzzolin's probability transformation.
.
Definition 41 Cuzzolin's probability transformation function [94], denoted as CuzzP, is defined byThe CuzzP probability transformation function considers the proportional redistribution of the total nonspecific mass (TNSM). However, CuzzP has some limitations. When
(5) DSmP probability transformation.
.
Definition 42 In the DSmT theoretical framework, a Pignistic probability transformation function [49] presented by Dezert and Smarandache and denoted as DSmP is defined byIn the DSmP probability transformation function,
Additional decision-making approaches with belief functions can be found in [95].
4. Complex evidence theory for data fusion
Traditional evidence theory based on real numbers for data fusion has been found to not be applicable in some complex applications to represent data fluctuations at a given phase of time during their execution. Notably, CET, as a generalization of classical DSET, was presented by Xiao [72, 73] to be a solution. CET extends the classical DSET into the complex plane and is capable of modeling and handling uncertainty by means of complex numbers. The main concepts of CET are introduced below [72, 73].
4.1 CET for data fusion in a closed world
.
Definition 43 (Complex mass function) A complex mass function (CMF).
Definition 44 (Focal element in CET). LetComparison of Definitions 4-5 with Definitions 43-44 indicates that
The CBBA
in CET can be expressed by not only complex numbers but also positive real numbers, while𝕄 can only be expressed by positive real numbers in DSET.m In contrast to DSET, the value of
or| 𝕄 ( ψ j ) | represents the degree to which the evidence supports𝐦 ( ψ j ) .ψ j When the focal elements of
reduce to positive real numbers, the CBBA𝕄 in CET degrades into the classical BBA in DSET.𝕄
.
Definition 45 (Commitment degree in CET). The commitment degree.
Definition 46 (Generalized belief function in CET). A generalized belief function.
Definition 47 (Generalized plausibility function in CET). A generalized plausibility functionComparison of Definitions 6-7 with Definitions 46-47 indicates that the functions of
Similar to those in DSET,
andG B e l ( ψ j ) in CET are the lower and upper limit functions ofG P l ( ψ j ) , respectively.ψ j When focal elements of
reduce to positive real numbers,𝕄 andG B e l ( ψ j ) in CET degrade into the classicalG P l ( ψ j ) andB e l ( ψ j ) in DSET, respectively.P l ( ψ j )
.
Definition 48 (Complex evidence combination rule) LetSince
4.2 Generalized CET for data fusion in an open world
In this section, CET is generalised for data fusion in an open world, called as GCET. The basic concepts of GCET are presented as follows.
.
Definition 49 (Generalized complex mass function) A generalized complex mass function (GCMF).
Definition 50 (Focal element in GCET). LetComparison of Definitions 43-44 with Definitions 49-50 indicates that, in contrast to the CBBA in CET,
It is unnecessary for
in GCET, such that| 𝕄 ( ∅ ) | = 0 , while| 𝕄 ( ∅ ) | ≥ 0 must be equal to 0 in CET.| 𝕄 ( ∅ ) | can be a focal element as∅ in GCET, but| 𝕄 ( ∅ ) | > 0 cannot be a focal element in CET.∅ When
, it is utilized to model an open world in GCET, indicating that| 𝕄 ( ∅ ) | > 0 is a focal element or the union of focal elements not within the FOD, rather than the empty set in CBBA in CET.∅ When
, the GCBBA| 𝕄 ( ∅ ) | = 0 in GCET degrades into the CBBA in CET.𝕄
.
Definition 51 (Commitment degree in GCET). The commitment degree.
Definition 52 (Generalized belief function in GCET). A generalized belief function.
Definition 53 (Generalized plausibility function in GCET). A generalized plausibility functionComparison of Definitions 46-47 with Definitions 52-53 indicates that the functions of
Similar to CET,
andG B e l ( ψ j ) in GCET are the lower and upper limit functions ofG P l ( ψ j ) , respectively.ψ j It is unnecessary for
= 0 andG B e l ( ∅ ) = 0 in GCET, such thatG P l ( ∅ ) andG B e l ( ∅ ) ≥ 0 , whileG P l ( ∅ ) ≥ 0 andG B e l ( ∅ ) must be equal to 0 in CET.G P l ( ∅ ) When
,| 𝕄 ( ∅ ) | = 0 andG B e l ( ψ j ) in GCET degrade into theG P l ( ψ j ) andG B e l ( ψ j ) in CET, respectively.G P l ( ψ j )
.
Definition 54 (Generalized complex evidence combination rule) LetGCECR has the following characteristics:
When
, GCECR reduces to the CECR. Since𝕄 1 ⊕ ⋯ ⊕ 𝕄 q ⊕ ⋯ ⊕ 𝕄 t ( ∅ ) = 0 indicating a closed world and| 𝕄 ( ∅ ) | = 0 indicating an open world, the GCECR can merge arbitrary multiple GCBBAs to facilitate uncertainty reasoning for data fusion, both in the closed world and open world contexts.| 𝕄 ( ∅ ) | > 0 of GCBBAs are fused by the operation of orthogonal sum.𝕄 ( ∅ ) In Definition 54, the factor
is a process of normalization that is a generalization of1 1 − 𝕂 in Eq. (10) of the CECR.1 1 − 𝕂 When
,𝕄 1 ⊕ ⋯ ⊕ 𝕄 q ⊕ ⋯ ⊕ 𝕄 t ( ∅ ) = 0 in Eq. (20) reduces to𝕂 in Eq. (10).𝕂 If the sum of the GCBBAs of all nonempty sets is zero or GCECC is equal to 1, the whole belief is reallocated to
.∅
4.3 Analysis of the characteristics of CET and GCET
The CET and GCET inherit the merits of classical DSET and GET, respectively. They have the following attractive characteristics:
The structure of
in CET and GCET can model partial or complete ignorance using complex numbers rather than real numbers, enhancing its effectiveness in addressing uncertainty modelling challenges, particularly for signal and image data. Furthermore, in GCET,𝕄 can represent the uncertainty arising from the incompleteness of the FOD, enabling it to model uncertainty in an open world context.𝕄 The generalized belief function in CET and GCET also do not need experts to provide prior probabilities, in contrast to the Bayesian decision model.
The CECR satisfies axioms A1-A6: compositionality, commutativity, associativity, conditioning, internal symmetry, and autofunctionality, as does DRC shown in Table 1. Whereas, similar to Deng [48] and Jiang and Zhan [71]'s combination rules, GCECR satisfy axiom A4 when returning to a closed world, due to the processing of the empty set representing uncertainty in an open world.
The CECR in CET and GCECR in GCET adhere to the associative law and commutative law, providing flexible and straightforward approaches to uncertainty reasoning for the process of data fusion in the complex plane. In contrast to CECR, GCECR facilitates uncertainty reasoning not only in a closed world but also in an open world context.
The generalized interval
in CET and GCET, consisting of the generalized belief and plausibility functions, also provides upper and lower probabilities.[ G B e l ( ψ j ) , G P l ( ψ j ) ] When GCBBAs reduce to CBBAs, GCECR degenerates into CECR. Overall, CET offers an effective approach to uncertainty modelling and reasoning in a closed world context, whereas GCET is capable of uncertainty modelling and reasoning in both open world and closed world contexts.
| Characteristics | Evidence theories | ||||
| Traditional | Generalization | ||||
| DSET [26, 27] | DSmT [47] | GET [48] | CET [72, 73] | GCET | |
| Model partial or complete ignorance quantitatively | yes | yes | yes | yes | yes |
| Regardless of prior probabilities | yes | yes | yes | yes | yes |
| Reasoning | yes | yes | yes | yes | yes |
| Associative law | yes | yes | yes | yes | yes |
| Commutative law | yes | yes | yes | yes | yes |
| Upper and lower probabilities | yes | yes | yes | yes | yes |
| Hyper-/Super-power sets | no | yes | no | no | no |
| Closed world | yes | yes | yes | yes | yes |
| Open world | no | yes | yes | no | yes |
| Complex plane | no | no | no | yes | yes |
In some cases, GCET has greater capability than CET to model and handle the uncertainty problem in data fusion. Take the recent novel Coronavirus (COVID-19) as an example; this virus is distinctly beyond the FOD due to lack of human knowledge. In this situation, because of the exhaustiveness assumption of the FOD, CET is not applicable, whereas GCET can model a focal element outside of the FOD just for the case of COVID-19 to handle such uncertainty in the open world.
Besides, to compare CET and GCET with the typical theoretical frameworks DSET, DSmT, and GET, the characteristics are summarized in Table 2. All of these evidence theories 1) can model partial or complete ignorance quantitatively; 2) do not require prior probabilities; 3) have reasoning ability; 4) satisfy the associative law and commutative law; 5) can be regarded as upper and lower probabilities; and 6) can handle uncertainty in the fusion process in a closed world context. In addition, DSmT, GET and GCET can handle uncertainty in the fusion process in an open world. On the other hand, DSmT can handle uncertainty under the FOD modeled by a hyper-power set or super-power set rather than the power set, while CET and GCET can handle uncertainty in the fusion process on the complex plane.
5. Algorithm and application
Pattern classification has attracted much attention in recent years [96, 97]. In this section, we focus on the closed world, and present classical and complex evidence theory framework-based multisource data fusion algorithms. Then, we apply these fusion algorithms to pattern classification to demonstrate their practicabilities.
5.1 Evidence theory framework-based weighted multisource data fusion algorithms
In this section, classical evidence theory framework-based weighted multisource data fusion (ETF-WMSDF) algorithms are devised based on evidential distance, Pignistic probability distance, correlation coefficient, belief divergence, belief entropy, and belief information quality for decision making, respectively.
Problem statement: Let
-
Step 1: The weight for
is calculated as:m q Note that "Methods A and B" denote the weighted methods based on belief distance functions defined in Definitions 28 and 29, respectively; "Method C" denotes the weighted method based on belief divergence defined in Definition 31; "Method D" denotes the weighted method based on belief correlation coefficient defined in Definition 30; "Method E" denotes the weighted method based on Deng entropy defined in Definition 32; "Method F" denotes the weighted method based on information quality defined in Definition 33.Correspondingly, these ETF-WMSDF algorithms are denoted as ETF-WMSDFA, ETF-WMSDFB, ETF-WMSDFC, ETF-WMSDFD, ETF-WMSDFE, and ETF-WMSDFF.W ( m q ) = { t − 1 ∑ p = 1 t d J G B ( m q , m p ) , Method A ; t − 1 ∑ p = 1 t d i f B e t P ( m q , m p ) , Method B ; t − 1 ∑ p = 1 t D X ( m q , m p ) , Method C ; ∑ p = 1 t C J ( m q , m p ) − 1 , Method D ; e − E D ( m q ) , Method E ; I Q L D ( m q ) , Method F . -
Step 2: The weight of
is normalised as:m q W ~ ( m q ) = W ( m q ) ∑ q = 1 t W ( m q ) . -
Step 3: According to the normalised weight, a weighted average evidence
is generated as:m ~ m ~ = ∑ q = 1 t W ~ ( m q ) × m q . -
Step 4:
is fusedm ~ times with DCR to obtain a final BBA:t − 1 m ˙ = ( ( m ~ ⊕ m ~ ) 1 ⊕ ⋯ ⊕ m ~ ) ( t − 1 ) . -
Step 5: According to PPT function [78], we get:
Bet ( { ϕ i } ) = ∑ ϕ i ⊆ ψ j m ˙ ( ψ j ) | ψ j | . -
Step 6: The largest
is chosen by:Bet ( { ϕ φ } ) φ = arg max 1 ≤ i ≤ n { Bet ( { ϕ i } ) } . -
Step 7: The target is determined as:
{ if Bet ( { ϕ φ } ) ≥ δ , ϕ φ is the target , if Bet ( { ϕ φ } ) < δ , Cannot be determined .
The ETF-WMSDF Algorithm 1 is as follows.
Input:
;Θ = { ϕ 1 , … , ϕ i , … , ϕ n } ; Thresholdℳ = { m 1 , … , m q , … , m t } ;δ Output: A decision;
for
doq = 1 ; q ≤ t Calculate the weight
forW ( m q ) by Eq. (1);m q end for
for
doq = 1 ; q ≤ t Obtain the normalised weight
by Eq. (2);W ~ ( m q ) end for
for
doq = 1 ; q ≤ t Calculate a weighted average evidence
by Eq. (3);m ~ end for
Obtain the fused BBA
by Eq. (4);m ˙ for
doi = 1 ; i ≤ n Calculate
by Eq. (5);Bet ( { ϕ i } ) end for
Select
by Eq. (6);φ = arg max 1 ≤ i ≤ n { Bet ( { ϕ i } ) } if
thenBet ( { ϕ φ } ) ≥ δ The target
← ;ϕ φ else
Cannot be determined.
end if
5.2 Complex evidence theory framework-based multisource data fusion algorithms
In this section, a complex evidence theory framework-based multisource data fusion (CETF- MSDF) algorithm is introduced [98].
Problem statement: Let
-
Step 1: The CBBAs of
are fused with CECR to obtain a final CBBA:𝕄 𝕄 ˙ = ( ( 𝕄 1 ⊕ 𝕄 ) 2 ⊕ ⋯ ⊕ 𝕄 t ) . -
Step 2: According to CPPT function [98], we get:
CBet ( { ϕ i } ) = ∑ ϕ i ⊆ ψ j 𝕄 ˙ ( ψ j ) | ψ j | . -
Step 3: The largest
is chosen by:CBet ( { ϕ φ } ) φ = arg max 1 ≤ i ≤ n { CBet ( { ϕ i } ) } . -
Step 4: The target is determined as:
{ if CBet ( { ϕ φ } ) ≥ δ , ϕ φ is the target , if CBet ( { ϕ φ } ) < δ , Cannot be determined .
The CETF-MSDF Algorithm 2 is as follows.
5.3 Application to pattern classification
In this section, the ETF-WMSDF and CETF-MSDF algorithms are applied to pattern classification to demonstrate their practicabilities. Then, the ETF-WMSDF and CETF-MSDF algorithms are compared with related well-known works to reveal their performances.
5.3.1 Descriptions of the datasets
In this section, the performances of the ETF-WMSDF and CETF-MSDF algorithms are validated over five real-world datasets from the UCI machine learning repository (http://archive.ics.uci.edu/ml/).
| Dataset | ETF-based weighted multisource data fusion algorithms |
|
|||||||
| ETF-WMSDFA | ETF-WMSDFB | ETF-WMSDFC | ETF-WMSDFD | ETF-WMSDFE | ETF-WMSDFF | CETF-MSDF | |||
| Iris | 95.33 |
96.00 |
96.00 |
96.67 |
96.00 |
96.67 |
96.67 |
||
| Wine | 93.75 |
94.23 |
93.87 |
95.41 |
96.47 |
96.47 |
97.65 |
||
| Heart | 83.70 |
83.33 |
83.33 |
84.44 |
84.44 |
84.81 |
87.41 |
||
| Parkinson's | 83.65 |
83.18 |
82.66 |
79.62 |
80.02 |
80.02 |
83.00 |
||
| Australian | 82.77 |
83.21 |
83.08 |
87.38 |
86.35 |
85.50 |
88.84 |
||
| Average | 87.84 |
87.99 |
87.79 |
88.70 |
88.66 |
88.69 |
90.71 |
||
| Dataset | Classifiers | ETF-based fusion methods | ||||||||||||||
| NaB | NMC | kNN | REPTree | SVM | SVM-RBF | MlP | RBFN | kNN-DST | NDC | EvC |
|
|
||||
| Iris | 94.67% | 90.67% | 95.33% | 92.00% | 94.67% | 94.67% | 93.33% | 92.67% | 95.33% | 94.00% | 94.67% | 96.67% | 96.67% | |||
| Wine | 95.51% | 70.44% | 70.19% | 84.92% | 96.62% | 96.63% | 94.93% | 95.49% | 93.84% | 96.63% | 97.17% | 95.41% | 97.65% | |||
| Heart | 82.59% | 60.37% | 57.78% | 70.74% | 83.70% | 82.96% | 75.19% | 81.85% | 76.30% | 82.59% | 83.70% | 84.44% | 87.41% | |||
| Parkinson's | 68.75% | 70.77% | 83.02% | 80.94% | 70.13% | 81.03% | 74.39% | 82.05% | 78.01% | 70.26% | 81.64% | 79.62% | 83.00% | |||
| Australian | 79.56% | 64.21% | 67.40% | 80.59% | 80.29% | 79.86% | 82.32% | 82.61% | 78.41% | 80.01% | 80.60% | 87.38% | 88.84% | |||
| Average | 80.47% | 67.63% | 73.85% | 79.18% | 81.75% | 83.06% | 80.03% | 84.10% | 81.72% | 81.25% | 84.10% | 88.70% | 90.71% | |||
| Std | 8.46% | 13.00% | 13.50% | 7.71% | 7.89% | 6.15% | 7.26% | 4.30% | 6.94% | 7.62% | 5.44% | 6.49% | 5.61% | |||
5.3.2 Implementation of ETF-WMSDF and CETF-MSDF algorithms
In this experiment, each attribute from a dataset is considered as independent source to provide information. The missing values can be modelled as "complete ignorance" by
Then, with reference to each testing instance, the ETF-WMSDF and CETF-MSDF algorithms are applied to fuse these generated BBAs and CBBAs, respectively, and classify the testing instance to a certain pattern. In addition, a five-fold cross validation is carried out: 80% of each dataset are randomly selected as training instances, while the rest of 20% of each dataset serves as the testing instances. We repeat this process five times, and average the accuracies of all classes, in which the results are summarized in Table 3.
It is noticed that the average classification accuracy generated by ETF-WMSDFA, ETF-WMSDFB, ETF-WMSDFC, ETF-WMSDFD, ETF-WMSDFE, ETF-WMSDFF, and CETF-MSDF algorithms over the five UCI datasets are 87.84
5.3.3 Comparison
The ETF-WMSDF and CETF-MSDF algorithms with the best performance are compared with several well-known related works to verify their performances: 1) state-of-the-art classifiers: Naïve Bayes (NaB) [99], nearest mean classifier (NMC) [100], k-nearest neighbor (kNN) [101], Decision Tree (REPTree) [102], support vector machine (SVM) [103], SVM with radial basis function (SVM-RBF) [103], multilayer perceptron (MlP) [104], and RBF network (RBFN) [105], and 2) evidence theory framework-based fusion methods: k-nearest neighbor DS theory (kNN-DST) [106], normal distribution-based classifier (NDC) [107], and evidential calibration (EvC) [62].
| Dataset | Classifiers | ETF-based fusion methods | ||||||||||||||
| NaB | NMC | kNN | REPTree | SVM | SVM-RBF | MlP | RBFN | kNN-DST | NDC | EvC |
|
|
||||
| Iris | 2.00% | 6.00% | 1.34% | 4.67% | 2.00% | 2.00% | 3.34% | 4.00% | 1.34% | 2.67% | 2.00% | 0.00% | 0.00% | |||
| Wine | 2.14% | 27.21% | 27.46% | 12.73% | 1.03% | 1.02% | 2.72% | 2.16% | 3.81% | 1.02% | 0.48% | 2.24% | 0.00% | |||
| Heart | 4.82% | 27.04% | 29.63% | 16.67% | 3.71% | 4.45% | 12.22% | 5.56% | 11.11% | 4.82% | 3.71% | 2.97% | 0.00% | |||
| Parkinson's | 14.27% | 12.25% | 0.00% | 2.08% | 12.89% | 1.99% | 8.63% | 0.97% | 5.01% | 12.76% | 1.38% | 3.40% | 0.02% | |||
| Australian | 9.28% | 24.63% | 21.44% | 8.25% | 8.55% | 8.98% | 6.52% | 6.23% | 10.43% | 8.83% | 8.24% | 1.46% | 0.00% | |||
| Accumulate | 32.50% | 97.12% | 79.86% | 44.39% | 28.17% | 18.43% | 33.42% | 18.91% | 31.69% | 30.09% | 15.80% | 10.07% | 0.02% | |||
In this comparison experiment, the same five-fold cross validation is carried out. The results of classification accuracies in terms of different datasets obtained by different methods are shown in Table 4, in which the optimal performance is highlighted in bold. The ETF-WMSDFD algorithm has classification accuracies: 96.67%, 95.41%, 84.44%, 79.62%, and 87.38% in terms of the Iris, Wine, Heart, Parkinson's and Australian datasets, respectively. The CETF-MSDF algorithm has classification accuracies: 96.67%, 97.65%, 87.41%, 83.00%, and 88.84% in terms of the Iris, Heart, Hepatitis, Parkinson's and Australian datasets, respectively. The CETF-MSDF algorithm obviously outperforms ETF-WMSDFD algorithm as well as other well-known methods for all but the Parkinson's dataset. For five UCI datasets, the NaB, NMC, kNN, REPTree, SVM, SVM-RBF, MlP, RBFN, kNN-DST, NDC, EvC, and ETF-WMSDFD algorithms have the following average classification accuracies: 84.22%
In addition, the differences between the average classification accuracy of each method and that of the optimal performance are calculated in Table 5. The differences across five datasets are accumulated for further evaluation of their relative performance. As shown in Figure 2, the overall accumulated difference across the five datasets for the CETF-MSDF algorithm is only 0.02%. However, the NaB, NMC, kNN, REPTree, SVM, SVM-RBF, MlP, RBFN, kNN-DST, NDC, EvC and ETF-WMSDFD algorithms have total accumulated differences of 32.50%, 97.12%, 79.86%, 44.39%, 28.17%, 18.43%, 33.42%, 18.91%, 31.69%, 30.09%, 15.80% and 10.07%, respectively. These results reveal that the superiority of the CETF-MSDF algorithm.
The proposed CETF-MSDF algorithm outperforms other methods because it utilizes CBBA to effectively model and enhance data features through the phase parameter on the occasion of generating CBBAs. To be specific, by fusing appropriate CBBAs expressed by complex numbers, constructive interference will be produced to strengthen modeling data features. Nevertheless, the computational complexity of the CETF-MSDF algorithm is higher comparing with these related methods, which limits its applicability in real-time scenarios. However, in scenarios where accuracy is critical or in certain complex number-based situations, the proposed CETF-MSDF algorithm is the preferred choice. To further improve the processing efficiency of the proposed algorithm for real-time applications, the phase can be modeled in an interpretable manner at an early stage, just like the magnitude.
6. Challenges and open future research directions
In this section, several challenges and open future research directions are summarized and discussed.
6.1 (C)BBA generation with large, heterogeneous and multi-modal data
As science and technology continue to develop, many applications have become heterogeneous sensor-based, so large, heterogeneous and multi-modal data are inevitable [108]. These data have the characteristics of large volume, high variety, but low value. How to generate appropriate BBAs, and even CBBAs, with these large, heterogeneous and multi-modal data to facilitate decisions is a challenging problem in data fusion. It remains an open issue to fuse such heterogeneous and multidimension data.
6.2 Combination of dependent evidence
As discussed in a previous section, DRC in DSET and the CECR in CET require independence among multiple pieces of evidence. However, dependency among some types of data is unavoidable. Several researchers have studied the fusion of dependent evidence from the perspective of combination rule modification and belief structure improvement, but some limitations of these methods restrict their application in data fusion. Therefore, it is necessary to consider how to determine the degree of dependence and how to develop the classical and complex evidence combination rules to fuse dependent evidence.
6.3 Conflict management
In evidence theory, when fusing conflicting data, the classical DRC and CECR may generate counterintuitive results, which impacts the effectiveness of these combination rules in real-world applications of data fusion. In recent decades, the problem of conflict management in the classical DRC has been extensively studied and discussed. Various methods have been proposed, as discussed in Section 3. Careful summary and analysis indicates that no distinct conclusions have been reached. Different applications may require different or even hybrid solutions to manage the conflicting information according to the specific situation, especially for large, heterogeneous and multi-modal data. Furthermore, the current presentation of the CECR in CET requires new strategies to measure and manage conflicts from multisource data modeled in a complex plane.
6.4 Open world
The other limitation of evidence theory is that its FOD must be fully complete. However, in real-world applications, the targets to be detected may be unknown, for example, the detection of unknown diseases, recognition of aircraft types, and classification of unknown elements. Recently, Xiao [109] proposes a generalized quantum evidence theory (GQET) based on a quantum mechanical framework. GQET can facilitate the uncertainty reasoning of data fusion not only in the closed world (when the squared amplitude of the generalized quantum basic belief assignment for the empty set is zero
6.5 Complexity and real time
In evidence theory, as the number of FODs increases, the power set of the FOD will increase exponentially; how to fuse evidence under an FOD with an abundance of elements remains an open issue. On the other hand, big data has the characteristic of high velocity. The integration of evidence theory for real-time applications requires effective solutions. Our recent work has proposed a novel quantum Dempster's rule of combination, which constructs quantum circuits using quantum logical gates, significantly reducing the computational complexity of Dempster's rule of combination without information loss. This approach offers a promising way to handle such complexity and real-time challenges, making it worthy of further investigation.
7. Conclusion
In this paper, we conducted a comprehensive review of the literature on Dempster–Shafer evidence theory (DSET) for data fusion. We first introduced the basis concepts and knowledge of classical DSET and studied the axioms of Dempster's rule of combination and the characteristics and restraints of DSET. We further provided a review of the classical DSET and its extensions, collectively referred to as classical evidence theory, for data fusion from three aspects, namely, uncertainty modeling, fusion, and decision making. Particularly, in the fusion section, three main kinds of solutions for evidence theory-based data fusion were summarized, including evidential combination rule-based data fusion, evidence pretreatment-based data fusion, and other hybrid evidential conflict models for data fusion, and the typical methods and techniques were described. Next, we studied complex evidence theory for data fusion that benefits from the frame of complex plane modelling in both closed world and open world contexts. Furthermore, we presented classical and complex evidence theory framework-based multisource data fusion algorithms, which were applied to pattern classification. Through comparison with other well-known methods, complex evidence theory framework-based multisource data fusion algorithm showed its superiority to handle pattern classification problem in the complex plane. It also revealed the applicability and limitation of complex evidence theory framework-based multisource data fusion algorithm. On the basis of this survey, analysis and discussion, we present a number of challenges and open issues to help guide future research directions on evidence theory-based data fusion.
Data Availability Statement
Funding
Conflicts of Interest
Ethical Approval and Consent to Participate
References
- Zhang, L., Xie, Y., Xidao, L., & Zhang, X. (2018, May). Multi-source heterogeneous data fusion. In 2018 International conference on artificial intelligence and big data (ICAIBD) (pp. 47-51). IEEE.
- Albahri, A. S., Duhaim, A. M., Fadhel, M. A., Alnoor, A., Baqer, N. S., Alzubaidi, L., ... & Deveci, M. (2023). A systematic review of trustworthy and explainable artificial intelligence in healthcare: Assessment of quality, bias risk, and data fusion. Information Fusion, 96, 156-191.
- Lai, J. W., Chang, J., Ang, L. K., & Cheong, K. H. (2020). Multi-level information fusion to alleviate network congestion. Information Fusion, 63, 248-255.
- Yager, R. R. (2004). A framework for multi-source data fusion. Information Sciences, 163(1-3), 175-200.
- Yang, J. B., Xu, D. L., Xu, X., & Fu, C. (2023). Likelihood analysis of imperfect data. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53(8), 5046-5057.
- Cao, B., Li, C., Song, Y., Qin, Y., & Chen, C. (2022). Network intrusion detection model based on CNN and GRU. Applied Sciences, 12(9), 4184.
- Miao, W., Xu, Z., Geng, J., & Jiang, W. (2023). ECAE: Edge-aware class activation enhancement for semisupervised remote sensing image semantic segmentation. IEEE Transactions on Geoscience and Remote Sensing, 61, 1-14.
- Judah, A., & Hu, B. (2022). An advanced data fusion method to improve wetland classification using multi-source remotely sensed data. Sensors, 22(22), 8942.
- Charte, D., Charte, F., García, S., del Jesus, M. J., & Herrera, F. (2018). A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines. Information Fusion, 44, 78-96.
- Li, T., Song, Y., & Fan, H. (2023). From target tracking to targeting track: A data-driven yet analytical approach to joint target detection and tracking. Signal Processing, 205, 108883.
- Deng, X., Xue, S., & Jiang, W. (2023). A novel quantum model of mass function for uncertain information fusion. Information Fusion, 89, 619-631.
- Hussain, L. A., Singh, S., Mizouni, R., Otrok, H., & Damiani, E. (2023). A predictive target tracking framework for IoT using CNN–LSTM. Internet of Things, 22, 100744.
- Kang, B., & Zhao, C. (2024). Deceptive evidence detection in information fusion of belief functions based on reinforcement learning. Information Fusion, 103, 102102.
- Liu, Z., Chen, F., Xu, J., Pei, W., & Lu, G. (2022). Image-text retrieval with cross-modal semantic importance consistency. IEEE Transactions on Circuits and Systems for Video Technology, 33(5), 2465-2476.
- Wang, X., Zhu, D., Li, G., Zhang, X. P., & He, Y. (2022). Proposal-Copula-Based Fusion of Spaceborne and Airborne SAR Images for Ship Target Detection⁎⁎. Information Fusion, 77, 247-260.
- Chenghai, L. I., Ke, W. A. N. G., Yafei, S. O. N. G., Peng, W. A. N. G., & Lemin, L. I. (2024). Air target intent recognition method combining graphing time series and diffusion models. Chinese Journal of Aeronautics.
- Zhang, Y., Wang, X., Jiang, Z., Li, G., & He, Y. (2022). An efficient center-based method with multilevel auxiliary supervision for multiscale SAR ship detection. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 15, 7065-7075.
- Lau, B. P. L., Marakkalage, S. H., Zhou, Y., Hassan, N. U., Yuen, C., Zhang, M., & Tan, U. X. (2019). A survey of data fusion in smart city applications. Information Fusion, 52, 357-374.
- Ding, W., Jing, X., Yan, Z., & Yang, L. T. (2019). A survey on data fusion in internet of things: Towards secure and privacy-preserving fusion. Information Fusion, 51, 129-144.
- Deng, X., Jiang, Y., Yang, L. T., Lin, M., Yi, L., & Wang, M. (2019). Data fusion based coverage optimization in heterogeneous sensor networks: A survey. Information Fusion, 52, 90-105.
- Ghamisi, P., Rasti, B., Yokoya, N., Wang, Q., Hofle, B., Bruzzone, L., ... & Benediktsson, J. A. (2019). Multisource and multitemporal data fusion in remote sensing: A comprehensive review of the state of the art. IEEE Geoscience and Remote Sensing Magazine, 7(1), 6-39.
- Meng, T., Jing, X., Yan, Z., & Pedrycz, W. (2020). A survey on machine learning for data fusion. Information Fusion, 57, 115-129.
- El Fissaoui, M., Beni-hssane, A., Ouhmad, S., & El Makkaoui, K. (2021). A survey on mobile agent itinerary planning for information fusion in wireless sensor networks. Archives of computational methods in engineering, 28(3), 1323-1334.
- Zhang, Y., Jiang, C., Yue, B., Wan, J., & Guizani, M. (2022). Information fusion for edge intelligence: A survey. Information Fusion, 81, 171-186.
- Xinde, L. I., DUNKIN, F., & DEZERT, J. (2023). Multi-source information fusion: Progress and future. Chinese Journal of Aeronautics.
- Dempster, A. P. (2008). Upper and lower probabilities induced by a multivalued mapping. In Classic works of the Dempster-Shafer theory of belief functions (pp. 57-72). Berlin, Heidelberg: Springer Berlin Heidelberg.
- Shafer, G. et~al. (1976). A mathematical theory of evidence, volume~1. Princeton University Press Princeton.
- Zhang, Z., Ye, S., Zhang, Y., Ding, W., & Wang, H. (2022). Belief combination of classifiers for incomplete data. IEEE/CAA Journal of Automatica Sinica, 9(4), 652-667.
- Fujita, H., & Ko, Y. C. (2020). A heuristic representation learning based on evidential memberships: Case study of UCI-SPECTF. International Journal of Approximate Reasoning, 120, 125-137.
- Yager, R.~R. and Liu, L. (2008). Classic works of the Dempster--Shafer theory of belief functions, volume 219. Springer.
- Xiong, L., Su, X., & Qian, H. (2021). Conflicting evidence combination from the perspective of networks. Information Sciences, 580, 408-418.
- Liu, P., Li, Y., Zhang, X., & Pedrycz, W. (2022). A multiattribute group decision-making method with probabilistic linguistic information based on an adaptive consensus reaching model and evidential reasoning. IEEE Transactions on Cybernetics, 53(3), 1905-1919.
- Xu, X., Zheng, J., Yang, J. B., Xu, D. L., & Chen, Y. W. (2017). Data classification using evidence reasoning rule. Knowledge-Based Systems, 116, 144-151.
- Tang, S. W., Zhou, Z. J., Hu, C. H., Yang, J. B., & Cao, Y. (2019). Perturbation analysis of evidential reasoning rule. IEEE transactions on systems, man, and cybernetics: systems, 51(8), 4895-4910.
- Zhang, B., Zhang, Y., Hu, G., Zhou, Z., Wu, L., & Lv, S. (2020). A method of automatically generating initial parameters for large-scale belief rule base. Knowledge-Based Systems, 199, 105904.
- Fu, C., Hou, B., Xue, M., Chang, L., & Liu, W. (2022). Extended belief rule-based system with accurate rule weights and efficient rule activation for diagnosis of thyroid nodules. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53(1), 251-263.
- Zhou, Z. J., Hu, G. Y., Hu, C. H., Wen, C. L., & Chang, L. L. (2019). A survey of belief rule-base expert system. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 51(8), 4944-4958.
- Chang, L., Zhang, L., Fu, C., & Chen, Y. W. (2021). Transparent digital twin for output control using belief rule base. IEEE Transactions on cybernetics, 52(10), 10364-10378.
- Cao, Y., Zhou, Z., Hu, C., He, W., & Tang, S. (2020). On the interpretability of belief rule-based expert systems. IEEE Transactions on Fuzzy Systems, 29(11), 3489-3503.
- Xu, X., Guo, H., Zhang, Z., Shi, P., Huang, W., Li, X., & Brunauer, G. (2024). Fault diagnosis method via one vs rest evidence classifier considering imprecise feature samples. Applied Soft Computing, 161, 111761.
- Xu, X., Guo, H., Zhang, Z., Yu, S., Chang, L., Steyskal, F., & Brunauer, G. (2024). A cloud model-based interval-valued evidence fusion method and its application in fault diagnosis. Information Sciences, 658, 119995.
- Chen, X., & Deng, Y. (2024). Evidential software risk assessment model on ordered frame of discernment. Expert Systems with Applications, 250, 123786.
- Zhou, M., Zheng, Y. Q., Chen, Y. W., Cheng, B. Y., Herrera-Viedma, E., & Wu, J. (2023). A large-scale group consensus reaching approach considering self-confidence with two-tuple linguistic trust/distrust relationship and its application in life cycle sustainability assessment. Information Fusion, 94, 181-199.
- Fei, L., Liu, X., & Zhang, C. (2024). An evidential linguistic ELECTRE method for selection of emergency shelter sites. Artificial Intelligence Review, 57(4), 81.
- Zadeh, L. A. (1986). A simple view of the Dempster-Shafer theory of evidence and its implication for the rule of combination. AI magazine, 7(2), 85-85.
- Smets, P. (2002). The combination of evidence in the transferable belief model. IEEE Transactions on pattern analysis and machine intelligence, 12(5), 447-458.
- Dezert, J., & Smarandache, F. (2006). DSmT: A new paradigm shift for information fusion. Infinite Study.
- Deng, Y. (2015). Generalized evidence theory. Applied Intelligence, 43(3), 530-543.
- Smarandache, F., Dezert, J., & Tchamova, A. (Eds.). (2023). Advances and Applications of DSmT for Information Fusion (Collected Works. Volume 5).
- Deng, Y. (2022). Random permutation set. International Journal of Computers Communications & Control, 17(1).
- Deng, J., Deng, Y., & Yang, J. B. (2024). Random permutation set reasoning. IEEE Transactions on Pattern Analysis and Machine Intelligence.
- Deng, X., & Jiang, W. (2023). A framework for the fusion of non-exclusive and incomplete information on the basis of D number theory. Applied Intelligence, 53(10), 11861-11884.
- Kouatli, I. (2020). The use of fuzzy logic as augmentation to quantitative analysis to unleash knowledge of participants’ uncertainty when filling a survey: case of cloud computing. IEEE Transactions on Knowledge and Data Engineering, 34(3), 1489-1500.
- Akcora, C. G., Gel, Y. R., Kantarcioglu, M., Lyubchich, V., & Thuraisingham, B. (2019). Graphboot: Quantifying uncertainty in node feature learning on large networks. IEEE Transactions on Knowledge and Data Engineering, 33(1), 116-127.
- Fei, L., & Wang, Y. (2022). An optimization model for rescuer assignments under an uncertain environment by using Dempster–Shafer theory. Knowledge-Based Systems, 255, 109680.
- An, L., Li, M., Boudaren, M. E. Y., & Pieczynski, W. (2018). Unsupervised segmentation of hidden Markov fields corrupted by correlated non-Gaussian noise. International journal of approximate reasoning, 102, 41-59.
- Zhang, Z. W., Liu, Z. G., Martin, A., & Zhou, K. (2022). BSC: Belief shift clustering. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53(3), 1748-1760.
- Denoeux, T. (2021). NN-EVCLUS: Neural network-based evidential clustering. Information Sciences, 572, 297-330.
- Zhou, K., Martin, A., Pan, Q., & Liu, Z. (2018). SELP: Semi-supervised evidential label propagation algorithm for graph data clustering. International Journal of Approximate Reasoning, 92, 139-154.
- He, H., Han, D., & Dezert, J. (2020). Disagreement based semi-supervised learning approaches with belief functions. Knowledge-Based Systems, 193, 105426.
- Antoine, V., Guerrero, J. A., & Xie, J. (2021). Fast semi-supervised evidential clustering. International Journal of Approximate Reasoning, 133, 116-132.
- Xu, P., Davoine, F., Zha, H., & Denoeux, T. (2016). Evidential calibration of binary SVM classifiers. International Journal of Approximate Reasoning, 72, 55-70.
- Denœux, T. (2019). Logistic regression, neural networks and Dempster–Shafer theory: A new perspective. Knowledge-Based Systems, 176, 54-67.
- Tong, Z., Xu, P., & Denoeux, T. (2021). An evidential classifier based on Dempster-Shafer theory and deep learning. Neurocomputing, 450, 275-293.
- Yager, R. R. (1987). On the Dempster-Shafer framework and new combination rules. Information sciences, 41(2), 93-137.
- Dubois, D., & Prade, H. (1988). Representation and combination of uncertainty with belief functions and possibility measures. Computational intelligence, 4(3), 244-264.
- Inagaki, T. (2002). Interdependence between safety-control policy and multiple-sensor schemes via Dempster-Shafer theory. IEEE Transactions on Reliability, 40(2), 182-188.
- Lefevre, E., Colot, O., & Vannoorenberghe, P. (2002). Belief function combination and conflict management. Information fusion, 3(2), 149-162.
- Zhang, L. (1994). Representation, independence, and combination of evidence in the Dempster-Shafer theory. In Advances in the Dempster-Shafer theory of evidence (pp. 51-69). https://dl.acm.org/doi/abs/10.5555/186965.186968
- Mahler, R. P. (1996). Combining ambiguous evidence with respect to ambiguous a priori knowledge. I. Boolean logic. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, 26(1), 27-41.
- Jiang, W., & Zhan, J. (2017). A modified combination rule in generalized evidence theory. Applied Intelligence, 46(3), 630-640.
- Xiao, F. (2020). Generalization of Dempster–Shafer theory: A complex mass function. Applied Intelligence, 50(10), 3266-3275.
- Xiao, F. (2020). Generalized belief function in complex evidence theory. Journal of Intelligent & Fuzzy Systems, 38(4), 3665-3673.
- Chen, X., & Deng, Y. (2023). A novel combination rule for conflict management in data fusion. Soft Computing, 27(22), 16483-16492.
- Jousselme, A. L., Grenier, D., & Bossé, É. (2001). A new distance between two bodies of evidence. Information fusion, 2(2), 91-101.
- Jousselme, A. L., & Maupin, P. (2012). Distances in evidence theory: Comprehensive survey and generalizations. International Journal of Approximate Reasoning, 53(2), 118-145.
- Han, D., Dezert, J., & Yang, Y. (2016). Belief interval-based distance measures in the theory of belief functions. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 48(6), 833-850.
- Smets, P., & Kennes, R. (1994). The transferable belief model. Artificial intelligence, 66(2), 191-234.
- Liu, W. (2006). Analyzing the degree of conflict among belief functions. Artificial intelligence, 170(11), 909-924.
- Jiang, W. (2018). A correlation coefficient for belief functions. International Journal of Approximate Reasoning, 103, 94-106.
- Deng, Y. (2020). Uncertainty measure in evidence theory. Science China Information Sciences, 63(11), 210201.
- Abellán, J. (2017). Analyzing properties of Deng entropy in the theory of evidence. Chaos, Solitons & Fractals, 95, 195-199.
- Deng, Y. (2020). Information volume of mass function. International Journal of Computers Communications & Control, 15(6).
- Liao, H., Ren, Z., & Fang, R. (2020). A Deng-entropy-based evidential reasoning approach for multi-expert multi-criterion decision-making with uncertainty. International Journal of Computational Intelligence Systems, 13(1), 1281-1294.
- Zhao, T., Li, Z., & Deng, Y. (2024). Linearity in Deng entropy. Chaos, Solitons & Fractals, 178, 114388.
- Cui, Y., & Deng, X. (2023). Plausibility entropy: A new total uncertainty measure in evidence theory based on plausibility function. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53(6), 3833-3844.
- Qiang, C., Deng, Y., & Cheong, K. H. (2022). Information fractal dimension of mass function. Fractals, 30(06), 2250110.
- Zhu, L., Zhou, Q., Deng, Y., & Cheong, K. H. (2024). Fractal-based basic probability assignment: A transient mass function. Information Sciences, 652, 119767.
- Li, D., Deng, Y., & Cheong, K. H. (2021). Multisource basic probability assignment fusion based on information quality. International Journal of Intelligent Systems, 36(4), 1851-1875.
- Daniel, M. (2010, June). Conflicts within and between belief functions. In International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (pp. 696-705). Berlin, Heidelberg: Springer Berlin Heidelberg.
- Lefevre, E., & Elouedi, Z. (2013). How to preserve the conflict as an alarm in the combination of belief functions?. Decision Support Systems, 56, 326-333.
- Abellán, J., & Bossé, É. (2016). Drawbacks of uncertainty measures based on the pignistic transformation. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 48(3), 382-388.
- Martin, L., & Sudano, J. J. (2006, July). Yet another paradigm illustrating evidence fusion (YAPIEF). In 2006 9th international conference on information fusion (pp. 1-7). IEEE.
- Cuzzolin, F. (2007). Two new Bayesian approximations of belief functions based on convex geometry. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 37(4), 993-1008.
- Han, D., Dezert, J., & Duan, Z. (2015). Evaluation of probability transformations of belief functions for decision making. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 46(1), 93-108.
- Liu, Z. G., Fu, Y. M., Pan, Q., & Zhang, Z. W. (2022). Orientational distribution learning with hierarchical spatial attention for open set recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(7), 8757-8772.
- Liu, Z. G., Qiu, G. H., Wang, S. Y., Li, T. C., & Pan, Q. (2021). A new belief-based bidirectional transfer classification method. IEEE Transactions on Cybernetics, 52(8), 8101-8113.
- Xiao, F., & Pedrycz, W. (2022). Negation of the quantum mass function for multisource quantum information fusion with its application to pattern classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(2), 2054-2070.
- Hu, B. G. (2013). What are the differences between Bayesian classifiers and mutual-information classifiers?. IEEE transactions on neural networks and learning systems, 25(2), 249-264.
- Veenman, C. J., & Reinders, M. J. (2005). The nearest subclass classifier: A compromise between the nearest mean and nearest neighbor classifier. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(9), 1417-1429.
- Cover, T., & Hart, P. (1967). Nearest neighbor pattern classification. IEEE transactions on information theory, 13(1), 21-27.
- Freund, Y., & Mason, L. (1999, June). The alternating decision tree learning algorithm. In icml (Vol. 99, pp. 124-133).
- Chang, C. C., & Lin, C. J. (2011). LIBSVM: A library for support vector machines. ACM transactions on intelligent systems and technology (TIST), 2(3), 1-27.
- Castro, C. L., & Braga, A. P. (2013). Novel cost-sensitive approach to improve the multilayer perceptron performance on imbalanced data. IEEE transactions on neural networks and learning systems, 24(6), 888-899.
- CHEN S, G. M., & Grant, P. M. (1991). Orthogonal Least Squares Learning Algorithm for Radial Basis Function Networks. IEEE Transactions on Neural Networks, 2(2), 302-309.
- Denoeux, T. (1995). A k-nearest neighbor classification rule based on Dempster-Shafer theory. IEEE transactions on systems, man, and cybernetics, 25(5), 804-813.
- Xu, P., Deng, Y., Su, X., & Mahadevan, S. (2013). A new method to determine basic probability assignment from training data. Knowledge-Based Systems, 46, 69-80.
- Wang, Y. M., Pan, X. H., He, S. F., Dutta, B., García-Zamora, D., & Martínez, L. (2022). A new decision-making framework for site selection of electric vehicle charging station with heterogeneous information and multigranular linguistic terms. IEEE Transactions on Fuzzy Systems, 31(2), 485-499.
- Xiao, F. (2023). Generalized quantum evidence theory. Applied Intelligence, 53(11), 14329-14344.
Cite This Article
Article Metrics
Citations:Article Access Statistics:
Publisher's Note
ICCK stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and Permissions
Copyright © 2024 by the Author(s). Published by Institute of Central Computation and Knowledge.
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.