Symbol | Meaning
---|---
$\Theta$ | Frame of discernment (FOD)
$2^\Theta$ | Power set of a FOD
$m$ | Mass function
$Bel$ | Belief function
$Pl$ | Plausibility function
$Pl_P(m)$ | Plausibility transformation of a mass function
$U(m)$ | Dezert's entropy of a mass function
$CH_{DD}(m_1, m_2)$ | Dezert and Dambreville's cross entropy
$D_{DD}(m_1 \Vert m_2)$ | Dezert and Dambreville's relative entropy
$E_d(m)$ | Deng's entropy of a mass function
$CH_G(m_1, m_2)$ | Gao et al.'s cross entropy
$H_{betP}(m)$ | Pignistic entropy of a mass function
$CH_{betP}(m_1, m_2)$ | Cross pignistic entropy
$H_Y(m)$ | Yager entropy of a mass function
$CH_Y(m_1, m_2)$ | Cross Yager entropy
$H_{pl}(m)$ | Plausibility entropy of a mass function
$CH_{pl}(m_1, m_2)$ | Cross plausibility entropy
$D_{pl}(m_1 \Vert m_2)$ | Relative plausibility entropy
$H(P)$ | Shannon's entropy of a probability distribution
$H(P, Q)$ | Shannon's cross entropy
$D(P \Vert Q)$ | Shannon's relative entropy
How to represent and measure uncertainty is one of the central issues in information sciences. In general, uncertainty can be broadly classified into random uncertainty and epistemic uncertainty [33]. Thanks to Shannon's innovative contributions [1], quantifying the randomness of uncertain information has been addressed through the mathematical framework of information theory. Dempster-Shafer theory [2, 3], also known as belief function theory, is a widely used framework for representing information with epistemic uncertainty, in which a mathematical structure called the mass function is provided to simultaneously describe the discord and non-specificity involved in the given information [4]. However, measuring the uncertainty of mass functions has not yet been well solved; in particular, there is still no consensus on the definitions of entropy, cross entropy, and relative entropy of mass functions.
With respect to the entropy of mass functions, many researchers have paid attention to this problem [5, 6, 7]. Klir [8] proposed generalized information theory (GIT), which aims to generalize Shannon's information theory for probabilities to various uncertainty theories, including imprecise probabilities, fuzzy sets, belief functions, and so on. However, the aggregated uncertainty (AU) proposed in GIT for mass functions has some shortcomings [9]; in particular, some of the underlying axiomatic requirements of AU have been challenged [10]. In recent years, with the proposal of Deng's entropy [11], a new entropy measure for calculating the uncertainty of a mass function, the research on uncertainty measures in Dempster-Shafer theory has welcomed a "strong resurgence" [12]. Many novel entropy definitions of mass functions have been put forward [13, 14, 15]. For example, Jirousek and Shenoy [10] designed an entropy measure for mass functions by combining the plausibility transformation and the weighted Hartley entropy. Zhou and Deng [16] proposed a fractal-based belief entropy on the basis of Deng's entropy. In terms of belief intervals of single elements, Moral-Garcia and Abellan [17] developed an uncertainty measure of mass functions which is analogous to AU. Besides, Cui and Deng [18] presented a total uncertainty measure of mass functions based on the plausibility function, which is called plausibility entropy. Facing existing uncertainty measures, Dezert and Tchamova [19] raised the effectiveness problem of uncertainty measures and provided four desiderata to check whether an uncertainty measure is effective, and a new effective measure of uncertainty for mass functions was proposed in [20].
Cross entropy and relative entropy are two other important concepts in Shannon's information theory. Although research on the entropy of mass functions is flourishing, studies on the cross entropy and relative entropy of mass functions are relatively rare. This fact is easy to understand: because cross and relative entropies are strongly connected with the concept of entropy, their forms are usually built upon the definition of entropy, and many existing entropy measures of mass functions can hardly yield corresponding cross and relative entropies. Very recently, Dezert and Dambreville [21] provided definitions of the cross entropy and relative entropy of mass functions in terms of Dezert's effective measure of uncertainty presented in [20]. In addition, Gao et al. [22] proposed a definition of the cross entropy of mass functions based on Deng's entropy [11]. However, as will be analyzed in this paper, the existing definitions of cross entropy of mass functions have some defects in their underlying properties, so a new cross entropy, as well as a corresponding relative entropy, of mass functions is required, which is the purpose of this study.
Specifically, inspired by the plausibility entropy, a total uncertainty measure of mass functions presented in [18], a new cross entropy and a new relative entropy of mass functions are given in this paper, named the cross plausibility entropy and the relative plausibility entropy, respectively. The properties of the cross and relative plausibility entropies are given, which show a strong connection with the classical cross entropy and relative entropy in Shannon's information theory. In addition, an illustrative example in parameter estimation is given to show the potential application of the presented entropies.
The remainder of the paper is organized as follows. Section 2 briefly introduces the basic knowledge of Dempster-Shafer theory. Related work regarding existing definitions of cross entropy of mass functions is reviewed in Section 3. Then, the new cross and relative entropies of mass functions are given in Section 4. An example of application is provided in Section 5. Finally, Section 6 concludes the study.
Dempster-Shafer theory [2, 3] provides a well-defined framework to represent and deal with information involving epistemic uncertainty. In this theory, the set of possible answers to a given question of interest is called a frame of discernment (FOD), denoted as $\Theta$, in which the set is collectively exhaustive and all elements are mutually exclusive. The power set of the FOD $\Theta$ is represented by $2^\Theta$.
In order to represent uncertain information involving epistemic uncertainty, mass functions, also known as basic probability assignments (BPAs), are defined in Dempster-Shafer theory. A mass function is a mapping from the power set of a FOD $\Theta$ to the interval $[0, 1]$, denoted as $m: 2^\Theta \rightarrow [0, 1]$, satisfying the following conditions

$$m(\emptyset) = 0, \qquad \sum_{A \in 2^\Theta} m(A) = 1$$
where $A \in 2^\Theta$ is called a focal element of $m$ if $m(A) > 0$, and $m(A)$ measures the belief assigned exactly to $A$. In general, a probability distribution can be treated as a Bayesian mass function in Dempster-Shafer theory, where $m(A) = 0$ for every $A$ with $|A| > 1$.
Belief function and plausibility function are two equivalent forms of a mass function, which respectively express the lower bound and the upper bound of the support degree of a set $A \subseteq \Theta$ based on a given mass function $m$. Given a mass function $m$, $Bel$ and $Pl$ are defined as follows

$$Bel(A) = \sum_{B \subseteq A} m(B), \qquad Pl(A) = \sum_{B \cap A \neq \emptyset} m(B)$$

where $A, B \in 2^\Theta$. For each $A \in 2^\Theta$, there is $Bel(A) \leq Pl(A)$, and $[Bel(A), Pl(A)]$ is called the belief interval of $A$. In general, the wider the belief interval of a set $A$, the larger the uncertainty it contributes to the whole mass function. Therefore, in Dempster-Shafer theory, given a FOD $\Theta$, the vacuous mass function $m_v$, in which $m_v(\Theta) = 1$, has the largest uncertainty.
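To make these definitions concrete, the following is a minimal Python sketch (illustrative, not from the paper) that represents a mass function as a mapping from focal elements to masses and computes $Bel$ and $Pl$:

```python
# A mass function on the FOD {'a', 'b', 'c'}: focal elements (frozensets) -> masses summing to 1.
m = {
    frozenset({'a'}): 0.4,
    frozenset({'a', 'b'}): 0.3,
    frozenset({'a', 'b', 'c'}): 0.3,
}

def bel(m, A):
    """Belief of A: total mass of the focal elements contained in A."""
    A = frozenset(A)
    return sum(v for B, v in m.items() if B <= A)

def pl(m, A):
    """Plausibility of A: total mass of the focal elements intersecting A."""
    A = frozenset(A)
    return sum(v for B, v in m.items() if B & A)

print(bel(m, {'a'}), pl(m, {'a'}))  # endpoints of the belief interval of {'a'}
```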
In this section, two existing definitions of cross entropy of mass functions, proposed very recently, are reviewed.
Dezert [20] has given an entropy definition, denoted as $U(m)$, to measure the uncertainty of a mass function $m$ on a FOD $\Theta$, in which the natural logarithm is used.
On the basis of $U(m)$, Dezert and Dambreville [21] have further proposed a cross entropy of mass functions defined on the same FOD.
In addition, a relative entropy of mass functions was also proposed in [21].
It is proved in [21] that Dezert's entropy, Dezert and Dambreville's cross entropy, and Dezert and Dambreville's relative entropy can respectively degenerate into the classical Shannon's entropy, cross entropy, and relative entropy (also known as the Kullback-Leibler (KL) divergence), and that they satisfy the classical relation in which the cross entropy equals the entropy plus the relative entropy.
Gao et al. [22] presented another cross entropy definition of mass functions, in which the underlying entropy definition for mass functions is Deng's entropy [11], denoted as $E_d(m)$.
Since a relative entropy inspired by Deng's entropy has not been defined at present, the quantitative relation between Gao et al.'s cross entropy and Deng's entropy has not been established yet.
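For reference, a minimal sketch of Deng's entropy as it is commonly stated in the literature [11], reusing the dictionary representation of the earlier sketch (the helper name deng_entropy is ours, not from [22]):

```python
import math

def deng_entropy(m):
    """Deng's entropy: -sum_A m(A) * log2( m(A) / (2^|A| - 1) )."""
    return -sum(v * math.log2(v / (2 ** len(A) - 1)) for A, v in m.items() if v > 0)

example = {frozenset({'a'}): 0.5, frozenset({'a', 'b'}): 0.5}
print(deng_entropy(example))
```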
In this subsection, the two cross entropy definitions of mass functions reviewed above are briefly analyzed.
First, Gao et al.'s cross entropy has been questioned in [21] because the underlying Deng's entropy is non-effective. According to the four desiderata proposed in [19], "unicity of max value of MoU" (D4), i.e., the requirement that the vacuous mass function $m_v$ (with $m_v(\Theta) = 1$) be the unique maximizer of the measure of uncertainty, is not satisfied by Deng's entropy. In terms of Deng's entropy, given a FOD $\Theta$, the vacuous mass function does not have the maximum uncertainty. Please refer to the literature [21, 31, 32] for more details.
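For instance, a quick numerical check with the deng_entropy helper above (an illustrative case on a three-element FOD, not necessarily the counterexample discussed in [21, 31, 32]) shows that the vacuous BPA does not maximize Deng's entropy:

```python
fod3 = {'a', 'b', 'c'}
vacuous = {frozenset(fod3): 1.0}                 # m(Theta) = 1
doubletons = {frozenset({'a', 'b'}): 1 / 3,
              frozenset({'b', 'c'}): 1 / 3,
              frozenset({'a', 'c'}): 1 / 3}

print(deng_entropy(vacuous))     # log2(7), about 2.807
print(deng_entropy(doubletons))  # log2(9), about 3.170, larger than the vacuous BPA's value
```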
Secondly, based on a similar consideration, Dezert and Dambreville's cross entropy may also not be recommended, since its underlying entropy measure $U(m)$ violates the monotonicity [17] of a rational uncertainty measure in belief function theory. Specifically, the monotonicity means that the uncertainty of $m_1$ should not exceed that of $m_2$ whenever, for every singleton, the belief interval under $m_1$ is contained in the belief interval under $m_2$, for arbitrary BPAs $m_1$ and $m_2$ defined on the same FOD $\Theta$. Literature [23] first revealed this problem of $U(m)$; however, the counterexample given in [23] has a small problem in the calculation (a quantity in the formula is misused). In this paper, a real counterexample of $U(m)$ against the monotonicity is given as below.
Given a FOD $\Theta$, let $m_1$ and $m_2$ be two mass functions defined on $\Theta$ such that, for every singleton, the belief interval under $m_1$ is contained in the belief interval under $m_2$. In terms of the monotonicity, it clearly should hold that the uncertainty of $m_1$ does not exceed that of $m_2$. However, evaluating Dezert's entropy $U$ (expressed in nats) on the two mass functions yields $U(m_1) > U(m_2)$. Therefore, the monotonicity is violated by $U$.
Based on the above analysis, a new cross entropy definition of mass functions is required, which is exactly the purpose of this study.
In this paper, new cross and relative entropies of mass functions are presented, which are based on an uncertainty measure called plausibility entropy.
The plausibility entropy, denoted as $H_{pl}(m)$, was recently proposed in [18] for a mass function $m$ defined on a FOD $\Theta$. Alternatively, the plausibility entropy can also be expressed in the form of Shannon's entropy of the plausibility transformation, where $Pl_P(m)$ is the plausibility transformation [24] of $m$, satisfying $Pl_P(m)(\theta) = Pl(\{\theta\}) / \sum_{\theta' \in \Theta} Pl(\{\theta'\})$ for each $\theta \in \Theta$ and $\sum_{\theta \in \Theta} Pl_P(m)(\theta) = 1$, and $H(\cdot)$ denotes Shannon's entropy of a probability distribution on $\Theta$.
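Continuing the earlier sketch (the pl helper and the mass function m are reused), the following computes the plausibility transformation and the Shannon entropy of the resulting distribution, the two ingredients in terms of which the Shannon-entropy form above is expressed; it is not the verbatim formula of [18]:

```python
import math

def pl_transform(m, fod):
    """Plausibility transformation Pl_P(m): normalized singleton plausibilities."""
    pls = {t: pl(m, {t}) for t in fod}
    total = sum(pls.values())
    return {t: v / total for t, v in pls.items()}

def shannon_entropy(p, base=2.0):
    """Shannon entropy H(P) of a probability distribution given as a dict."""
    return -sum(v * math.log(v, base) for v in p.values() if v > 0)

fod = {'a', 'b', 'c'}
pl_p = pl_transform(m, fod)
print(pl_p, shannon_entropy(pl_p))
```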
It can be proved that the plausibility entropy satisfies the four desiderata given in [19] for an effective measure of uncertainty (MoU), including "zero min value of MoU" (D1), "increasing of MoU of vacuous BPA" (D2), "compatibility with Shannon's entropy" (D3), and "unicity of max value of MoU" (D4). In addition, many other desirable properties are also satisfied by $H_{pl}$; please refer to [18, 25, 26] for more information. Based on the well-defined plausibility entropy, new cross and relative entropies of mass functions, named cross plausibility entropy and relative plausibility entropy respectively, are given below.
Cross plausibility entropy. Given two mass functions $m_1$ and $m_2$ on the same FOD $\Theta$, the cross plausibility entropy, denoted as $CH_{pl}(m_1, m_2)$, is defined in terms of the plausibility functions of $m_1$ and $m_2$.
Moreover, the cross plausibility entropy can also be represented through the classical cross entropy $H(Pl_P(m_1), Pl_P(m_2))$ between the probability distributions $Pl_P(m_1)$ and $Pl_P(m_2)$, where $Pl_P(m_1)$ and $Pl_P(m_2)$ are the plausibility transformations of $m_1$ and $m_2$, respectively.
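A corresponding sketch of the classical cross entropy $H(Pl_P(m_1), Pl_P(m_2))$ between the two plausibility transformations, i.e., the quantity appearing in the representation above (the exact expression of $CH_{pl}$ is given by its definition); the two mass functions here are illustrative:

```python
def shannon_cross_entropy(p, q, base=2.0):
    """Classical cross entropy H(P, Q) = -sum_x P(x) log Q(x); assumes Q(x) > 0 wherever P(x) > 0."""
    return -sum(pv * math.log(q[x], base) for x, pv in p.items() if pv > 0)

m1 = {frozenset({'a'}): 0.6, frozenset({'a', 'b'}): 0.4}
m2 = {frozenset({'b'}): 0.5, frozenset({'a', 'b', 'c'}): 0.5}
print(shannon_cross_entropy(pl_transform(m1, fod), pl_transform(m2, fod)))
```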
In terms of Eq. (4), a series of properties satisfied by the cross plausibility entropy can be obtained easily.
Having the above definition of the cross plausibility entropy, the relative entropy of mass functions can be defined immediately in a similar manner. We note that reference [27] provided a KL divergence as a straightforward derivative of the plausibility entropy; it is exactly the expected form of the relative entropy and is introduced as follows.
Relative plausibility entropy. Let $m_1$ and $m_2$ be two mass functions defined on a FOD $\Theta$; the relative plausibility entropy of $m_1$ with respect to $m_2$ is denoted as $D_{pl}(m_1 \Vert m_2)$.
Similarly, the relative plausibility entropy can also be simply represented by means of the classical cross entropy $H(Pl_P(m_1), Pl_P(m_2))$ between the probability distributions $Pl_P(m_1)$ and $Pl_P(m_2)$, i.e., $H(P, Q) = -\sum_{\theta \in \Theta} P(\theta) \log Q(\theta)$.
From Eq. (6), some properties of the relative plausibility entropy are derived as below.
Moreover, analogous to the equality relation for probability distributions $P$ and $Q$ in terms of Shannon's entropy, the classical cross entropy, and the classical relative entropy, i.e., $H(P, Q) = H(P) + D(P \Vert Q)$, the presented $CH_{pl}(m_1, m_2)$ and $D_{pl}(m_1 \Vert m_2)$, together with the plausibility entropy $H_{pl}(m_1)$, also meet the following equality relation

$$CH_{pl}(m_1, m_2) = H_{pl}(m_1) + D_{pl}(m_1 \Vert m_2)$$
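Likewise, the classical relative entropy between the two plausibility transformations can be computed, together with a numeric check of the classical identity $H(P, Q) = H(P) + D(P \Vert Q)$ that the relation above mirrors at the level of mass functions (reusing the helpers and illustrative mass functions of the previous sketches):

```python
def shannon_kl(p, q, base=2.0):
    """Classical relative entropy D(P || Q) = sum_x P(x) log( P(x) / Q(x) )."""
    return sum(pv * math.log(pv / q[x], base) for x, pv in p.items() if pv > 0)

P, Q = pl_transform(m1, fod), pl_transform(m2, fod)
# Classical identity: cross entropy = entropy + relative entropy.
assert abs(shannon_cross_entropy(P, Q) - (shannon_entropy(P) + shannon_kl(P, Q))) < 1e-12
```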
In this section, an illustrative example regarding parameter estimation is provided to show the potential application of the proposed cross plausibility entropy $CH_{pl}$. The example originally comes from [28].
Assume there are $n$ patients randomly taken from a population in which a proportion $p$ suffers from a disease, and each of them is represented by $x_i$ to show whether he/she has the disease ($x_i = 1$) or not ($x_i = 0$). Then, these random samples $x_1, \ldots, x_n$, which are independent and identically distributed (iid), can be viewed as outcomes of a Bernoulli variable. For realizations $x_1, \ldots, x_n$, the probability can be calculated by

$$P(x_1, \ldots, x_n; p) = \prod_{i=1}^{n} p^{x_i} (1 - p)^{1 - x_i}$$
The task is to estimate the unknown parameter $p$ according to the state descriptions $x_1, \ldots, x_n$. However, due to uncertainty, these states are only partially known; let $m_i$ be the mass function concerning the state associated with patient $i$. Table 1 gives a data set composed of $n = 6$ observations, in which the fourth one is uncertain and represented by $m_4(\{0\}) = \alpha$, $m_4(\{1\}) = \beta$, and $m_4(\{0, 1\}) = 1 - \alpha - \beta$, where $\alpha, \beta \geq 0$ and $\alpha + \beta \leq 1$.
Observation | 1 | 2 | 3 | 4 | 5 | 6
---|---|---|---|---|---|---
$m_i(\{1\})$ | 0.0 | 0.0 | 0.0 | $\beta$ | 1.0 | 1.0
$m_i(\{0\})$ | 1.0 | 1.0 | 1.0 | $\alpha$ | 0.0 | 0.0
$m_i(\{0, 1\})$ | 0.0 | 0.0 | 0.0 | $1 - \alpha - \beta$ | 0.0 | 0.0
Literature [28] proposed an evidential expectation-maximization (E2M) algorithm to estimate the parameter $p$ in terms of a maximum likelihood principle. Figure 1 gives the results of using the E2M algorithm with respect to different $\alpha$ and $\beta$. It is found that, by using the E2M algorithm, the estimate of $p$ is unchanged if the plausibility transformations of different $m_4$, caused by varying $\alpha$ and $\beta$, are the same. In other words, two uncertain observations $m_4$ with different uncertainty degrees but identical plausibility transformations lead to the same estimation of $p$.
Now, let us use a cross entropy-based method to solve the issue of estimating the parameter $p$, where $p$ is derived by minimizing a total cross entropy loss; this coincides with the maximum likelihood principle widely used in machine learning.
For the data set shown in Table 1, since there is an uncertain observation involving epistemic uncertainty, the proposed cross plausibility entropy $CH_{pl}$ is used to calculate the total cross entropy loss. Let $q$ be a distribution relying on the parameter $p$, with $q(1) = p$ and $q(0) = 1 - p$; then the total loss is obtained by summing $CH_{pl}(m_i, q)$ over all observations. By setting the derivative of the total loss with respect to $p$ to zero, the estimate of $p$ is obtained.
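As a concrete illustration (not the paper's exact computation), the following sketch estimates $p$ by minimizing the total loss numerically. It assumes that the cross plausibility entropy of an observation $m_i$ against the Bayesian distribution $q$ reduces to a classical cross entropy weighted by the singleton plausibilities of $m_i$, i.e., $-[Pl_i(\{1\}) \log p + Pl_i(\{0\}) \log (1 - p)]$; the observation values and the choice of $\alpha$ and $\beta$ are likewise only illustrative:

```python
import math

def total_loss(p, observations):
    """Total loss: sum_i -[ Pl_i({1}) * log(p) + Pl_i({0}) * log(1 - p) ] (assumed form)."""
    return -sum(pl1 * math.log(p) + pl0 * math.log(1 - p) for pl0, pl1 in observations)

# Illustrative data: five certain observations (x = 0, 0, 0, 1, 1) and one uncertain observation
# with m({0}) = alpha, m({1}) = beta, m({0,1}) = 1 - alpha - beta, whose singleton
# plausibilities are Pl({0}) = 1 - beta and Pl({1}) = 1 - alpha.
alpha, beta = 0.1, 0.2
obs = [(1.0, 0.0)] * 3 + [(0.0, 1.0)] * 2 + [(1.0 - beta, 1.0 - alpha)]

# Closed-form minimizer of the assumed loss: total plausibility of state 1 over total plausibility.
p_hat = sum(pl1 for _, pl1 in obs) / sum(pl0 + pl1 for pl0, pl1 in obs)

# Cross-check with a coarse grid search over (0, 1).
p_grid = min((k / 1000 for k in range(1, 1000)), key=lambda p: total_loss(p, obs))
print(p_hat, p_grid)
```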
Figure 2 shows the results of using the proposed cross plausibility entropy $CH_{pl}$ with different $\alpha$ and $\beta$. Compared with the results of the E2M algorithm, the estimate of $p$ changes when the uncertain observation $m_4$ takes different uncertainty degrees. In particular, two observations $m_4$ with identical plausibility transformations but different uncertainty degrees yield different estimates of $p$.
For comparison, Dezert and Dambreville's cross entropy is also used in the example to obtain the estimation of parameter $p$. Let the estimate be represented by a Bayesian mass function $m_q$ in which $m_q(\{1\}) = p$, $m_q(\{0\}) = 1 - p$, and $m_q(\{0, 1\}) = 0$, where $p \in [0, 1]$. Then, in terms of Dezert and Dambreville's cross entropy, a total loss is obtained by summing the cross entropy between each observation $m_i$ and $m_q$, and minimizing this loss yields the estimate of $p$. Figure 3 graphically shows the results obtained by using Dezert and Dambreville's cross entropy.
The cross entropy from Gao et al. [22] is also used in the example. Similarly, a total loss is calculated by summing Gao et al.'s cross entropy between each observation $m_i$ and $m_q$, and the estimation of $p$ is derived by minimizing this loss. The results obtained with Gao et al.'s cross entropy are shown in Figure 4.
In addition, cross entropies inspired by two widely used entropies of mass functions, the pignistic entropy [29, 4] and the Yager entropy [30], are also considered for comparison. In terms of the formula of the pignistic entropy $H_{betP}(m) = -\sum_{\theta \in \Theta} BetP(\theta) \log BetP(\theta)$, where $BetP(\theta) = \sum_{A \ni \theta} m(A)/|A|$ is the pignistic transformation of $m$, the cross pignistic entropy is naturally defined as $CH_{betP}(m_1, m_2) = -\sum_{\theta \in \Theta} BetP_1(\theta) \log BetP_2(\theta)$. Then, a total cross entropy loss can be obtained by summing $CH_{betP}(m_i, q)$ over all observations, where $q(1) = p$ and $q(0) = 1 - p$. By setting the derivative of this loss with respect to $p$ to zero, the estimate of $p$ is obtained, whose graphical results with different values of $\alpha$ and $\beta$ are given in Figure 5.
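For a small comparative sketch of the pignistic route (continuing the illustrative data above, and assuming the cross pignistic entropy is the plain cross entropy between pignistic transformations), the pignistic transformation splits each focal element's mass equally among its elements, and the resulting estimate of $p$ is the average of $BetP_i(\{1\})$ over the observations:

```python
def bet_p(m, fod):
    """Pignistic transformation BetP: split each focal element's mass equally among its elements."""
    p = {t: 0.0 for t in fod}
    for A, v in m.items():
        for t in A:
            p[t] += v / len(A)
    return p

# Uncertain observation 4, with the illustrative alpha and beta of the earlier sketch.
m4 = {frozenset({0}): alpha, frozenset({1}): beta, frozenset({0, 1}): 1 - alpha - beta}
bet4 = bet_p(m4, {0, 1})

# The five certain observations contribute BetP_i({1}) equal to 0, 0, 0, 1, 1.
p_hat_betp = (0 + 0 + 0 + 1 + 1 + bet4[1]) / 6
print(bet4, p_hat_betp)
```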
Similarly, according to the expression of the Yager entropy $H_Y(m) = -\sum_{A \in 2^\Theta} m(A) \log Pl(A)$, a cross Yager entropy $CH_Y(m_1, m_2)$ is defined in an analogous manner. Then, the corresponding total cross entropy loss is obtained by summing $CH_Y(m_i, q)$ over all observations. By calculating the minimizer of this loss, the estimate of $p$ is obtained, which is the same as that obtained by using Gao et al.'s cross entropy. Figure 6 shows the results of using the cross Yager entropy with different $\alpha$ and $\beta$.
For the sake of further comparison of these methods, by letting $\alpha = \beta = \delta$, the uncertain observation becomes $m_4(\{0\}) = m_4(\{1\}) = \delta$ and $m_4(\{0, 1\}) = 1 - 2\delta$, where $\delta \in [0, 0.5]$. If $\delta = 0$, $m_4$ has the maximum uncertainty, and the uncertainty of $m_4$ is the least when $\delta = 0.5$. It is noted that there is no preference between states $0$ and $1$, because $m_4(\{0\}) = m_4(\{1\})$ and $Pl_4(\{0\}) = Pl_4(\{1\})$. Figure 7 shows the estimation results of $p$ obtained by the different methods for the case of the new $m_4$ with $\delta \in [0, 0.5]$.
Observation | 1 | 2 | 3 | 4 | 5 | 6
---|---|---|---|---|---|---
$Pl_i(\{1\})$ | 0.0 | 0.0 | 0.0 | $1 - \delta$ | 1.0 | 1.0
$Pl_i(\{0\})$ | 1.0 | 1.0 | 1.0 | $1 - \delta$ | 0.0 | 0.0
In theory, if we only had observations 1, 2, 3, 5, 6, then in terms of the maximum likelihood principle the estimation of $p$ should be the proportion of positive observations among them. With the consideration of observation 4, i.e., the unbiased $m_4$: (i) when $\delta = 0$, the estimation of $p$ should lie in an interval, owing to the existence of epistemic uncertainty in observation 4 with $m_4(\{0, 1\}) = 1$; (ii) when $\delta = 0.5$, the estimation of $p$ should take a single precise value, since in this case the example becomes a mixture model for a two-dimensional Bernoulli distribution with $m_4(\{0\}) = m_4(\{1\}) = 0.5$, in which there is only random uncertainty; (iii) as $\delta$ increases from 0 to 0.5, the value of the estimated $p$ should change monotonically, because for the unbiased $m_4$ the only change is its uncertainty degree, which decreases monotonically; (iv) therefore, there is a monotonic path for the estimation of $p$ from the start point at $\delta = 0$ to the end point at $\delta = 0.5$.
From Figure 7, the result of the E2M algorithm is not reasonable, since the estimated $p$ stays at 0.4 as $\delta$ varies. The cross pignistic entropy $CH_{betP}$ also produces a result that is insensitive to the change of observation $m_4$. The proposed cross plausibility entropy $CH_{pl}$ gives an estimate of $p$ that declines monotonically with the rise of $\delta$, while Dezert and Dambreville's cross entropy, Gao et al.'s cross entropy, and the cross Yager entropy present the opposite trend of change. The difference between the result of $CH_{pl}$ and those of the other three cross entropies is caused by the underlying logic of the different entropy measures. The plausibility entropy is based on the plausibility function and tends to capture the maximum uncertainty degree that a mass function could possibly have; therefore, the cross plausibility entropy gives a relatively large estimation of $p$. In contrast, it seems that the other three cross entropies tend to obtain a relatively small estimation of $p$.
Compared with Dezert and Dambreville's cross entropy, Gao et al.'s cross entropy, and the cross Yager entropy, the proposed $CH_{pl}$ is more recommendable in theory because, first, the underlying entropy definitions of the former ones have some defects, as analyzed in Section 4.1 and the related references [19, 5], and second, the underlying mechanism of using the cross plausibility entropy to obtain the estimation of $p$ is clearer. In this example, with the use of the cross plausibility entropy $CH_{pl}$, the classical Bernoulli distribution based on probabilities is generalized to a new Bernoulli distribution with plausibility distribution observations, as shown in Table 2. According to Table 2, the estimation of parameter $p$ can be obtained immediately from the plausibility observations. Moreover, this Bernoulli distribution with plausibility distribution observations can be easily extended to the case of a multi-dimensional Bernoulli distribution.
In this paper, new definitions of cross entropy and relative entropy of mass functions have been given on the basis of a recently presented total uncertainty measure of mass functions called plausibility entropy. Properties of the cross plausibility entropy and relative plausibility entropy have also been presented. In addition, an illustrative example of application has been provided to show the effectiveness of the presented entropies compared with other methods and entropy definitions of mass functions. The presented cross plausibility entropy and relative plausibility entropy of mass functions can be used in scenarios of multi-source information fusion based on Dempster-Shafer evidence theory, such as target recognition and fault diagnosis.
In future work, on the one hand, more theoretical analysis of the presented cross and relative plausibility entropies will be conducted; on the other hand, practical applications of the cross and relative plausibility entropies in multi-sensor information fusion, decision support systems, intelligent diagnosis, and so forth will be further considered.