entropy
Article
An Adaptive Weight Method for Image Retrieval Based Multi-Feature Fusion
Xiaojun Lu ID , Jiaojuan Wang, Xiang Li, Mei Yang and Xiangde Zhang *
College of Sciences, Northeastern University, Shenyang 110819, China; luxiaojun@mail.neu.edu.cn (X.L.); 17640044931@163.com (J.W.); lxiang_1226@163.com (X.L.); yyangm1104@163.com (M.Y.)
* Correspondence: zhangxiangde@mail.neu.edu.cn; Tel.: +86-24-8368-7680
Received: 23 June 2018; Accepted: 31 July 2018; Published: 6 August 2018
Abstract: With the rapid development of information storage technology and the spread of the Internet, large-capacity image databases with diverse content are being generated, and it has become imperative to establish automatic and efficient image retrieval systems. This paper proposes a novel adaptive weighting method based on entropy theory and relevance feedback. Firstly, we obtain the trust of each single feature by relevance feedback (supervised) or entropy (unsupervised). Then, we construct a transfer matrix based on trust. Finally, based on the transfer matrix, we get the weight of each single feature through several iterations. The method has three outstanding advantages: (1) The retrieval system combines the performance of multiple features and has better retrieval accuracy and generalization ability than a single-feature retrieval system; (2) In each query, the weight of each single feature is updated dynamically with the query image, which lets the retrieval system make full use of the performance of the individual features; (3) The method can be applied in two cases: supervised and unsupervised. The experimental results show that our method significantly outperforms previous approaches. The Top 20 retrieval accuracy is 97.09%, 92.85%, and 94.42% on the Wang, UC Merced Land Use, and RSSCN7 datasets, respectively. The Mean Average Precision is 88.45% on the Holidays dataset.
Keywords: image retrieval; multi-feature fusion; entropy; relevance feedback
1. Introduction
As an important carrier of information, images deserve efficient study [1–6]. Large-scale image retrieval has vast applications in many domains such as image analysis, image search over the Internet, medical image retrieval, remote sensing, and video surveillance [7–24]. There are two common image retrieval systems: text-based image retrieval systems and content-based image retrieval systems. Text-based image retrieval requires experienced experts to annotate images, which is very expensive and time-consuming [7]. Content-based retrieval systems can be divided into two categories [8]: one is based on global features indexed with hashing strategies; the other is based on local scale-invariant features indexed by a vocabulary tree or a k-d tree. The two kinds of features have pros and cons, and their performance is complementary [6,8]. In recent years, much excellent work focused on improving accuracy and efficiency has been done [6]. A dynamically updating Adaptive Weights Allocation Algorithm (AWAA), which rationally allocates fusion weights proportional to their contributions to matching, was proposed previously [7]; it helps the system gain more complementary and helpful image information during feature fusion. In a previous paper [8], the authors improve the reciprocal-neighbor-based graph fusion approach for feature fusion with an SVM prediction strategy, which increases the robustness of the original graph fusion approach. In another past paper [9], the authors propose a graph-based query-specific fusion approach where multiple retrieval sets are merged and reranked by conducting a link analysis on a fused graph, which is capable of adaptively integrating the strengths of retrieval methods using local or holistic features for
Entropy 2018, 20, 577; doi:10.3390/e20080577 www.mdpi.com/journal/entropy
different queries without any supervision. In a previous paper [10], the authors propose a simple yet effective late fusion method at the score level by examining score curves and weighting different features in a query-adaptive manner. In another previous paper [11], the authors present a novel framework for color image retrieval that combines the ranking results of different descriptors through various post-classification methods. In a past work [12], the authors propose the robust discriminative extreme learning machine (RDELM), which enhances the discrimination capacity of ELM for relevance feedback. In a previous paper [13], the authors present a novel visual word integration of Scale Invariant Feature Transform (SIFT) and Speeded-Up Robust Features (SURF); the visual word integration of SIFT and SURF adds the robustness of both features to image retrieval. In another past work [14], an improved algorithm for center adjustment of RBFNNs and a novel algorithm for width determination were proposed to optimize the efficiency of the Optimum Steepest Descent (OSD) algorithm, achieving fast convergence and a better network response with fewer training data. In a previous paper [15], an edge orientation difference histogram (EODH) descriptor and an image retrieval system based on EODH and Color-SIFT were presented. In a previous paper [16], the authors investigate the late fusion of FREAK and SIFT to enhance the performance of image retrieval. In a previous paper [17], the authors propose to compress CNN features using PCA and obtain good performance. In a previous paper [18], the authors improve recent methods for large-scale image search, including introducing a graph-structured quantizer and using binary signatures.
Although the above methods have achieved good results, the performance of retrieval systems still has much room for improvement. To improve performance, integrating multiple features for image retrieval is an effective strategy [19–27]. Measurement-level fusion is widely used, but how to determine the weight of each feature so as to improve retrieval performance is still a very important problem [10,20,28]. In a previous paper [20], the author uses an average global weight to fuse color and texture features for image retrieval. In a previous paper [9], the authors propose a graph-based query-specific fusion approach without any supervision. In a previous paper [10], the author uses the area under the score curve of retrieval based on a single feature as the weight of that feature. The performances of different weight determination methods differ, and adaptive weights can achieve better retrieval performance than global weights. In order to further improve the performance of the retrieval system, unlike previous weight determination methods, this paper proposes a new adaptive weight determination method based on relevance feedback and entropy theory to fuse multiple features. Our method has three outstanding advantages. (1) The retrieval system combines the performance of multiple features and has better retrieval accuracy and generalization ability than a single-feature retrieval system; (2) In each query, the weight of each single feature is updated dynamically with the query image, which lets the retrieval system make full use of the performance of the individual features; (3) Unsupervised image retrieval means that there is no manual participation in the retrieval process. In image search, unsupervised operation is more popular than supervised operation; if we pursue higher retrieval accuracy, supervision is necessary, but from the perspective of user experience, unsupervised is better. It is worth mentioning that our method can be applied in both cases: supervised and unsupervised. In our method, firstly, we obtain single feature trust based on relevance feedback (supervised) or entropy (unsupervised); next, we construct a transfer matrix based on trust; finally, based on the transfer matrix, we get the weight of each single feature through several iterations, which makes full use of the single feature information of the image and can achieve higher retrieval accuracy.
2. Related Work
For image retrieval systems integrating multiple features at the measurement level, this paper mainly focuses on how to determine the weight of each feature to improve retrieval accuracy. In this section, we introduce some work related to our method.
2.1. Framework
The main process of the common system framework for image retrieval based on fusion of multiple features at the metric level is as follows [28–32]. Firstly, we extract several features of the images and build the benchmark image database. Then, when a user enters a query image, we calculate the similarity between the query image and the images of the database based on each feature separately. Finally, we get the comprehensive similarity measure by weighting the several similarities and output retrieval results based on it.
2.2. The Ways to Determine Weight
A lot of work has been done to improve the performance of retrieval systems with multiple features [33,34]. At present, feature fusion is mainly carried out at three levels [8]: the feature level, the index level, and the sorting level. The method proposed in this paper is applicable to fusion at the measurement level. Traditionally, there are two ways to determine the weight of a feature: the global weight [11,20,32] and the adaptive weight [10,35]; the pros and cons of each are listed in Table 1. The former is the reciprocal of the number of features or is decided by experienced experts, which leads the retrieval system to poor generalization and low retrieval performance for different query images. The latter is derived from retrieval feedback based on the feature, which is better than the global weight. However, in sum or product fusion, the distinction between good features and bad features is not obvious; if the weights of the bad features are large, retrieval performance is reduced to a certain extent. In order to clearly distinguish good features from bad features so that the retrieval system can make full use of their performance and achieve better retrieval accuracy, a new adaptive weight retrieval system is proposed. Firstly, we obtain single feature trust based on relevance feedback (supervised) or entropy (unsupervised). Next, we construct a transfer matrix based on trust. Finally, based on the transfer matrix, we get the weight of each single feature through several iterations, which makes full use of the single feature information of the image and can achieve higher retrieval accuracy.
Table 1. Comparison of ways to determine weight.

Method               Pros                                                              Cons
the global weight    short retrieval time                                              poor generalization performance/low retrieval performance
the adaptive weight  good generalization performance/excellent retrieval performance  long retrieval time
The common weighted fusion methods at the measurement level are maximum fusion, multiplication fusion [10], and sum fusion [11,32]. The comprehensive metric obtained by maximum fusion comes from the feature with the maximum weight. The comprehensive metric obtained by multiplication fusion is the product of the different weighted similarity measures. The comprehensive metric obtained by sum fusion is the sum of the different weighted similarity measures. Specifically, K features labeled as F1, F2, ..., FK are fused, q is a query image, and pk ∈ {p1, p2, ..., pn} is a target image of database Ω = {p1, p2, ..., pn}. Each method of fusion is shown as follows:
The maximum fusion:
    sim(q) = Di*(q),  i* = arg max{wq(i) | i = 1, 2, ..., K}   (1)

The multiplication fusion:

    sim(q) = ∏_{i=1}^{K} wq(i)Di(q),  i ∈ {1, 2, ..., K}   (2)

The sum fusion:

    sim(q) = ∑_{i=1}^{K} wq(i)Di(q),  i ∈ {1, 2, ..., K}   (3)

Here, q is a query image, K is the number of features, and wq(i) is the weight of feature Fi ∈ {F1, F2, ..., FK}. Di(q) ∈ {D1(q), D2(q), ..., DK(q)} is the similarity vector between the query image q and the images of database Ω = {p1, p2, ..., pn}, which is calculated based on feature Fi ∈ {F1, F2, ..., FK}. sim(q) is the comprehensive similarity measure.
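To make the three fusion rules concrete, the following minimal NumPy sketch (ours, not taken from the paper's shared code; array names are illustrative) applies maximum, multiplication, and sum fusion to K per-feature similarity vectors:

    import numpy as np

    def max_fusion(D, w):
        # Eq. (1): return the similarity vector of the feature with the largest weight.
        # D: (K, n) array, D[i] = similarities of the query to the n database images
        # under feature i; w: (K,) array of feature weights.
        return D[np.argmax(w)]

    def product_fusion(D, w):
        # Eq. (2): elementwise product of the weighted similarity vectors.
        return np.prod(w[:, None] * D, axis=0)

    def sum_fusion(D, w):
        # Eq. (3): weighted sum of the similarity vectors.
        return np.sum(w[:, None] * D, axis=0)

    # Toy usage with K = 3 features and n = 5 database images:
    D = np.random.rand(3, 5)
    w = np.array([0.5, 0.3, 0.2])
    sim = sum_fusion(D, w)        # comprehensive similarity measure sim(q)
    ranking = np.argsort(-sim)    # most similar database images first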
2.3. Relevance Feedback
2.3. RTehlevarnecleeFveaendcbeackfeedback algorithm [34] is used to solve the semantic gap problem in content-based image retrieval, and the results obtained by relevance feedback are very similar to
The relevance feedback algorithm [34] is used to solve the semantic gap problem in content-based those of human [36,37]. The main steps of relevance feedback are as follows: first, the retrieval
image retrieval, and the results obtained by relevance feedback are very similar to those of
system provides primary retrieval results according to the retrieval keys provided by the user; then,
human [36,37]. The main steps of relevance feedback are as follows: first, the retrieval system provides the user determines which retrieval results are pleasant; finally, the system then provides new
primary retrieval results according to the retrieval keys provided by the user; then, the user determines
retrieval results according to the user’s feedback. In this paper, we get the trust of single feature
which retrieval results are pleasant; finally, the system then provides new retrieval results according to
under the supervised condition through relevance feedback. Under the condition of supervision,
the user’s feedback. In this paper, we get the trust of single feature under the supervised condition
this paper obtains the trust of single feature through relevance feedback.
through relevance feedback. Under the condition of supervision, this paper obtains the trust of single feature through relevance feedback.
3. Proposed Method

In this section, we will introduce our framework and adaptive weight strategy.
3.1. Our Framework
For a specific retrieval system, the weight of each feature is static in different queries. It causes low retrieval performance. In order to overcome the shortcoming, a new image retrieval system based on multi-feature is proposed. The basic framework of the retrieval system is shown in Figure 1.
Figure 1. The proposed retrieval system framework.
In the database creation phase, firstly, we extract features separately; then, we calculate the entropy of different feature dimensions based on each feature; finally, we save features and entropies
to get the image feature database. The original image database is a collection of large numbers of images. The established image feature database and the original image database are in a one-to-one correspondence, for example, the image 1.jpg is stored in the image database. The storage form of the image feature database is 1.jpg (image name), feature, and entropy. In this paper, what we call an image database is actually an image feature database.
In the image search phase, when users enter images, firstly, we calculate the similarity between the query image and the images of database based on each feature separately; then, we get the trust of a single feature; finally, we get the comprehensive similarity measure by weighting several measures and output retrieval results based on it.
Specifically, K features labeled as F1, F2, ..., FK are fused, q is a query image, and pk ∈ {p1, p2, ..., pn} is a target image of database Ω = {p1, p2, ..., pn}. The proposed fusion method is as follows.
Firstly, considering that it would take a long time to calculate similarity measures using several real-valued features, we binarize each feature as follows:
For each bit of feature Fi ∈ {F1, F2, ..., FK}, we output binary codes by:

    ave(Fi) = (1/m) ∑_{j=1}^{m} Fi(cj),  i ∈ {1, 2, ..., K}   (4)

    Fi(cj) = 1 if Fi(cj) ≥ ave(Fi), and Fi(cj) = 0 if Fi(cj) < ave(Fi),  i ∈ {1, 2, ..., K}   (5)

Here, ave(Fi) is the mean of feature Fi ∈ {F1, F2, ..., FK}, m is the dimension of feature Fi, and Fi(cj) is the j-th component of feature Fi.

Then, we calculate the distance between q and pk, and normalize it:

    di(k) = di(q, pk) = ∑_{j=1}^{m} wj |Fqi(j) − Fpki(j)|,  k ∈ {1, 2, ..., n}, i ∈ {1, 2, ..., K}   (6)

    Di(q) = 1 − (1 / ∑_{k=1}^{n} di(k)) (di(1), di(2), ..., di(n)),  i ∈ {1, 2, ..., K}   (7)

Here, Di(q) ∈ {D1(q), D2(q), ..., DK(q)} is the similarity vector between the query image q and the images of database Ω = {p1, p2, ..., pn}, which is calculated based on feature Fi ∈ {F1, F2, ..., FK}. n is the total number of images. Fqi and Fpki respectively represent the feature Fi of q and of pk ∈ {p1, p2, ..., pn}.
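The binarization and similarity computation of Equations (4)–(7) can be sketched as follows (a minimal NumPy version under our reading; the per-dimension weights wj of Section 3.2 are passed in as an argument, and the variable names are ours):

    import numpy as np

    def binarize(F):
        # Eqs. (4)-(5): threshold each component of the feature vector F at its mean.
        return (F >= F.mean()).astype(np.uint8)

    def similarity_vector(Fq, Fp, wj):
        # Eq. (6): weighted L1 distances between the query feature Fq (shape (m,))
        # and the n database features Fp (shape (n, m)).
        d = np.abs(Fp - Fq) @ wj
        # Eq. (7): normalize the distances into the similarity vector D_i(q).
        return 1.0 - d / d.sum()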
We calculate the comprehensive measure sim(q) by fusing multiple features:

    sim(q) = ∑_{i=1}^{K1} w̃q(i)Di(q) + ∑_{i=K1+1}^{K} wq(i)Di(q)   (8)

Here, w̃q(i) ∈ {w̃q(1), w̃q(2), ..., w̃q(K1)}, wq(i) ∈ {wq(K1+1), wq(K1+2), ..., wq(K)}, and K1 are the weight of a good feature, the weight of a bad feature, and the number of good features, respectively. Finally, we sort the similarity sim(q) and get the final search results.
3.2. Entropy of Feature
Information entropy is the expected value of the information contained in each message [38], represented as (N is the number of possible messages):

    H(X) = E(I(X)) = −∑_{j=1}^{N} p(xj) log2 p(xj)   (9)

Here, X is a random phenomenon with N possible outcomes. p(xj) is the probability of outcome xj. H(X) is the nondeterminacy of the occurrence of X.

In our work, the entropy of the j-th dimension feature is calculated as follows:

    Hj = −(1/log2 n) ∑_{i=1}^{n} (fij / ∑_{i=1}^{n} fij) log2 (fij / ∑_{i=1}^{n} fij),  j ∈ {1, 2, ..., m}   (10)

Here, n is the number of images in the database, m is the feature dimension, and fij, i ∈ {1, 2, ..., n}, j ∈ {1, 2, ..., m} is the j-th dimension feature of the i-th image.
The weight of the j-th dimension is calculated as follows:

    wj = e^(1−Hj) / ∑_{j=1}^{m} e^(1−Hj),  j ∈ {1, 2, ..., m}   (11)

Here, Hj is the entropy of the j-th dimension feature and wj is the weight of the j-th dimension. When all the values of a feature dimension are equal, the entropy Hj is 1 and the weight of each feature component is equal to 1/m.
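A minimal sketch of the entropy weighting in Equations (10) and (11), assuming a nonnegative feature matrix; the small constant guarding the logarithm is our addition, since the paper does not specify how zero entries are handled:

    import numpy as np

    def dimension_weights(F, eps=1e-12):
        # F: (n, m) matrix, F[i, j] is the j-th dimension of the i-th image's feature.
        n, m = F.shape
        p = F / (F.sum(axis=0, keepdims=True) + eps)           # f_ij / sum_i f_ij
        H = -(p * np.log2(p + eps)).sum(axis=0) / np.log2(n)   # Eq. (10)
        e = np.exp(1.0 - H)
        return e / e.sum()                                     # Eq. (11)

A quick sanity check against the remark above: if a column of F is constant, every p in that column is 1/n, its entropy is 1, and all dimensions with entropy 1 receive the uniform weight 1/m.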
3.3. Adaptive Weight Strategy
To overcome the problem of low retrieval performance caused by the weight determination method used with multiple feature fusion, this paper proposes a new method to obtain single feature weight. Our method can be applied to supervised learning and unsupervised learning. The specific methods are as follows:
Under the circumstances of supervision, the weight of a single feature is obtained based on relevance feedback. Di(q) ∈ {D1(q), D2(q), ..., DK(q)} is the similarity vector between the query image q and the images of the database, which is calculated based on feature Fi ∈ {F1, F2, ..., FK}. We sort Di(q) and return search results by it. The results are labeled as ai = {a1i, a2i, ..., ati}. Here, t represents the predefined number of returned images. The retrieved results are evaluated according to relevance feedback, and the trusts prex, prey ∈ {pre1, pre2, ..., preK} of single feature retrieval are calculated. That is to say, we rely on the feedback to evaluate the retrieval results, and then use the evaluation index of the dataset to calculate the retrieval performance, which is the trust of the feature. For example, on the Wang dataset with precision as the evaluation index, we search images based on Fi ∈ {F1, F2, ..., FK}; if we find h1 similar images among the h retrieval results by relevance feedback, we believe the trust of Fi is h1/h. The weight of a single feature is then obtained through several iterations as follows: firstly, we construct the transfer matrix HKK = {H(x, y)}, representing the performance preference among the features. Note that feature Fx ∈ {F1, F2, ..., FK} goes to feature Fy ∈ {F1, F2, ..., FK} with a bias of H(x, y); the detailed construction process of HKK = {H(x, y)} is as follows:
    if prey ≥ prex
        H(x, y) = e^(α(prey − prex))   (α ≥ 1)
    else
        H(x, y) = |prey − prex|
When the trust of Fy ∈ {F1, F2, ..., FK} is greater than that of Fx ∈ {F1, F2, ..., FK}, in order to obtain a better retrieval result, we believe that Fx can be replaced by Fy. The replacement depends on the parameter α: the larger α is, the more the retrieval system depends on Fy. The condition α ≥ 1 arises because Fy is better than Fx; we need e^(α(prey − prex)) > |prey − prex|, so that the weight of Fy is larger and the retrieval system relies more on Fy. When the trust of Fy is equal to that of Fx, we believe that Fx can be replaced by Fy with a replacement bias H(x, y) of 1. When the trust of Fy is less than that of Fx, we think that Fx can still be replaced by Fy, but the replacement bias H(x, y) is relatively small. One benefit is that although the retrieval performance of some features is poor, we still believe they are helpful for the retrieval task.
Then, the weight of a single feature is obtained by using the preference matrix. We initialize the weight w1 to w1 = (1/K, 1/K, ..., 1/K). w = {wF1, wF2, ..., wFK} is the weight of a single feature. The wd is the newly acquired weight after an iteration, and wd−1 is the weight of the previous iteration. We use the transfer matrix HKK = {H(x, y)} to iterate the weights based on Formula (12):

    (wF1, wF2, ..., wFK)ᵀ(d) = γ(wF1, wF2, ..., wFK)ᵀ(d−1) + (1 − γ)HKK(wF1, wF2, ..., wFK)ᵀ(d−1),  γ ∈ [0, 1]   (12)

where HKK = {H(x, y)} is the K × K matrix whose x-th row is (H(Fx, F1), H(Fx, F2), ..., H(Fx, FK)).
The wd depends not only on the choice of features depending on the transfer matrix, but also on the wd−1 obtained from the previous calculation. The degree of dependence on the above two depends on the parameter γ. An obvious advantage of this voting mechanism is that it will not affect the final result because of a relatively poor decision. The process is as follows:
    w1 = (1/K, 1/K, ..., 1/K), d = 1
    repeat
        wd = γ wd−1 + (1 − γ) HKK wd−1   (γ ∈ [0, 1])
        wd = wd / sum(wd)
        d ← d + 1
    until ∥wd − wd−1∥ < ε   (ε ≥ 0)
    return wd
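Putting the transfer-matrix construction and the weight iteration together, a minimal sketch of our reading of the procedure (the values of alpha, gamma, and eps are illustrative, not taken from the paper):

    import numpy as np

    def transfer_matrix(pre, alpha=1.0):
        # Build H_KK: H[x, y] is the bias with which feature x goes to feature y.
        K = len(pre)
        H = np.empty((K, K))
        for x in range(K):
            for y in range(K):
                if pre[y] >= pre[x]:
                    H[x, y] = np.exp(alpha * (pre[y] - pre[x]))
                else:
                    H[x, y] = abs(pre[y] - pre[x])
        return H

    def iterate_weights(pre, alpha=1.0, gamma=0.5, eps=1e-6):
        # Iterate w_d = gamma*w_{d-1} + (1-gamma)*H_KK w_{d-1} until convergence,
        # following Formula (12) as printed (row x of H_KK holds H(F_x, .)).
        H = transfer_matrix(pre, alpha)
        w = np.full(len(pre), 1.0 / len(pre))
        while True:
            w_new = gamma * w + (1.0 - gamma) * H @ w
            w_new /= w_new.sum()             # renormalize so the weights sum to 1
            if np.linalg.norm(w_new - w) < eps:
                return w_new
            w = w_new

Since H is strictly positive, this normalized iteration behaves like a power iteration and converges to a fixed weight vector regardless of a single poor intermediate decision, which matches the voting-mechanism argument above.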
• Good features and bad features
In our method, the weight of a single feature is different for different queries. In order to improve the retrieval accuracy, we hope that the features with better retrieval performance can have larger weight than those with poor retrieval performance. For this reason, we divide features into good features and bad features according to retrieval performance. We search image based on Fy ∈ {F1,F2,...,FK} and Fx ∈ {F1,F2,...,FK}, respectively. If the retrieval performance of Fy ∈ {F1,F2,...,FK} is better than Fx ∈ {F1,F2,...,FK}, we think that Fy ∈ {F1,F2,...,FK} is a good feature and Fx ∈ {F1, F2, . . . , FK} is a bad feature. Good features and bad features are specifically defined as follows:
    if prey ≥ prex
        Fy ∈ {good_feature}, Fx ∈ {bad_feature}
    else
        Fx ∈ {good_feature}, Fy ∈ {bad_feature}   (13)

Here, prey ∈ {pre1, pre2, ..., preK} is the retrieval performance of Fy ∈ {F1, F2, ..., FK}, and prex ∈ {pre1, pre2, ..., preK} is the retrieval performance of Fx ∈ {F1, F2, ..., FK}.
• Our method for unsupervised
Image retrieval based on the above adaptive weight strategy is a supervised retrieval process, and users need to participate in the feedback of single feature trust. In the actual application process, users may prefer an automatic retrieval system; that is to say, an unsupervised retrieval system without manual participation is more popular. Therefore, considering the advantages of unsupervised image retrieval, we further study this method and propose an adaptive weight method under unsupervised conditions. The unsupervised method is basically the same as the supervised method; the only difference is that the trust of a single feature is obtained based on entropy rather than relevance feedback.
First, the entropy of Di(q) = (d*i(1), d*i(2), ..., d*i(n)) is:

    Hi = −(1/log2 n) ∑_{j=1}^{n} (d*i(j) / ∑_{j=1}^{n} d*i(j)) log2 (d*i(j) / ∑_{j=1}^{n} d*i(j)),  i ∈ {1, 2, ..., K}   (14)
Here, Di(q) ∈ {D1(q), D2(q), . . . , DK(q)} is the similarity vector between the query image q and images of database, which is calculated based on feature Fi ∈ {F1, F2, . . . , FK}. n is the total number of images. d∗i(j) is the similarity between the query image q and j-th image of database.
Then, the trust of Di(q) ∈ {D1(q), D2(q), . . . , DK(q)} is:
prei = Hi (15)
Here, Di(q) ∈ {D1(q), D2(q), ..., DK(q)} is the similarity vector between the query image q and the images of the database, which is calculated based on feature Fi ∈ {F1, F2, ..., FK}. prei ∈ {pre1, pre2, ..., preK} is the retrieval performance of Fi ∈ {F1, F2, ..., FK}.
After gaining trust, the weight seeking process is the same as the supervised state.
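Under our reading, the unsupervised trust of Equations (14) and (15) is the entropy computation of Section 3.2 applied to each normalized similarity vector; a sketch (names ours):

    import numpy as np

    def unsupervised_trust(D, eps=1e-12):
        # D: (K, n) array of similarity vectors D_i(q) for the K features.
        K, n = D.shape
        p = D / (D.sum(axis=1, keepdims=True) + eps)           # d*_i(j) / sum_j d*_i(j)
        H = -(p * np.log2(p + eps)).sum(axis=1) / np.log2(n)   # Eq. (14)
        return H                                               # Eq. (15): pre_i = H_i

    # The trusts then feed the same transfer-matrix iteration as the supervised case:
    # w = iterate_weights(unsupervised_trust(D))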
4. Performance Evaluation
4.1. Features
The features we choose in this article are as follows:
• Color features. For each image, we compute 2000-dim HSV histogram (H, S, and V are 20, 10, and 10).
• CNN-feature1. The model we used to get CNN feature is VGG-16 [39]. We directly use pre-trained
models to extract features from the fc7 layer as CNN features.
• CNN-feature2. The model we used to get CNN feature is AlexNet which is pre-trained by Simon,
M., Rodner, E., Denzler, J., in their previous work [40]. We directly use the model to extract features from the fc7 layer as CNN features. The dimension of the feature is 4096.
The extraction methods of the color feature, CNN-feature1, and CNN-feature2 follow the original papers and are well known, so we do not repeat them here. The feature extraction code we adopted has been shared at https://github.com/wangjiaojuan/An-adaptive-weight-method-for-image-retrieval-based-multi-feature-fusion.
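For illustration only, the two kinds of features could be extracted roughly as below; this is our sketch using OpenCV and torchvision rather than the authors' shared code, and it assumes the fc7 activation is the output of VGG-16's second fully connected layer:

    import cv2
    import torch
    from torchvision import models, transforms

    def hsv_histogram(path):
        # 2000-dim HSV histogram with 20 x 10 x 10 bins for H, S, and V.
        hsv = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2HSV)
        hist = cv2.calcHist([hsv], [0, 1, 2], None, [20, 10, 10],
                            [0, 180, 0, 256, 0, 256])
        return cv2.normalize(hist, hist).flatten()

    vgg = models.vgg16(pretrained=True).eval()
    # Keep everything up to (and including) the ReLU after the second FC layer (fc7).
    fc7 = torch.nn.Sequential(vgg.features, vgg.avgpool, torch.nn.Flatten(),
                              *list(vgg.classifier.children())[:5])

    preprocess = transforms.Compose([
        transforms.ToPILImage(), transforms.Resize((224, 224)), transforms.ToTensor(),
        transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])])

    def cnn_feature(path):
        # 4096-dim fc7 activation of the pre-trained VGG-16.
        rgb = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2RGB)
        with torch.no_grad():
            return fc7(preprocess(rgb).unsqueeze(0)).squeeze(0).numpy()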
4.2. Database and Evaluation Standard
• Wang (Corel 1K) [41]. It contains 1000 images that are divided into 10 categories. The precision of the Top-r images is used as the evaluation standard of the retrieval system.
• Holidays [42]. It includes 1491 personal holiday pictures and is composed of 500 categories. mAp is used to evaluate the retrieval performance.
• UC Merced Land Use [43]. It contains 21 categories, each with 100 remote sensing images. Each image is taken as the query in turn. The precision of the Top-r images is used as the evaluation standard of the retrieval system.
• RSSCN7 [44]. It contains 2800 images divided into 7 categories of 400 images each. Each image is taken as the query in turn. The precision of the Top-r images is used as the evaluation standard of the retrieval system.
The precision of Top-r images is calculated as follows:
    precision = Nr / r   (16)
Here, Nr is the number of relevant images matching to the query image, r is the total number of results returned by the retrieval system.
The mAp is calculated as follows:

    mAp = (1/|Q|) ∑_{i=1}^{|Q|} (1/RNi) ∑_{j=1}^{RNi} P(RSji)   (17)

Here, |Q| is the number of query images. Suppose qi ∈ Q is a retrieval image; RNi is the total number of relevant images matching qi; RSji denotes the j-th similar image of the query result and NRji is its location information; P(RSji) is the evaluation of the retrieval result of qi and is calculated as follows:

    P(RSji) = RSji / NRji   (18)
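The two evaluation standards can be computed as follows (a small sketch with names of our choosing; relevant is a boolean array marking ground-truth matches in ranked order):

    import numpy as np

    def precision_at_r(relevant, r):
        # Eq. (16): fraction of the top-r returned images that are relevant.
        return np.sum(relevant[:r]) / r

    def average_precision(relevant, num_relevant):
        # Eqs. (17)-(18): average of j / rank_j over the j-th relevant hits.
        ranks = np.flatnonzero(relevant) + 1   # 1-based ranks of the relevant images
        return np.sum(np.arange(1, len(ranks) + 1) / ranks) / num_relevant

    def mean_average_precision(per_query_relevant, per_query_num_relevant):
        # Eq. (17): mAp over the whole query set Q.
        return np.mean([average_precision(rel, n)
                        for rel, n in zip(per_query_relevant, per_query_num_relevant)])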
4.3. Evaluation of the Effectiveness of Our Method
The main innovations of our method are as follows. (1) Based on entropy, we weigh features to improve the accuracy of similarity measurement; (2) Under the supervised condition, we obtain the single feature weight based on related feedback and fuse multi-feature at the measurement level to improve the retrieval precision; (3) Under the unsupervised condition, we obtain the single feature weight based on entropy and fuse multiple features at the measurement level to improve the retrieval precision. To verify the effectiveness of the method, we carried out experiments on Holidays, Wang, UC Merced Land Use, and RSSCN7.
We have done the following experiments. (1) Retrieve images based on the CNN1 feature, color feature, and CNN2 feature, respectively; at the same time, experiments are carried out under two conditions, with and without entropy weighting; (2) under the supervised state, retrieve images by fusing the three different features, using relevance feedback and our method, respectively; (3) under the unsupervised state, retrieve images by fusing the three different features, using average global weights and our method, respectively. An implementation of the code is available at https://github.com/wangjiaojuan/An-adaptive-weight-method-for-image-retrieval-based-multi-feature-fusion.
4.3.1. Unsupervised
Under the unsupervised condition, in order to verify the effectiveness of the adaptive weight method proposed in this paper, we carried out experiments on the Holidays, Wang, UC Merced Land Use, and RSSCN7 datasets. Table 2 shows a comparison of retrieval results based on AVG and OURS. On the Holidays dataset, our method is better than AVG and improves the retrieval precision by 5.12%. On the Wang dataset, our method improves the retrieval accuracy by 0.35% (Top 20), 0.47% (Top 30), and 0.58% (Top 50) compared with AVG. On the UC Merced Land Use dataset, our method improves the retrieval accuracy by 6.61% (Top 20), 9.33% (Top 30), and 12.59% (Top 50) compared with AVG. On the RSSCN7 dataset, our method improves the retrieval accuracy by 2.61% (Top 20), 3.14% (Top 30), and 3.84% (Top 50) compared with AVG.
Table 2. Comparison of retrieval results based on AVG and OURS under unsupervised conditions.

Database   Holidays   Wang (Top 20/30/50)        UC Merced Land Use (Top 20/30/50)   RSSCN7 (Top 20/30/50)
AVG        0.7872     0.9446 / 0.9274 / 0.8924   0.8468 / 0.7851 / 0.6866            0.8842 / 0.8611 / 0.8251
OURS       0.8384     0.9481 / 0.9321 / 0.8982   0.9129 / 0.8784 / 0.8125            0.9103 / 0.8925 / 0.8635
On Wang, UC Merced Land Use, RSSCN7, and Holidays, 50 images were randomly selected as query images, separately. We search similar images by our method. Figure 2 shows the change of weight with the precision of each single feature. The abscissa is the features. From left to right, every three points form one group, showing the precision and weights of the single features of the same image retrieval. For example, in Figure 2a, the abscissa 1–3 represents the three features of the first image in the 50 images selected from Holidays. The blue line represents the weight, and the red line indicates the retrieval performance. We can see that a feature whose retrieval performance is excellent obtains a relatively large weight by our method. That is to say, our method can make better use of good performance features, which is helpful to improve the retrieval performance.
Figure 2. Under unsupervised condition, the change of weight obtained by our method with precision. (a) Experiment result on Holidays; (b) Experiment result on Wang; (c) Experiment result on UC Merced Land Use; (d) Experiment result on RSSCN7.
On Wang, UC Merced Land Use, and RSSCN7, one image was randomly selected as a query image and the Top 10 retrieval results were obtained by our method, respectively. On Holidays, one image was randomly selected as a query image and the Top 4 retrieval results were obtained by our method. Figure 3 shows the retrieval results. The first image in the upper left corner is the query image, labeled "query". The remaining images are the corresponding similar images, labeled by a similarity measure such as 0.999. In accordance with similarity from large to small, we arrange the retrieval results from left to right and from top to bottom.
Figure 3. Under unsupervised condition, retrieval results were displayed. (a) Experiment result on Holidays; (b) Experiment result on Wang; (c) Experiment result on UC Merced Land Use; (d) Experiment result on RSSCN7.
4.3.2. Supervised
Under supervised conditions, in order to verify the effectiveness of the adaptive weight method proposed in this paper, we carried out experiments on the Holidays, Wang, UC Merced Land Use, and RSSCN7 datasets. Table 3 shows a comparison of retrieval results based on RF and OURS. On the Holidays dataset, our method is better than RF, improving the retrieval precision by 0.26%. On the Wang dataset, our method improves the retrieval accuracy by 0.38% (Top 20), 0.38% (Top 30), and 0.34% (Top 50) compared with RF. On the UC Merced Land Use dataset, our method improves the retrieval accuracy by 0.38% (Top 20), 0.45% (Top 30), and 0.05% (Top 50) compared with RF. On the RSSCN7 dataset, our method improves the retrieval accuracy by 0.84% (Top 20), 0.84% (Top 30), and 0.63% (Top 50) compared with RF.
Table 3. Comparison of retrieval results based on RF and OURS under supervised conditions.

Database   Holidays   Wang (Top 20/30/50)        UC Merced Land Use (Top 20/30/50)   RSSCN7 (Top 20/30/50)
RF         0.8819     0.9671 / 0.9539 / 0.9260   0.9247 / 0.8881 / 0.8250            0.9358 / 0.9191 / 0.8892
OURS       0.8845     0.9709 / 0.9577 / 0.9294   0.9285 / 0.8926 / 0.8255            0.9442 / 0.9275 / 0.8955
Similar to the unsupervised state, on Wang, UC Merced Land Use, RSSCN7, and Holidays, 50 images were randomly selected as query images, separately. We search similar images by our method. Figure 4 shows the change of weight with the precision of each single feature. The abscissa is the features. From left to right, every three points form one group, showing the precision and weight of the single features of the same image retrieval. For example, in Figure 4a, the abscissa 1–3 represents the three features of the first image in the 50 images selected from Holidays. The blue line represents the weight and the red line indicates the retrieval performance. We can see that a feature whose retrieval performance, as evaluated by relevance feedback, is excellent obtains a relatively large weight in our method. That is to say, our method can make better use of good performance features, which is helpful to improve the retrieval performance.
Figure 4. Under supervised condition, the change of weight obtained by our method with precision. (a) Experiment result on Holidays; (b) Experiment result on Wang; (c) Experiment result on UC Merced Land Use; (d) Experiment result on RSSCN7.
Similar to the unsupervised state, on Wang, UC Merced Land Use, and RSSCN7, one image was randomly selected as a query image and the Top 10 retrieval results were obtained through our method, respectively. On Holidays, one image was randomly selected as a query image and the Top 4 retrieval results were obtained by our method. Figure 5 shows the retrieval results. The first image in the upper left corner is the query image, labeled "query". The remaining images are the corresponding similar images, labeled by a similarity measure such as 0.999. In accordance with similarity from large to small, we arrange them from left to right and from top to bottom.
Figure 5. Under supervised condition, retrieval results were displayed. (a) Experiment result on Holidays; (b) Experiment result on Wang; (c) Experiment result on UC Merced Land Use; (d) Experiment result on RSSCN7.
4.4. Comparison with Others Methods
In order to illustrate the performance of the supervised and unsupervised methods compared with existing methods, we show the comparison results on the Wang dataset (Top 20) in Table 4. Under the state of supervision, the precision of our method is 97.09%, which is about 26% higher than the previous methods listed [13,14]. Compared with a previous paper [12], it increased by approximately 9.26%. Compared with a previous paper [15], it increased by 24.42%. Compared with a previous paper [16], it increased by about 22.29%. Under the state of unsupervision, the precision of our method is 94.81%,
which is about 23.72% higher than [13,14]. Compared with a previous paper [12], it increased by about 6.98%. Compared with a previous paper [15], it increased by 22.14%. Compared with a previous paper [16], it increased by about 20.01%. From the results, we can see that the method has achieved good results both under supervision and unsupervision. As suggested in Section 3, the supervised method requires users to participate in the feedback of single feature trust, which may cause some users’ aversion. The unsupervised method does not require users to participate in the selection of features, and directly outputs the retrieved images. The unsupervised method or supervised method is determined by the designer according to the actual use of the retrieval system. When we focus on user experience, we choose to be unsupervised. If we focus on higher retrieval accuracy, we choose to be supervised. After deciding whether to adopt supervised or unsupervised, the designer can make use of the corresponding solutions proposed in this paper to improve retrieval performance.
Table 5 shows the comparison results on the Holidays dataset. The mAp of our method is 88.45%. Compared with a previous paper [7], it increased by about 1.55%. Compared with a previous paper [8], it increased by 2.93%. Compared with a previous paper [9], it increased by about 3.81%. Compared with a previous paper [10], it increased by about 0.45%. Compared with [17], it increased by about 9.15%. Compared with a previous paper [18], it increased by about 3.65%.
(Note: To avoid misunderstanding, we do not use an abbreviation for each solution here; the methods used in the comparison are introduced in the Introduction.)
Table 4. Comparison with others methods on Wang.

Method               Africa  Beach  Buildings  Buses   Dinosaurs  Elephants  Flowers  Horses  Mountains  Food   Mean
Ours (Supervised)    87.70   99.35  98.15      100.00  100.00     99.10      99.95    100.00  98.00      88.65  97.09
Ours (Unsupervised)  81.95   98.80  97.25      100.00  100.00     97.45      99.45    100.00  92.15      81.05  94.81
[11]                 51.00   90.00  58.00      78.00   100.00     84.00      100.00   100.00  84.00      38.00  78.30
[12]                 –       –      –          –       –          –          –        –       –          –      87.83
[13]                 69.75   54.25  63.95      89.65   98.70      48.80      92.30    89.45   47.30      70.90  70.58
[14]                 58.73   48.94  53.74      95.81   98.36      64.14      85.64    80.31   54.27      63.14  70.31
[15]                 74.60   37.80  53.90      96.70   99.00      65.90      91.20    86.90   58.50      62.20  72.67
[16]                 63.64   60.99  68.21      92.75   100.00     72.64      91.54    80.06   59.67      58.56  74.80

Table 5. Comparison with others methods on Holidays.

Method  Ours   [7]   [8]    [9]    [10]  [17]  [18]
mAp     88.45  86.9  85.52  84.64  88.0  79.3  84.8
5. Discussion
Fusing multiple features can elevate the retrieval performance of retrieval system effectively. Meanwhile, in the process of multi-feature fusion, the proper single feature weight is helpful to further improve retrieval performance. This paper proposes a method to obtain single feature weights to fuse multiple features for image retrieval.
Retrieval results on daily scene datasets, which are Holidays and Wang, and remote sensing datasets, which are UC Merced Land Use and RSSCN7, show that compared with single feature and fusing multiple features by averaging global weights and relevance feedback, our method has better retrieval performance.
In future work, there are two aspects worth pursuing. On the one hand, considering that image retrieval based on multi-feature fusion increases the retrieval time, we will research how to improve the efficiency of retrieval. Much research on image retrieval has been carried out on
large-scale datasets, which may contain up to several million pictures, and it is very time-consuming to search for the images we need among them, so it is significant to improve the efficiency of retrieval. On the other hand, considering that other forms of entropy have achieved good results in the image field [45,46], we will research other forms of entropy for image retrieval. Meanwhile, considering that image decomposition and the classification of image patches have achieved outstanding results [47–50], we can use these ideas to extract better image descriptions for the retrieval system. It is significant to improve the performance of retrieval.
Author Contributions: X.L. (Xiaojun Lu), J.W. conceived and designed the experiments, performed the experiments and analyzed the data. J.W., X.L. (Xiang Li) and M.Y. wrote the manuscript. X.Z. refined expression of the article. All authors have read and approved the final version of the manuscript.
Funding: This work was supported by the National Natural Science Foundation of China (Grant No. 61703088). Conflicts of Interest: The authors declare no conflicts of interest.
References
1. Dharani, T.; Aroquiaraj, I.L. A survey on content based image retrieval. In Proceedings of the 2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering (PRIME), Salem, India, 21–22 February 2013; pp. 485–490.
2. Lu, X.; Yang, Y.; Zhang, W.; Wang, Q.; Wang, Y. Face Verification with Multi-Task and Multi-Scale Feature Fusion. Entropy 2017, 19, 228. [CrossRef]
3. Zhu, Z.; Zhao, Y. Multi-Graph Multi-Label Learning Based on Entropy. Entropy 2018, 20, 245. [CrossRef]
4. Al-Shamasneh, A.R.; Jalab, H.A.; Palaiahnakote, S.; Obaidellah, U.H.; Ibrahim, R.W.; El-Melegy, M.T. A New Local Fractional Entropy-Based Model for Kidney MRI Image Enhancement. Entropy 2018, 20, 344. [CrossRef]
5. Liu, R.; Zhao, Y.; Wei, S.; Zhu, Z.; Liao, L.; Qiu, S. Indexing of CNN features for large scale image search. arXiv 2015, arXiv:1508.00217. [CrossRef]
6. Shi, X.; Guo, Z.; Zhang, D. Efficient Image Retrieval via Feature Fusion and Adaptive Weighting. In Proceedings of the Chinese Conference on Pattern Recognition, Singapore, 5–7 November 2016; Springer: Berlin, Germany, 2016; pp. 259–273.
7. Zhou, Y.; Zeng, D.; Zhang, S.; Tian, Q. Augmented feature fusion for image retrieval system. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, Shanghai, China, 23–26 June 2015; ACM: New York, NY, USA, 2015; pp. 447–450.
8. Coltuc, D.; Datcu, M.; Coltuc, D. On the Use of Normalized Compression Distances for Image Similarity Detection. Entropy 2018, 20, 99. [CrossRef]
9. Zhang, S.; Yang, M.; Cour, T.; Yu, K.; Metaxas, D.N. Query Specific Fusion for Image Retrieval; Computer Vision–ECCV 2012; Springer: Berlin/Heidelberg, Germany, 2012; pp. 660–673.
10. Zheng, L.; Wang, S.; Tian, L.; He, F.; Liu, Z.; Tian, Q. Query-adaptive late fusion for image search and person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1741–1750.
11. Walia, E.; Pal, A. Fusion framework for effective color image retrieval. J. Vis. Commun. Image Represent. 2014, 25, 1335–1348. [CrossRef]
12. Liu, S.; Feng, L.; Liu, Y.; Wu, J.; Sun, M.; Wang, W. Robust discriminative extreme learning machine for relevance feedback in image retrieval. Multidimens. Syst. Signal Process. 2017, 28, 1071–1089. [CrossRef]
13. Ali, N.; Bajwa, K.B.; Sablatnig, R.; Chatzichristofis, S.A.; Iqbal, Z.; Rashid, M.; Habib, H.A. A novel image retrieval based on visual words integration of SIFT and SURF. PLoS ONE 2016, 11, e0157428. [CrossRef] [PubMed]
14. Montazer, G.A.; Giveki, D. An improved radial basis function neural network for object image retrieval. Neurocomputing 2015, 168, 221–233. [CrossRef]
15. Tian, X.; Jiao, L.; Liu, X.; Zhang, X. Feature integration of EODH and Color-SIFT: Application to image retrieval based on codebook. Signal Process. Image Commun. 2014, 29, 530–545. [CrossRef]
16. Ali, N.; Mazhar, D.A.; Iqbal, Z.; Ashraf, R.; Ahmed, J.; Khan, F.Z. Content-Based Image Retrieval Based on Late Fusion of Binary and Local Descriptors. arXiv 2017, arXiv:1703.08492.
17. Babenko, A.; Slesarev, A.; Chigorin, A.; Lempitsky, V. Neural codes for image retrieval. In Proceedings of the European conference on computer vision, Zurich, Switzerland, 6–12 September 2014; Springer: Cham, Switzerland, 2014; pp. 584–599.
18. Jégou, H.; Douze, M.; Schmid, C. Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 2010, 87, 316–336. [CrossRef]
19. Liu, P.; Guo, J.-M.; Chamnongthai, K.; Prasetyo, H. Fusion of color histogram and LBP-based features for texture image retrieval and classification. Inf. Sci. 2017, 390, 95–111. [CrossRef]
20. Kong, F.H. Image retrieval using both color and texture features. In Proceedings of the 2009 International Conference on Machine Learning and Cybernetics, Hebei, China, 12–15 July 2009; Volume 4, pp. 2228–2232.
21. Zheng, Y.; Huang, X.; Feng, S. An image matching algorithm based on combination of SIFT and the rotation
invariant LBP. J. Comput.-Aided Des. Comput. Graph. 2010, 22, 286–292.
22. Yu, J.; Qin, Z.; Wan, T.; Zhang, X. Feature integration analysis of bag-of-features model for image retrieval.
Neurocomputing 2013, 120, 355–364. [CrossRef]
23. Wang, X.; Han, T.X.; Yan, S. An HOG-LBP human detector with partial occlusion handling. In Proceedings
of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan, 29 September–2 October
2009; pp. 32–39.
24. Fagin, R.; Kumar, R.; Sivakumar, D. Efficient similarity search and classification via rank aggregation.
In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, CA,
USA, 10–12 June 2003; ACM: New York, NY, USA, 2003; pp. 301–312.
25. Terrades, O.R.; Valveny, E.; Tabbone, S. Optimal classifier fusion in a non-bayesian probabilistic framework.
IEEE Trans. Pattern Anal. Mach. Intell. 2009, 31, 1630–1644. [CrossRef] [PubMed]
26. Li, Y.; Zhang, Y.; Tao, C.; Zhu, H. Content-based high-resolution remote sensing image retrieval via unsupervised feature learning and collaborative affinity metric fusion. Remote Sens. 2016, 8, 709. [CrossRef]
27. Mourão, A.; Martins, F.; Magalhães, J. Assisted query formulation for multimodal medical case-based retrieval. In Proceedings of the ACM SIGIR Workshop on Health Search & Discovery: Helping Users and
Advancing Medicine, Dublin, Ireland, 28 July–1 August 2013.
28. De Herrera, A.G.S.; Schaer, R.; Markonis, D.; Müller, H. Comparing fusion techniques for the ImageCLEF
2013 medical case retrieval task. Comput. Med. Imaging Graph. 2015, 39, 46–54. [CrossRef] [PubMed]
29. Deng, J.; Berg, A.C.; Li, F.-F. Hierarchical semantic indexing for large scale image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA,
20–25 June 2011; pp. 785–792.
30. Lin, K.; Yang, H.-F.; Hsiao, J.-H.; Chen, C.-S. Deep learning of binary hash codes for fast image retrieval.
In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops
(CVPRW), Boston, MA, USA, 7–12 June 2015; pp. 27–35.
31. Ahmad, J.; Sajjad, M.; Mehmood, I.; Rho, S.; Baik, S.W. Saliency-weighted graphs for efficient visual content
description and their applications in real-time image retrieval systems. J. Real Time Image Pr. 2017, 13,
431–447. [CrossRef]
32. Yu, S.; Niu, D.; Zhao, X.; Liu, M. Color image retrieval based on the hypergraph and the fusion of two
descriptors. In Proceedings of the 2017 10th International Congress on Image and Signal Processing,
BioMedical Engineering and Informatics (CISP-BMEI), Shanghai, China, 14–16 October 2017; pp. 1–6.
33. Liu, Z.; Blasch, E.; John, V. Statistical comparison of image fusion algorithms: Recommendations. Inf. Fusion
2017, 36, 251–260. [CrossRef]
34. Zhou, X.S.; Huang, T.S. Relevance feedback in image retrieval: A comprehensive review. Multimedia Syst.
2003, 8, 536–544. [CrossRef]
35. Zhu, X.; Jing, X.Y.; Wu, F.; Wang, Y.; Zuo, W.; Zheng, W.S. Learning Heterogeneous Dictionary Pair
with Feature Projection Matrix for Pedestrian Video Retrieval via Single Query Image. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 4–9 February 2017; pp. 4341–4348.
36. Wang, X.; Wang, Z. A novel method for image retrieval based on structure elements’ descriptor. J. Vis. Commun. Image Represent. 2013, 24, 63–74. [CrossRef]
37. Bian, W.; Tao, D. Biased discriminant euclidean embedding for content-based image retrieval. IEEE Trans. Image Process. 2010, 19, 545–554. [CrossRef] [PubMed]
38. Zheng, W.; Mo, S.; Duan, P.; Jin, X. An improved pagerank algorithm based on fuzzy C-means clustering and information entropy. In Proceedings of the 2017 3rd IEEE International Conference on Control Science and Systems Engineering (ICCSSE), Beijing, China, 17–19 August 2017; pp. 615–618.
39. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556.
40. Simon, M.; Rodner, E.; Denzler, J. Imagenet pre-trained models with batch normalization. arXiv 2016, arXiv:1612.01452.
41. Hiremath, P.S.; Pujari, J. Content based image retrieval using color, texture and shape features. In Proceedings of the 15th International Conference on Advanced Computing and Communications (ADCOM 2007), Guwahati, India, 18–21 December 2007; pp. 780–784.
42. Jégou, H.; Douze, M.; Schmid, C. Hamming embedding and weak geometry consistency for large scale image search-extended version. In Proceedings of the 10th European Conference on Computer Vision, Marseille, France, 12–18 October 2008.
43. Yang, Y.; Newsam, S. Bag-of-visual-words and spatial extensions for land-use classification. In Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA, 2–5 November 2010; ACM: New York, NY, USA, 2010; pp. 270–279.
44. Zou, Q.; Ni, L.; Zhang, T.; Wang, Q. Deep learning based feature selection for remote sensing scene classification. IEEE Geosci. Remote Sens. Lett. 2015, 12, 2321–2325. [CrossRef]
45. Ramírez-Reyes, A.; Hernández-Montoya, A.R.; Herrera-Corral, G.; Domínguez-Jiménez, I. Determining the entropic index q of Tsallis entropy in images through redundancy. Entropy 2016, 18, 299. [CrossRef]
46. Hao, D.; Li, Q.; Li, C. Digital Image Stabilization Method Based on Variational Mode Decomposition and Relative Entropy. Entropy 2017, 19, 623. [CrossRef]
47. Zhu, Z.; Yin, H.; Chai, Y.; Li, Y.; Qi, G. A novel multi-modality image fusion method based on image decomposition and sparse representation. Inf. Sci. 2018, 432, 516–529. [CrossRef]
48. Wang, K.; Qi, G.; Zhu, Z.; Chai, Y. A novel geometric dictionary construction approach for sparse representation based image fusion. Entropy 2017, 19, 306. [CrossRef]
49. Zhu, Z.; Qi, G.; Chai, Y.; Chen, Y. A novel multi-focus image fusion method based on stochastic coordinate coding and local density peaks clustering. Future Internet 2016, 8, 53. [CrossRef]
50. Fang, Q.; Li, H.; Luo, X.; Ding, L.; Rose, T.M.; An, W.; Yu, Y. A deep learning-based method for detecting non-certified work on construction sites. Adv. Eng. Inform. 2018, 35, 56–68. [CrossRef]
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).