|
广西师范大学学报(自然科学版) ›› 2012, Vol. 30 ›› Issue (3): 125-134.
李志欣, 陈宏朝, 吴王景莉, 周生明
LI Zhi-xin, CHEN Hong-chao, WU Jing-li, ZHOU Sheng-ming
摘要: 针对图像检索中存在的“语义鸿沟”问题,本文提出一种语义学习模型进行图像的自动标注。首先提出连续的概率潜在语义分析(PLSA)模型及对应的参数估计算法,并利用最大惩罚似然的方法解决协方差矩阵的奇异性问题;然后,提出一个根据不同模态数据各自的特点进行处理的概率模型,该模型使用连续PLSA和传统PLSA分别建模视觉特征和文本关键词,并通过不对称学习算法发现两种模态之间共有的语义主题,从而能更精确地对未知图像进行标注。通过在分别包含5 000幅和31 695幅图像的两个标准Corel数据集上进行实验,并与几种典型的图像标注方法进行比较的结果表明,文中方法具有更高的精度和更好的效果。
中图分类号:
[1] SMEULDERS A W M,WORRING M,SANTINI S,et al.Content-based imageretrievalat the end of the early years[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2000,22(12):1349-1380. [2] DATTA R,JOSHI D,LI Jia,et al.Image retrieval:ideas,influences,andtrends of the new age[J].ACM Computing Surveys,2008,40(2):5. [3] 李志欣,施智平,李志清,等.图像检索中语义映射方法综述[J].计算机辅助设计与图形学学报,2008,20(8):1085-1096. [4] CHANG E,GOH K,SYCHAY G,et al.CBSA:content-based soft annotation for multimodal image retrieval using Bayes point machines[J].IEEE Transactions on Circuits and Systems for Video Technology,2003,13(1):26-38. [5] LI Jia,WANG J Z.Automatic linguistic indexing of pictures by a statisticalmodeling approach[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2003,25(9):1075-1088. [6] CARNEIRO G,CHAN A B,MORENO P J,et al.Supervised learning of semantic classes for image annotation and retrieval[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2007,29(3):394-410. [7] JEON J,LAVRENKO V,MANMATHA R.Automatic image annotation and retrieval using cross-media relevance models[C]//Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.New York:ACM Press,2003:119-126. [8] LAVRENKO V,MANMATHA R,JEON J.A model for learning the semanticsof pictures[C]//THRUN S,SAUL L K,SCHOLKOPF B.Advances in Neural Information Processing Systems 16.Cambridge:MIT Press,2004:553-560. [9] FENG S L,MANMATHA R,LAVRENKO V.Multiple Bernoulli relevance models for image and video annotation[C]//Proceedings of IEEE Computer Society Conferenceon Computer Vision and Pattern Recognition.Los Alamitos:IEEE Computer SocietyPress,2004:1002-1009. [10] DUYGULU P,BARNARD K,de FREITAS J F G,et al.Object recognitionas machine translation:learning a lexicon for a fixed image vocabulary[M]//Lecture Notes in Computer Science:vol.2353.Berlin:Springer-Varlag,2002:97-112. [11] BARNARD K,DUYGULU P,FORSYTH D,et al.Matching words and pictures[J].Journal of Machine Learning Research,2003,3(2):1107-1135. [12] BLEI D M,JORDAN M I.Modeling annotated data[C]//Proceedingsof the 26thAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval.New York:ACM Press,2003:127-134. [13] MONAY F,GATICA-PEREZ D.Modeling semantic aspects for cross-media image indexing[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2007,29(10):1802-1817. [14] 李志欣,施智平,李志清,等.融合语义主题的图像自动标注[J].软件学报,2011,22(4):801-812. [15] HOFMANN T.Unsupervised learning by probabilistic latent semanticanalysis[J].Machine Learning,2001,42(1/2):177-196. [16] BLEI D M,NG A Y,JORDAN M I.Latent Dirichlet allocation[J].Journal of Machine Learning Research,2003,3(1):993-1022. [17] LI Zhi-xin,SHI Zhi-ping,LIU Xi,et al.Automatic image annotation with continuous PLSA[C]//Proceedings of the 35th IEEE International Conference on Acoustics,Speech and Signal Processing.Los Alamitos:IEEE Computer Society Press,2010:806-809. [18] 李志欣,施智平,刘曦,等.建模连续视觉特征的图像语义标注方法[J].计算机辅助设计与图形学学报,2010,22(8):1412-1420. [19] ORMONEIT D,TRESP V.Averaging,maximum penalized likelihood andBayesian estimation for improving Gaussian mixture probability density estimates[J].IEEE Transactions on Neural Networks,1998,9(4):639-650. |
[1] | 刘电霆, 吴丽娜. 社会网络中基于信任的LDA主题模型领域专家推荐[J]. 广西师范大学学报(自然科学版), 2018, 36(4): 51-58. |
[2] | 唐振军. 基于PCA特征距离的图像哈希算法[J]. 广西师范大学学报(自然科学版), 2016, 34(4): 9-18. |
[3] | 宋俊, 韩啸宇, 黄宇, 黄廷磊, 付琨. 一种面向实体的演化式多文档摘要生成方法[J]. 广西师范大学学报(自然科学版), 2015, 33(2): 36-41. |
[4] | 马媛媛, 吕康, 徐久成. 基于粒计算多层次结构相似度的图像检索[J]. 广西师范大学学报(自然科学版), 2013, 31(3): 127-131. |
[5] | 唐振军, 戴玉敏, 张显全, 张师超. 基于DCT特征点的感知图像Hash函数[J]. 广西师范大学学报(自然科学版), 2012, 30(3): 135-141. |
[6] | 李双群, 徐久成, 张灵均, 李晓艳. 基于相容粒的彩色图像检索算法[J]. 广西师范大学学报(自然科学版), 2011, 29(3): 173-178. |
[7] | 罗辛, 潘乔, 王洪亚, 陈美, 北研二. 基于SOFM的高速图像检索算法实现[J]. 广西师范大学学报(自然科学版), 2011, 29(2): 180-184. |
|
版权所有 © 广西师范大学学报(自然科学版)编辑部 地址:广西桂林市三里店育才路15号 邮编:541004 电话:0773-5857325 E-mail: gxsdzkb@mailbox.gxnu.edu.cn 本系统由北京玛格泰克科技发展有限公司设计开发 |