Semi-Supervised Learning Based Tag Recommendation for Docker Repositories

Source: Journal of Computer Science and Technology (English edition)
Docker has recently become a mainstream technology for providing reusable software artifacts, and developers can easily build and deploy their applications with it. A large number of reusable Docker images are now publicly shared in online communities, and semantic tags can help developers reuse those images effectively. However, the communities do not provide tagging services, and manual tagging is exhausting and time-consuming. This paper addresses the problem with a semi-supervised learning-based approach named SemiTagRec. SemiTagRec contains four components: (1) the predictor, which calculates the probability of assigning a specific tag to a given Docker repository; (2) the extender, which introduces new tags as candidates based on tag correlation analysis; (3) the evaluator, which measures the candidate tags using a logistic regression model; and (4) the integrator, which calculates a final score by combining the results of the predictor and the evaluator, and then assigns the high-scoring tags to the given Docker repository. SemiTagRec adds the newly tagged repositories to the training data for the next round of training. In this way, it iteratively trains the predictor on the cumulative set of tagged repositories and the extended tag vocabulary to achieve high tag recommendation accuracy. The experimental results show that SemiTagRec outperforms the other approaches, with an accuracy of 0.688 in terms of Recall@5 and 0.781 in terms of Recall@10.
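The integrator step and the Recall@k metric mentioned in the abstract can be illustrated with a small sketch. The abstract does not say how the predictor and evaluator results are combined, so the weighted sum below, the `alpha` parameter, and all function names are assumptions made for illustration. `recall_at_k` uses a common definition (the per-repository fraction of true tags found in the top k, averaged over repositories), which may differ from the paper's exact formulation.

```python
from typing import Dict, List, Set


def integrate(pred: Dict[str, float], evald: Dict[str, float],
              alpha: float = 0.5) -> Dict[str, float]:
    """Combine predictor and evaluator scores per tag.

    Assumption: a weighted sum with weight `alpha`; the source only says
    the two results are combined into a final score.
    """
    tags = set(pred) | set(evald)
    return {t: alpha * pred.get(t, 0.0) + (1 - alpha) * evald.get(t, 0.0)
            for t in tags}


def top_k(scores: Dict[str, float], k: int) -> List[str]:
    """Return the k highest-scoring tags (ties broken alphabetically)."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tag for tag, _ in ranked[:k]]


def recall_at_k(recommended: List[List[str]], truth: List[Set[str]],
                k: int) -> float:
    """Mean fraction of each repository's true tags found in its top k."""
    per_repo = [len(set(rec[:k]) & true) / len(true)
                for rec, true in zip(recommended, truth) if true]
    return sum(per_repo) / len(per_repo) if per_repo else 0.0


# Hypothetical scores for one repository (not data from the paper).
predictor_scores = {"nginx": 0.9, "web": 0.6, "proxy": 0.2}
evaluator_scores = {"web": 0.8, "http": 0.7}  # includes an extended tag

final_scores = integrate(predictor_scores, evaluator_scores, alpha=0.5)
recommended_tags = top_k(final_scores, k=2)
print(recommended_tags)
print(recall_at_k([recommended_tags], [{"nginx", "http"}], k=2))
```

In a semi-supervised loop of the kind the abstract describes, the repositories tagged this way would be added to the training set before the predictor is retrained in the next round.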