Frequency and Similarity-Aware Partitioning for Cloud Storage Based on Space-Time Utility Maximizati

来源 :Tsinghua Science and Technology | 被引量 : 0次 | 上传用户:xiuxiumumu
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
With the rise of various cloud services, the problem of redundant data is more prominent in the cloud storage systems. How to assign a set of documents to a distributed file system, which can not only reduce storage space, but also ensure the access efficiency as much as possible, is an urgent problem which needs to be solved.Space-efficiency mainly uses data de-duplication technologies, while access-efficiency requires gathering the files with high similarity on a server. Based on the study of other data de-duplication technologies, especially the Similarity-Aware Partitioning(SAP) algorithm, this paper proposes the Frequency and Similarity-Aware Partitioning(FSAP) algorithm for cloud storage. The FSAP algorithm is a more reasonable data partitioning algorithm than the SAP algorithm. Meanwhile, this paper proposes the Space-Time Utility Maximization Model(STUMM), which is useful in balancing the relationship between space-efficiency and access-efficiency. Finally, this paper uses 100 web files downloaded from CNN for testing, and the results show that, relative to using the algorithms associated with the SAP algorithm(including the SAP-Space-Delta algorithm and the SAP-Space-Dedup algorithm), the FSAP algorithm based on STUMM reaches higher compression ratio and a more balanced distribution of data blocks. With the rise of various cloud services, the problem of redundant data is more prominent in the cloud storage systems. How to assign a set of documents to a distributed file system, which can not only reduce storage space, but also ensure the access efficiency as much as possible, is an urgent problem which needs to be solved. space-efficiency mainly uses data de-duplication technologies, while access-efficiency requires gathering the files with high similarity on a server. Based on the study of other data de- duplication technologies, especially the Similarity-Aware Partitioning (SAP) algorithm, this paper proposes the Frequency and Similarity-Aware Partitioning (FSAP) algorithm for cloud storage. The FSAP algorithm is a more reasonable data partitioning algorithm than the SAP algorithm. Meanwhile, this paper Propping the Space-Time Utility Maximization Model (STUMM), which is useful in balancing the relationship between space-efficiency and access-efficiency. Finally, this paper uses 100 web files downloaded from CNN for testing, and the results show that, relative to using the algorithms associated with the SAP algorithm (including the SAP-Space-Delta algorithm and the SAP-Space-Dedup algorithm), the FSAP algorithm based on STUMM reaches higher compression ratio and a more balanced distribution of data blocks.
其他文献
近年来,济南市中小企业发展迅速,但也存在着企业技术创新能力不强、政府服务不到位、不及时等问题,建议通过加大政府资金政策等扶持力度、建立健全社会化服务系统等方面来促
目前的中国油画市场由于各种因素的影响,油画市场尤其是现当代油画得到了很大的发展。我们可以从历年油画作品得拍卖记录上了解到油画的价格不断攀升,甚至一些一线艺术家的作品
《黄河怨》是一首能体现女高音程度的艺术歌曲,演唱难度大,有很震撼的艺术影响力。表达了丧子亡夫,被日寇凌辱的妇人对侵略者罪行的愤怒揭露侵略者的残酷暴行。本文将对这首作品
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7
期刊
本文通过对永宁县闽宁镇精准扶贫政策措施的分析,肯定了闽宁镇在产业扶贫、易地搬迁扶贫、金融扶贫、教育扶贫以及社会保障兜底五个层面取得的成果,同时指出了扶贫工作中存在
穆时英是中国现代小说家,新感觉派的代表人物之一,穆时英从1929年开始从事文学创作,次年发表第一部小说《咱们的世界》,很快就震惊文坛,后来,穆时英转战现代主义创作,发表了《夜总会
在对株洲市的投资发展现状分析的基础上,针对当前投资过程中存在的问题进行深刻探讨,试图通过其原因分析,并结合当今供给侧结构性改革的要求,提出以有效投资推进供给侧结构性
目的:本文主要探讨人性化服务在小儿外科病房护理工作中的临床应用价值。方法:在我院2013年1月至2014年12月期间所收治的外科患儿中选取60例作为此次研究对象,随机分为观察组和
现代企业管理制度的特征是产权清晰、权责明确、政企分开及管理科学,而现代企业的科学管理离不开科学化、规范化的制度体系.企业授权管理制度作为企业科学管理的内容之一,对
声乐套曲《冬之旅》的艺术价值对于我们美声演唱专业来说是非常有必要去研究的。《春梦》这首歌是整部套曲的第十一首其旋律非常优美,对于这首歌的演唱分析我们要从多方面进行