我是靠谱客的博主 舒心大侠,最近开发中收集的这篇文章主要介绍记录下minhash计算流程,觉得挺不错的,现在分享给大家,希望可以做个参考。

概述

以下是从教科书截取过来的讲解的非常清晰,记录一下:


Now, let us simulate the algorithm for computing the signature matrix.
Initially, this matrix consists of all ∞’s:


First, we consider row 0 of Fig. 3.4. We see that the values of h1(0) and h2(0) are both 1. The row numbered 0 has 1’s in the columns for sets S1 and S4, so only these columns of the signature matrix can change. As 1 is less than ∞, we do in fact change both values in the columns for S1 and S4. The current estimate of the signature matrix is thus:



Now, we move to the row numbered 1 in Fig. 3.4. This row has 1 only in S3, and its hash values are h1(1) = 2 and h2(1) = 4. Thus, we set SIG(1, 3) to 2 and SIG(2, 3) to 4. All other signature entries remain as they are because their columns have 0 in the row numbered 1. The new signature matrix:






最后

以上就是舒心大侠为你收集整理的记录下minhash计算流程的全部内容,希望文章能够帮你解决记录下minhash计算流程所遇到的程序开发问题。

如果觉得靠谱客网站的内容还不错,欢迎将靠谱客网站推荐给程序员好友。

本图文内容来源于网友提供,作为学习参考使用,或来自网络收集整理,版权属于原作者所有。
点赞(46)

评论列表共有 0 条评论

立即
投稿
返回
顶部