ulf1/
Split strings into (character-based) k-shingles
USD raised
Recent activities
add 1 wildcard for n letters
v0.6.2 seems broken
Actually store the README.rst file in the repo before uploading to pypi
add code quality checks
Rename `shingling_k` to `shingleseqs_k`
Selection Algorithm for k-shingles
centered padding for shingle sequences
Delete commented code lines
Set `memo` dictionary with start values
Check `if snew not in memo:` before calling `select_most_frequent_shingles`
add single-chars to memo automatically
There could be multiple matches. Select the matched pattern with lowest number of wildcards
Rename VOCAB to TOKENLIST
add containment similarity metric
Docstring missing
add ks.metrics.jaccard to README.md
re.error: missing ), unterminated subpattern at position 6
doesn't split consecutive wildcards
numpy warning
ensure that len(s)>=k-1 before shingling
Offset is required
typo encode_multi_match_test
© 2019 BoostIO, Inc.