Biboroku

Tagged: Math

Near-Duplicate Detection using MinHash: Background

Written by Taro Sato, on . Tagged: stats Python math

There are numerous pieces of duplicate information served by multiple sources on the web. Many news stories that we receive from the media tend to originate from the same source, such as the Associated Press. When such contents are scraped off the web for archiving, a need may arise to categorize documents by their similarity (not in the sense of the meaning of the text but the character-level or lexical matching). ... Continue reading.

Half-Light Radii for Various Profiles

Written by Taro Sato, on . Tagged: astro math

For a radial profile of I(r), the enclosed flux within the radius r is given by F(r)=02πdϕ0rdrrI(r,ϕ) . I’m only concerned about azimuthal symmetric cases, so F(r)=2π0rdrrI(r) . ... Continue reading.