Monthly Archives: April 2014

Using Custom Theme with SyntaxHighlighter Evolved

I have been using SyntaxHighlighter Evolved for displaying code snippets on this site. While the WordPress plugin has been working very well, I seem to lose my custom CSS styles every time the updated plugin gets installed. I want to … Continue reading

Posted in Uncategorized | Tagged , | 2 Comments

PCA and Biplot using Python

There are several ways to run principal component analysis (PCA) using various packages (scikit-learn, statsmodels, etc.) or even just rolling out your own through singular-value decomposition and such. Visualizing the PCA result can be done through biplot. I was looking … Continue reading

Posted in Uncategorized | Tagged , , , , | 2 Comments

Near-duplicate Detection using MinHash: Background

There are numerous pieces of duplicate information served by multiple sources on the web. Many news stories that we receive from the media tend to originate from the same source, such as the Associated Press. When such contents are scraped … Continue reading

Posted in Uncategorized | Tagged , | 7 Comments

A Trick for Computing the Sum of Geometric Series

Say if I need to compute the sum of a series like this one: (1)   where . This series looks like a geometric series in which case the sum can be computed from     The coefficients vary, so … Continue reading

Posted in Uncategorized | Tagged | Leave a comment