biboroku

Interpreting A/B Test using Python

Written by Taro Sato on 2014-05-12

Suppose we ran an A/B test with two different versions of a web page, \( a \) and \( b \), for which we count the number of visitors and whether they convert or not. We can summarize this in a contingency table showing the frequency distribution of the... Continue reading.

Brand Positioning by Correspondence Analysis

Written by Taro Sato on 2014-05-07

I was reading an article about visualization techniques using multidimensional scaling (MDS), the correspondence analysis in particular. The example used R, but as usual I want to find ways to do it on Python, so here goes. The correspondence analysis... Continue reading.

PCA and Biplot using Python

Written by Taro Sato on 2014-04-24

There are several ways to run principal component analysis (PCA) using various packages (scikit-learn, statsmodels, etc.) or even just rolling out your own through singular-value decomposition and such. Visualizing the PCA result can be done through... Continue reading.

Near-Duplicate Detection using MinHash: Background

Written by Taro Sato on 2014-04-12

There are numerous pieces of duplicate information served by multiple sources on the web. Many news stories that we receive from the media tend to originate from the same source, such as the Associated Press. When such contents are scraped off the web... Continue reading.

Using Japanese on Debian Wheezy

Written by Taro Sato on 2012-09-21

The goal is to make the system capable for Japanese input, while letting the base system remain English. For the Japanese input method, I had been using Anthy, but I will be using mozc, which is now better supported and presumably much better (it is).... Continue reading.

Installing F.lux on Debian Squeeze

Written by Taro Sato on 2010-11-20

In another feeble attempt to become a “morning person,” I’ve decided to follow the usual advice that I should not be looking at bright LCD screen at night to help myself going into the “sleep mode.” Hence F.lux. I want it to launch automatically at the... Continue reading.

Testing if a Point is Inside a Polygon in Python

Written by Taro Sato on 2010-06-24

Finally got around to find this out by Googling. It’s a useful function so I reproduce it here for copy & paste: def inside_polygon(x, y, points): """ Return True if a coordinate (x, y) is inside a polygon defined by the list of verticies [(x1, y1),... Continue reading.

Installing SExtractor on Mac OS X Snow Leopard

Written by Taro Sato on 2010-03-22

(November 25, 2010) I screwed up SExtractor installation for no reason and in an attempt to fix it, I’ve found a few things. After wasting a few hours again, I think this installation note can work for most people who can actually build software from... Continue reading.

Installing Squid Proxy Server on Mac OS X Snow Leopard

Written by Taro Sato on 2010-03-22

My goal is to set up a very basic proxy server on my Mac box on campus, so that I can have full access to subscription-based academic journals via the proxy on my laptop even when I am off campus. Some schools provide such (library) proxies but my... Continue reading.

Setting Up Squid Proxy Server on Mac OS X Leopard

Written by Taro Sato on 2009-07-30

My goal is to set up a very basic proxy server on my Mac box on campus, so that I can have full access to subscription-based academic journals via the proxy on my laptop even when I am off campus. Some schools provide such (library) proxies but my... Continue reading.