Yeah that keithschwarz.com writeup is super readable and pleasant, with simple illustrations and figures that just leap out and practically explain everything themselves.... :]
The links to Python and Node versions were also helpful. The comments in the Python version [1] were especially helpful at explaining what was going on.
I think the README mis-describes the performance comparison with the naive "huge array of samples" method.
In particular, both methods are O(1) after a precomputation step. What the Alias Method appears to get you is a significant reduction in space required.
http://www.keithschwarz.com/darts-dice-coins/
And the corresponding HN discussion.
https://news.ycombinator.com/item?id=3401510