Mining Massive Datasets

This article reviews the MOOC course: Mining Massive Datasets.

I would rate this course as 8/10. Its content is mainly based on the instructors’ book Mining Massive Datasets.

The seven-week course covers a large amount of material: MapReduce, link analysis, minhashing, locality-sensitive hashing, distance measures, frequent itemsets, data stream mining, large graph analysis, recommender systems, dimensionality reduction, clustering, computational advertising, SVMs, and decision trees.


MongoDB for Java Developers

This article reviews the MOOC course: MongoDB for Java Developers.

I would rate this course as 9/10. It is a good entry-level MongoDB course for Java developers.

The seven-week course covers an introduction, CRUD, schema design, performance, the aggregation framework, application engineering, and case studies, and ends with a final exam.


System Design Interview

This article explains how to prepare for system design questions in interviews at IT companies.

System design is a very broad topic. Even a software engineer with many years of experience at a top IT company may not be an expert in system design. If you want to become an expert, you need to read many books and articles and solve real large-scale system design problems.

This repository only teaches you a systematic approach to handling the system design interview in a short time. You can dive deeper into each topic if you have time. You are, of course, welcome to add your thoughts!


Never Too Late

It’s never too late to begin a new challenge! Check out when these CEOs started their companies.


Build Your Blog on GitHub with Jekyll

The three stages of writing a blog:

1. You use an existing blog system such as Blogger or WordPress.

2. Due to the limitations of the public blog system, you use a third-party blog system and host it on your own space or on a public cloud, such as GD-cms on GAE.

3. You want to write the blog locally with full control and let another website manage the backup.
