(note (code cslai))

Notes on codes, projects and everything

2016 March 25

Processing JSON with dask.bag

Often times, I am dealing with JSONL files, though panda’s DataFrame is great (and blaze to certain extend), however it is offering too much for the job. Most of the received data is in the form of structured text and I do all sorts of work with them. For example checking for consistency, doing replace based on values of other columns, stripping whitespace etc.

(more…)

Random Posts

Some status updates

There are a lot of things I want to post to both here and my personal blogs. However I was sucked into sanctuary for the most of last month. I guess after a month of playing, it is probably time to slowly resume my personal projects.

(more…)
It’s game time!

A new day, and a new post on job application. So this time instead of asking a snippet, I was actually asked to deliver some sort of a full application. Not sure why this was required, but I had fun creating them nonetheless. Though I would say I am not really a fan of creating visual stuff though ~~(oh the crappy animation nearly killed me)~~.

(more…)
Maintaining State with YUI Event

When one start writting Javascript in patterns like the module pattern, then sooner or later he would want to maintain the state when an event handler is called. The reason I am still using YUI to handle my event handling code is because I like how state can be maintained.

(more…)
Statistical Analysis for Social Audit Project

This is the formal draft of my statistical analysis report for the social audit project previously mentioned here. As the project is public by nature, I am cross-posting here for own reference.
(more…)
Random notes on Pandas and Scikit-learn

So I first heard about Panda probably a year ago when I was in my previous job. It looked nice, but I didn’t really get the chance to use it. So practically it is a library that makes data looks like a mix of relational database table and excel sheet. It is easy to do query with it, and provides a way to process it fast if you know how to do it properly (no, I don’t, so I cheated).

(more…)

(note (code cslai))

2016 March 25

Processing JSON with dask.bag

Random Posts

Some status updates

It’s game time!

Maintaining State with YUI Event

Statistical Analysis for Social Audit Project

Random notes on Pandas and Scikit-learn