How to evaluate sentiment classifiers for Twitter time-ordered data?

How to evaluate sentiment classifiers for Twitter time-ordered data?