This is an interesting article on the topic. Here is one of the quotes that I think is blunt but spot on.
"Assuming that you manage to get cleaner data from Twitter...
then you have to figure out what people actually are saying “in whatever language it is they’re writing in.
We’ve put years and years of work to understand grammar, capitalization, and crap like that, and there ain’t none.”"