Identifying conversational message threads by integrating classification and data clustering