A statistical approach to crosslingual natural language tasks