An overview of imperfection representation in semistructured data