A Systematic Approach to Anonymity

Sabah S. Al-Fedaghi


Personal information anonymity concerns anonymizing information that identifies individuals, in contrast to anonymizing activities such as downloading copyrighted items on the Internet. The term may refer, among other things, to encrypting personal data, to generalization and suppression as in k-anonymization, or to the ‘untraceability’ or ‘unidentifiability’ of an identity in a network. A common notion is hiding the “identities” of the persons to whom the data refers. We introduce a systematic framework for personal information anonymization that uses a new definition of private information based on referents to persons in linguistic assertions. Anonymization is classified with respect to the content of the information, its proprietor (the person it refers to), or its possessor. A general methodology for anonymizing private information is introduced, based on canonical forms that include a personal identity. The methodology is applied to both textual and tabular data.


