09-24-2017, 07:53 PM
Hello all,
I'd like to ask you all for help. I'm developing a program that would manipulate address books (from phones, mail clients, cloud storage, etc.). For this, I need to know the data elements they all use. I've settled on vCard as the de facto standard contact exchange container, but there's still a plethora of elements and settings that it can use.
So here is what I need as many people as possible to do:
1. Export your contacts from your phone, mail client, whatever, into a big .vcf file
2. Filter out the personal information, leaving only the technical details
3. Send the file to me
As for point 1, my phone (and I suppose many other Android phones as well) has an option to back up all contacts to an SD card: Contacts -> ... (top right) -> Manage Contacts -> Back up. Thunderbird, in its Address Book view, has a Tools -> Export menu item (note that you need to set the file type to vCard in the Save dialog).
For filtering the personal information out, I've written a simple Lua script that can do the job for you: https://raw.githubusercontent.com/madmax...ymizer.lua
Call the script with the parameters: VcfAnonymizer.lua InputFile.vcf OutputFile.vcf. If they are not given, the input file defaults to Contacts.vcf and the output file defaults to AnonymizedContacts.vcf.
You can inspect the resulting file using a text editor. If you see any personal data left over, let me know and I'll try to fix the script.
I'm only interested in the "name" part of each vCard content line, that is, the part before the colon. Usually, it contains the name of the element, plus possibly some parameters (encoding, type, sorting etc.). I'd like to find out what combinations there are in the wild.
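To illustrate what "the part before the colon" means, here is a rough Python sketch that tallies the distinct name parts in a .vcf text. It is not the Lua script linked above and makes no claim about how that script works; the function names (unfold, name_part, count_name_parts) and the sample data are made up for illustration. It assumes standard line folding (a continuation line starts with a space or tab) and does not handle the quoted-printable soft line breaks (a trailing "=") that some vCard 2.1 exporters produce.

```python
from collections import Counter


def unfold(text):
    """Join folded lines: a line starting with a space or tab continues the previous one."""
    lines = []
    for raw in text.splitlines():
        if raw[:1] in (" ", "\t") and lines:
            lines[-1] += raw[1:]
        else:
            lines.append(raw)
    return lines


def name_part(line):
    """Return the text before the first colon that is outside double quotes, or None."""
    in_quotes = False
    for i, ch in enumerate(line):
        if ch == '"':
            in_quotes = not in_quotes
        elif ch == ":" and not in_quotes:
            return line[:i]
    return None  # no colon at all: not a well-formed content line


def count_name_parts(text):
    """Count how often each name part (element name plus parameters) occurs."""
    counts = Counter()
    for line in unfold(text):
        name = name_part(line)
        if name is not None:
            counts[name] += 1
    return counts


# Made-up sample data, not taken from any real address book.
sample = (
    "BEGIN:VCARD\r\n"
    "VERSION:3.0\r\n"
    "N:Doe;John;;;\r\n"
    "TEL;TYPE=CELL:+1 555 0100\r\n"
    "TEL;TYPE=CELL:+1 555 0101\r\n"
    "EMAIL;TYPE=INTERNET;TYPE=HOME:john@example.com\r\n"
    "NOTE:A long note that the exporter\r\n"
    "  folded onto a second line\r\n"
    "END:VCARD\r\n"
)

if __name__ == "__main__":
    for name, count in sorted(count_name_parts(sample).items()):
        print(count, name)
```

The colon search skips over double-quoted parameter values because those may themselves contain a colon. Lines without any colon are silently ignored here; a real tool would probably want to report them.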
Also, some vCard exporters are buggy (the one on my phone is) and output malformed data. The anonymizing script will detect some of these problems and print a warning to the console (do not copy-paste the entire warning here, as it may contain personal data). Just let me know that there were warnings.
Thank you in advance for your cooperation.