04-18-2018, 01:18 AM
Alright, this seems like the right place to put this
From a now deleted reddit post , the an owner of the website profileengine.com dumped data scraped from publicly available Facebook profiles between 2007 to 2010. The files were up for 2-3 days then the Internet Archive took them down. The torrent files released were using Internet Archive as a tracker, which is now invalidated.
The below were taken from http://cilidaquan.xyz/cldq/profile_engine/1-0-0.html
So far, pretty good number of seeds.
This stuff is high volume. The README recommends not extracting images in non-journaling file system. DO NOT EXTRACT IMAGES IN EXT2/FAT32/NTFS, you will run out of space very quickly
Readme recommends using ZFS and having 5TB of disk space.
Total 1 TB of data, 0.6% is text, the rest is image data.
Magnet links
magnet:?xt=urn:btih:9ef76357f6ea4607a02274154deefb94b21b10b6&dn=profile_engine_database (this is the text data, contains people's names and associates)
magnet:?xt=urn:btih:dccd3299662373420a26c8fcf6b53da2fb3b12a0&dn=profile_engine_images_square_1
magnet:?xt=urn:btih:03ef5b00bbb5fea909b34b192203872d2858fbc4&dn=profile_engine_images_large_2
magnet:?xt=urn:btih:428b8fee5f522f32c8af822abea5aff812ee2c6c&dn=profile_engine_images_large_3
magnet:?xt=urn:btih:5f8f6ca856c06461e9edfaee8877a882228fd919&dn=profile_engine_images_large_4
magnet:?xt=urn:btih:99a864bb298a1dcc83f115abd9911bd8000b27d7&dn=profile_engine_images_large_5
magnet:?xt=urn:btih:c8196bf0da72a400b6fcb02db257852cc6bdab74&dn=profile_engine_images_large_6
magnet:?xt=urn:btih:95e6fa0998826a620434c80c546f4d18b37e072b&dn=profile_engine_images_large_7
magnet:?xt=urn:btih:4989c8c1a9ca3956b358bf94b092da17442b6e1f&dn=profile_engine_images_large_8
magnet:?xt=urn:btih:d10f3e2cf5ed750aebc2cd7a8b8ac9e5d4338d70&dn=profile_engine_images_large_9
magnet:?xt=urn:btih:c039d3f183d138ee2299696f4bb330100a732cf0&dn=profile_engine_images_large_10
magnet:?xt=urn:btih:966467b85c06b6ed71190ea056a64fd0a06c5e13&dn=profile_engine_images_large_11
magnet:?xt=urn:btih:be9bece2aab5beb24204d6d898b0219ec66b4f5f&dn=profile_engine_images_large_12
From a now deleted reddit post , the an owner of the website profileengine.com dumped data scraped from publicly available Facebook profiles between 2007 to 2010. The files were up for 2-3 days then the Internet Archive took them down. The torrent files released were using Internet Archive as a tracker, which is now invalidated.
The below were taken from http://cilidaquan.xyz/cldq/profile_engine/1-0-0.html
So far, pretty good number of seeds.
This stuff is high volume. The README recommends not extracting images in non-journaling file system. DO NOT EXTRACT IMAGES IN EXT2/FAT32/NTFS, you will run out of space very quickly
Readme recommends using ZFS and having 5TB of disk space.
Total 1 TB of data, 0.6% is text, the rest is image data.
Magnet links
magnet:?xt=urn:btih:9ef76357f6ea4607a02274154deefb94b21b10b6&dn=profile_engine_database (this is the text data, contains people's names and associates)
magnet:?xt=urn:btih:dccd3299662373420a26c8fcf6b53da2fb3b12a0&dn=profile_engine_images_square_1
magnet:?xt=urn:btih:03ef5b00bbb5fea909b34b192203872d2858fbc4&dn=profile_engine_images_large_2
magnet:?xt=urn:btih:428b8fee5f522f32c8af822abea5aff812ee2c6c&dn=profile_engine_images_large_3
magnet:?xt=urn:btih:5f8f6ca856c06461e9edfaee8877a882228fd919&dn=profile_engine_images_large_4
magnet:?xt=urn:btih:99a864bb298a1dcc83f115abd9911bd8000b27d7&dn=profile_engine_images_large_5
magnet:?xt=urn:btih:c8196bf0da72a400b6fcb02db257852cc6bdab74&dn=profile_engine_images_large_6
magnet:?xt=urn:btih:95e6fa0998826a620434c80c546f4d18b37e072b&dn=profile_engine_images_large_7
magnet:?xt=urn:btih:4989c8c1a9ca3956b358bf94b092da17442b6e1f&dn=profile_engine_images_large_8
magnet:?xt=urn:btih:d10f3e2cf5ed750aebc2cd7a8b8ac9e5d4338d70&dn=profile_engine_images_large_9
magnet:?xt=urn:btih:c039d3f183d138ee2299696f4bb330100a732cf0&dn=profile_engine_images_large_10
magnet:?xt=urn:btih:966467b85c06b6ed71190ea056a64fd0a06c5e13&dn=profile_engine_images_large_11
magnet:?xt=urn:btih:be9bece2aab5beb24204d6d898b0219ec66b4f5f&dn=profile_engine_images_large_12