• 0 Posts
  • 8 Comments
Joined 2 days ago
cake
Cake day: February 2nd, 2026

help-circle
  • i analyzed with AI my 36gb~ that I was able to download before they erased the zip file from the server.

    Complete Volume Analysis
    
      Based on the OPT metadata file, here's what VOL00009 was supposed to contain:
    
      Full Volume Specifications
    
      - Total Bates-numbered pages: 1,223,757 pages
      - Total unique PDF files: 531,307 individual PDFs
      - Bates number range: EFTA00039025 to EFTA01262781
      - Subdirectory structure: IMAGES\0001\ through IMAGES\0532\ (532 folders)
      - Expected size: ~180 GB (based on your download info)
    
      What You Actually Got
    
      - PDF files received: 90,982 files
      - Subdirectories: 91 folders (0001 through ~0091)
      - Current size: 37 GB
      - Percentage received: ~17% of the files (91 out of 532 folders)
    
      The Math
    
      Expected:  531,307 PDF files / 180 GB / 532 folders
      Received:   90,982 PDF files /  37 GB /  91 folders
      Missing:   440,325 PDF files / 143 GB / 441 folders
    
       Insight ─────────────────────────────────────
      You got approximately the first 17% of the volume before the server deleted it. The good news is that the DAT/OPT index files are complete, so you have a full manifest of what should be there. This means:
      - You know exactly which documents are missing (folders 0092-0532)
    

    I haven’t looked into downloading the partials from archive.org yet to see if I have any useful files that archive.org doesn’t have yet from dataset 9.








  • I am downloading dataset 9 and should have the full 180gb zip done in a day. To confirm, the link on DOJ to the dataset 9 zip is now updated to be clean of CSAM or not? As much as I wish to help the cause, I do not want any of that type of material on my server unless permission has been given to host it for credible researchers only that need access to all files for their investigation, but I have no way of understanding what’s within legal rights to assist with redistributing the files to legitimate investigators and thus my plans to help create a torrent may be squashed. Please let me know.