Size and structure of REAL space (2022q12)

Hi all,

First off, big fan of VirtualFlow—amazing resource and documentation!
I’m planning to run screens on the Expanse cluster at SDSC with some help from their team (not a Linux pro myself yet). I’d be grateful for clarification on a couple of key points:

  • What is the approximate size and file structure of the REAL Space 2022q12 library in ready-to-dock format?

  • How many files should I expect?

  • Is it feasible to download it to a local HPC cluster, or is AWS hosting required?

  • Are there any AWS egress costs or other constraints when accessing or downloading this dataset?