Clarification on VFLP input

janujan9954 · August 30, 2021, 4:52pm

So in the input-files directory, I have one massive .smi file, where each line has a SMILES string for each compound. So then in the all.ctrl file I specified the collection_folder to be …/input-files. However, when I ran VFLP I got no output. Is the problem the collection_folder parameter that I specified? Or is it the way my .smi file has been formatted (needing one file per SMILES string vs having them all in one file)?

Thanks

Christoph · August 30, 2021, 5:00pm

Dear @janujan9954 ,

you need one SMILES file per ligand.

This was already mentioned in more detail in other threads, such as :

In summary, the ligand library of VFVS and VFLP are of the same structure (one file per ligand, with meta tranches, tranches and collections). Except that for VFLP it is a SMILES file instead of a PDBQT or MOL2 file.

Hope this helps,
Christoph

janujan9954 · August 30, 2021, 5:30pm

Hi Cristoph,

Thanks for the prompt response. I am a little confused however; in VFVS the ligands are stored in tar.gz files, which contain tar.gz files in those as well. If the ligand library of VFVS and VFLP are similar, do I need to compress the directory that holds all of the .smi files into a tar.gz file, and specify that in my collection_folder parameter?

Thanks

janujan9954 · September 1, 2021, 6:47pm

Just to clarify, if I want to convert 20 million compounds, I will need 20 million separate .smi files all organized similarly to the structure for VFVS? I feel like I am misunderstanding something…

Christoph · September 2, 2021, 4:37pm

Hi @janujan9954 ,

Yes, each ligand is stored in its own SMILES file.

Yes, the tar archives and compressed tar archives are needed for VFLP as well.

Hope this helps,
Christoph