A question on VFLP input format

mnis · September 9, 2020, 9:28am

Hi,
Can someone please provide some more details on how to prepare the input data for VFLP? Below are the steps I followed to run the pipeline.

I’ve got a bunch of smile files that I curated by hand. I kept them in the following structure:

input-files
|___ ligand1.smi
|___ ligand2.smi
.
.
.

After running ./vf_prepare_folders.sh, I start the jobs with the following command

./vf_start_jobline.sh 1 2 templates/template1.slurm.sh submit 1

The job does start with the following output

Syncing the jobfile of jobline 1 with the controlfile file ../../workflow/control/all.ctrl.
Syncing the jobfile of jobline 2 with the controlfile file ../../workflow/control/all.ctrl.

Submitted batch job 2
The job for jobline 1 has been submitted at Mon Sep  7 09:57:12 UTC 2020.

Submitted batch job 3
The job for jobline 2 has been submitted at Mon Sep  7 09:57:13 UTC 2020.

But once the jobs are finished, I don’t see any output in the output-files folder. Is it because of the wrong input file structure?
Can I just add smiles files as input-files/*.smi or do I need input-files/ligand-library/*.smi ?
Or do I need a deeper level hierarchy like tranche names? If yes, then how do I decide, which letters to use for defining the meta-tranche?
Do I also need to make any changes in todo.all or anywhere else? If yes, what changes?

Thanks,
mnis

janujan9954 · August 12, 2021, 7:59pm

Hi I would like some more info on the input format as well. I have one massive .smi file with all the compounds and don’t know if VFLP will split them into separate pdbqt’s for me.

Christoph · August 26, 2021, 9:25pm

Hi @janujan9954 and @mnis ,

Welcome to the forum @mnis

The question was already posted a few times, see for instance here:

Hope this helps,
Christoph