Blog
How to Parameterize a Small-Molecule Ligand for a GROMACS Simulation: CGenFF vs ACPYPE vs ATB
- June 30, 2026
- Posted by: Stem Skills Lab
- Category: Molecular Modeling

To parameterize a small-molecule ligand for GROMACS, pick the tool that matches your protein force field: CGenFF for CHARMM36, GAFF via antechamber and ACPYPE for AMBER, the Automated Topology Builder for GROMOS, and LigParGen for OPLS. Each one assigns atom types, partial charges and bonded terms, then outputs a GROMACS topology you add to your system.
You chose a force field, ran pdb2gmx, and it failed on your ligand with a message like Residue 'LIG' not found in residue topology database. That is expected. The previous step in this series covered how to choose a force field for your GROMACS simulation, and it ended on a promise: the ligand needs its own treatment. This guide is that treatment. It explains why pdb2gmx cannot build a small-molecule topology, matches each parameterization tool to its force field, and walks the practical workflow and the pitfalls that quietly ruin protein-ligand systems.
This is a spoke in our learn molecular dynamics with GROMACS series. To see where ligand parameterization sits in a research career, the full computational biology skills roadmap maps the path from first install to thesis.
Why can’t pdb2gmx parameterize a ligand?
Because pdb2gmx is a lookup tool, not a parameter generator. It builds a topology by matching each residue in your structure against a residue topology database, the .rtp files shipped with every force field. Those databases contain the standard building blocks: the twenty amino acids, the common nucleotides, water, and a handful of ions. The GROMACS manual is explicit that pdb2gmx works from these predefined building blocks, which is why a drug-like molecule that no force field developer has pre-tabulated simply is not found.
A small-molecule ligand has bonds, angles, dihedrals and partial charges that nobody added to the .rtp file in advance. Generating those values is a separate job, done by a dedicated parameterization tool, and the output is a standalone topology (an .itp file plus matching coordinates) that you include in your system alongside the protein topology that pdb2gmx did produce. Understanding this split is the whole point: pdb2gmx handles the known, a parameterization tool handles the unknown.
Which tool matches which force field?
This is the rule that prevents the most common beginner mistake. A ligand topology is only valid with the force field it was built for, because partial charges and atom types are fitted as a consistent set. You cannot take a GAFF ligand and drop it into a CHARMM36 protein and expect meaningful energies. Pick the protein force field first, then use its companion ligand tool.
| Tool | Force field it serves | Charge method | Access | GROMACS output path |
|---|---|---|---|---|
| CGenFF | CHARMM36 / CHARMM36m | Charges by analogy (penalty-scored) | Web server + downloadable program | cgenff_charmm2gmx.py conversion script |
| GAFF/GAFF2 via ACPYPE | AMBER (ff14SB, ff19SB) | AM1-BCC (via antechamber) | Command line (AmberTools + ACPYPE) | ACPYPE writes .top/.itp/.gro directly |
| ATB | GROMOS (54A7 and newer) | QM-derived (submitted job) | Web server (atb.uq.edu.au) | Download GROMACS-format .itp |
| LigParGen | OPLS-AA | 1.14*CM1A or CM1A-LBCC | Web server (Yale) | Download GROMACS .itp/.gro |
Read the table top to bottom and you see the same constraint from the force-field post, now in reverse: the parameterization tool you are allowed to use is decided the moment you picked CHARMM, AMBER, GROMOS or OPLS. There is no universal ligand topology.
Want the guided, hands-on version?
Our live Molecular Modeling & MD Simulations cohort bootcamp takes you from zero to running real docking and MD workflows, with a portfolio project for your grad-school applications.
How does CGenFF parameterization work for CHARMM36?
CGenFF is the CHARMM General Force Field, designed to extend CHARMM36 to drug-like organic molecules. Its developers describe it as “a force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields” (Vanommeslaeghe et al., Journal of Computational Chemistry, 2010). It works by analogy: the program reads your molecule, finds the closest already-parameterized chemistry in its database, and transfers atom types, charges and bonded terms.
The practical workflow is short but order-sensitive:
- Build the ligand with correct protonation and explicit hydrogens, then save it as a
.mol2file (Avogadro or Open Babel both do this). - Submit that file to the CGenFF web server or run the standalone program to get a CHARMM stream file (
.str) containing the topology and parameters. - Convert to GROMACS format with the
cgenff_charmm2gmx.pyscript from the MacKerell lab, passing your residue name, the.mol2, the.strand the CHARMM36 force-field folder.
The detail that beginners skip is the penalty score. CGenFF reports a numeric penalty for every charge and parameter it assigns by analogy. As a working guide from the program’s own output, penalties at or below 10 indicate a good analogy, roughly 10 to 50 means the value deserves validation, and anything above 50 signals the assignment is unreliable and benefits from quantum-chemistry refinement. Check your penalties before trusting the topology. Justin Lemkul’s widely used protein-ligand complex GROMACS tutorial walks the full CGenFF-to-GROMACS path with a real drug molecule and is the canonical reference for this route.
How does the AMBER route work with antechamber and ACPYPE?
If you chose an AMBER protein force field, the companion is GAFF, the General AMBER Force Field, introduced for “rational drug design” and built to cover most organic molecules made of C, N, O, H, S, P and the halogens (Wang et al., Journal of Computational Chemistry, 2004). The path has two stages. First, antechamber (part of the free AmberTools package) assigns GAFF atom types and AM1-BCC partial charges. Then ACPYPE, the AnteChamber Python Parser Interface, converts the AMBER topology into GROMACS format.
A representative command sequence looks like this:
antechamber -i ligand.pdb -fi pdb -o ligand.mol2 -fo mol2 -c bcc -nc 0
acpype -i ligand.mol2The -nc 0 flag sets the net molecular charge; get it wrong and every downstream charge is wrong, so confirm the formal charge of your protonated ligand first. ACPYPE then writes a folder containing the GROMACS .itp, .top and .gro files ready to include. ACPYPE was published as a tool “to generate topologies for chemical compounds” for use with packages including GROMACS (Sousa da Silva and Vranken, BMC Research Notes, 2012), and it remains the standard bridge from AmberTools to GROMACS.
When should you use ATB or LigParGen instead?
Use the Automated Topology Builder (ATB) when your protein force field is GROMOS. ATB is a web server that generates GROMOS-compatible topologies from a submitted structure using quantum-chemistry-derived charges, described by its authors as “an automated topology builder and repository for biomolecular simulations” (Malde et al., Journal of Chemical Theory and Computation, 2011). You upload the molecule, wait for the job to finish (QM submissions are not instant), and download a GROMACS-format topology. ATB also maintains a public repository, so a common ligand may already be parameterized.
Use LigParGen when you chose OPLS-AA. It is a Yale web server that produces OPLS-AA parameters with CM1A or CM1A-LBCC charges and offers a direct GROMACS download (Dodda et al., Nucleic Acids Research, 2017). For both servers the principle is the same as before: the output is only valid inside the force-field family it was built for.
What pitfalls ruin a ligand topology?
Most failed protein-ligand simulations trace back to one of three errors made before parameterization even starts.
- Wrong protonation state. A crystal structure rarely shows hydrogens. If you parameterize a carboxylic acid that should be deprotonated at physiological pH, or miss a protonated amine, the charges are simply wrong. Decide the protonation state for your pH first, add explicit hydrogens, and only then parameterize.
- Wrong net charge. The charge you tell antechamber (or that the server infers) must match the real formal charge of the protonated molecule. A mismatch corrupts the entire charge distribution.
- Ignoring atom typing and charge groups. Each tool assigns atom types by pattern matching; unusual chemistry (uncommon rings, metals, rare heteroatoms) is where analogy breaks down. This is exactly what the CGenFF penalty score flags, and it is why you validate high-penalty atoms rather than trusting them blindly.
A defensible system is one where you can state, in a methods section, which tool produced the ligand parameters, what charge model it used, and that you checked protonation and any high-penalty assignments. That sentence is what separates a thesis-grade workflow from a lucky one.
Want the guided, hands-on version?
Our live Molecular Modeling & MD Simulations cohort bootcamp takes you from zero to running real docking and MD workflows, with a portfolio project for your grad-school applications.
Frequently asked questions
Do I need a paid licence for any of these tools?
No. AmberTools, ACPYPE, the ATB and LigParGen web servers, and academic access to CGenFF are all free for non-commercial research use. You can build a complete protein-ligand workflow without paying for software.
Can I reuse a ligand topology from a published paper?
Only if the paper used the same force field you are using, and ideally only if it published the actual topology file. Charges and atom types are force-field specific, so a CHARMM ligand cannot be reused in an AMBER system. Check the repository (the ATB hosts many ready topologies) before regenerating.
What net charge should I enter for my ligand?
The formal charge of the molecule in the protonation state you simulate at, summed over all atoms. For a neutral drug it is 0; for a phosphate or carboxylate it may be negative. Confirm it from the chemistry, not from a default value.
My CGenFF penalties are high. Is the topology unusable?
Not necessarily, but treat it with caution. Penalties above 50 mean the analogy is weak and the assignment should be validated or refined with quantum chemistry before production runs. High penalties on a peripheral atom matter less than high penalties near the binding interface.
Does the ligand tool depend on the water model too?
Indirectly. The water model is tied to the protein force field, and the ligand tool is tied to the same force field, so keeping the trio consistent (force field, water model, ligand tool) is the same single decision applied three times.
Written by the StemSkills Lab team, structural and computational scientists with more than ten years of combined experience in sequence and structural bioinformatics, drug discovery and design, and multiscale molecular modeling.