ss07 Python Scripting
ss07 Python Scripting
Scripting with
Python
Schrödinger Suite 2007
Schrödinger Press
Copyright © 2007 Schrödinger, LLC. All rights reserved. CombiGlide, Epik, Glide,
Impact, Jaguar, Liaison, LigPrep, Maestro, Phase, Prime, PrimeX, QikProp, QikFit,
QikSim, QSite, SiteMap, and Strike are trademarks of Schrödinger, LLC.
Schrödinger and MacroModel are registered trademarks of Schrödinger, LLC.
MCPRO is a trademark of William L. Jorgensen.
To the maximum extent permitted by applicable law, this publication is provided “as
is” without warranty of any kind. This publication may contain trademarks of other
companies.
Please note that any third party programs (“Third Party Programs”) or third party
Web sites (“Linked Sites”) referred to in this document may be subject to third
party license agreements and fees. Schrödinger, LLC and its affiliates have no
responsibility or liability, directly or indirectly, for the Third Party Programs or for
the Linked Sites or for any damage or loss alleged to be caused by or in connection
with use of or reliance thereon. Any warranties that we make regarding our own
products and services do not apply to the Third Party Programs or Linked Sites, or
to the interaction between, or interoperability of, our products and services and the
Third Party Programs. Referrals and links to Third Party Programs and Linked Sites
do not constitute an endorsement of such Third Party Programs or Linked Sites.
May 2007
Contents
Document Conventions .................................................................................................... vii
2.3 Some Useful Things to Know About the Python Language .............................. 4
In addition to the use of italics for names of documents, the font conventions that are used in
this document are summarized in the table below.
In descriptions of command syntax, the following UNIX conventions are used: braces { }
enclose a choice of required items, square brackets [ ] enclose optional items, and the bar
symbol | separates items in a list from which one item must be chosen. Lines of command
syntax that wrap should be interpreted as a single command.
In this document, to type text means to type the required text in the specified location, and to
enter text means to type the required text, then press the ENTER key.
References to literature sources are given in square brackets, like this: [10].
Chapter 1
Chapter 1: Introduction
Since the beginning, Maestro has had a command language. However, until recently, scripting
abilities have been limited to simple lists of Maestro commands. Feedback from users has
shown us that many people need abilities well beyond that. For this reason Schrödinger
provides improved scripting ability with the well-known scripting language Python. The
combination of Python, the Maestro command language, and Python functions for using
Maestro functionality, is very powerful and greatly expands the capability for automating and
customizing Maestro.
On top of the Maestro interface, the Python tools provided with Schrödinger’s software include
interfaces to the Job Control facility. With these interfaces, scripting capabilities can be
extended into automated workflows that combine Schrödinger’s products in ways that suit the
user’s needs.
Python is easy to use and fun to learn. It is a straightforward language that doesn’t require the
kind of learning curve normally associated with a programming language.
This document aims to provide a non-technical introduction to using Python within Maestro.
This tutorial provides lots of examples and includes a brief description of the most important
things you need to know about Python. While a detailed description of how to program in
Python is beyond the scope of this tutorial, that information is readily available elsewhere.
Class and function documentation for Schrödinger modules is present only in the HTML
Python documentation, which you can open from the HTML documentation index at
$SCHRODINGER/docs/ss07_index.html. In addition, an overview of changes between
versions 3.0 and 3.5 of the Python releases is provided. See the HTML documentation for
questions about specific functionality not covered in this tutorial.
Chapter 2
1. Please see the notice regarding third party programs and third party Web sites on the copyright page at the front
of this manual.
Indentation matters in Python. Unlike other scripting and programming languages which use
{} to group statements together, statements in Python are grouped simply by the level to which
they are indented relative to one another. This means you need to take care in order to ensure
the indentation reflects the logic you intend. Consider the following two examples:
# Example 1:
x=5
y=4
if x > 5 :
x = x-1
x=y+1
print x
This example prints 5 because the line x=y+1 is not part of the if block.
# Example 2:
x=5
y=4
if x > 5 :
x = x-1
x = y+1
print x
This second example prints 5, since the lines x=x-1 and x=y+1 are executed if and only if x is
indeed greater than 5. All Python scripts use indentation to indicate grouping of statements,
and it is usually considered good form to place each Python statement on its own line.
Note: You can use either space characters or tabs for indents, but you should not mix them.
We recommend using spaces.
The # character is used for comments. Comments begin with a # and extend to the end of the
current line. The Python interpreter ignores comments.
The keyword def is used to define functions, which are groups of Python statements called to
perform a particular task. Function parameters may have default values. For example:
# Example 3
def myfunc( x, y=10 ):
x = x + y
print x
myfunc( 5 )
myfunc( 5, 5 )
Here we have defined a function called myfunc(). It takes two arguments, x and y where y
has a default value of 10. This means you can call myfunc in two ways as shown in this
example. In the first call to myfunc() a value for x is supplied, but no value for y. This means
that the default value for y is used. As a result myfunc computes the sum of 5 and 10 and
prints the result which is 15. In the second use, values are supplied for both x and y. By
providing an explicit value for y, the default value is overridden and the result is the sum of 5
and 5 which is 10.
Variables do not need to be explicitly typed or destroyed. In the above examples, the types
(integer, string, real number) of the variables x and y are defined only by the context and do not
need to be declared before the variable is used. Also, unlike other languages such as C and
C++, there is no need to allocate or free memory in Python—this is handled automatically.
All Python script files should have a .py suffix. Without this suffix, Maestro does not recog-
nize the files as Python scripts.
1. While Perl is used widely for system administration and web programming, Python is
more popular in scientific programming and larger scale development. Python is very
scalable: it can be used to write simple scripts or to develop full-fledged applications.
2. Of the Maestro users surveyed, most expressed either a strong preference for Python over
Perl, or no preference at all.
3. Python has a clean, straightforward syntax; a benefit for new or occasional users. Con-
sider the following two fragments, which perform the same task:
# The three input parameters are assigned passed values or the default.
my $axis = shift || 'Y';
my $step = shift || 10;
my $slp = shift || 0.1;
my $total_rotate=0;
# If the axis is not X,Y or Z then we can die with an error string
# which will be posted as a Maestro error dialog.
if( $axis !~ m/^X|^Y|^Z/i ) {
die "$axis is not a valid axis value - must use X,Y or Z";
}
# Now we begin the code which actually does the rotation. Start with
# a loop that finishes when we've rotated 360 degrees:
# Now we begin the code which actually does the rotation. Start with
# a loop that finishes when we've rotated 360 degrees:
In language comparisons like this, there really is no "right" answer. However, while you may
find some aspects of Python a little odd at first, we urge you to persist. Python is easy to learn
and easy to use.
There are a number of accounts available of programmers who have made the switch from Perl
to Python. One of the best is the Why Python?2 essay, by well-known programmer and author
Eric S. Raymond.
2. Please see the notice regarding third party programs and third party Web sites on the copyright page at the front
of this manual.
However, one important point about the way we use Python, both in Maestro and outside of it,
is that much of the heavyweight computation and manipulation of chemical structures is not
actually done in Python. As you will see from the examples in this document, Python in
Maestro is primarily used for controlling the program. The real work is actually done by
Maestro itself. The time and overhead required to interpret a Python script is generally insig-
nificant relative to the work done by Maestro to execute the commands sent to it from a Python
script.
Chapter 3
While issuing commands with Python scripts is an extremely powerful mechanism for control-
ling Maestro, it is essentially a one-way communication. There is no way for the scripts to
receive information about Maestro, the structure in the Workspace or entries in the Project
Table. Therefore we have introduced an additional mechanism so Python scripts can directly
manipulate structures and entries, right down to the level of changing the properties of indi-
vidual atoms. In practice many Python scripts use a combination of these two approaches.
You can create your Python scripts with any text editor. Some editors, like Emacs, include
special capabilities for Python such as colorizing keywords, comments, and strings; and main-
taining the correct indentation. Another possibility is the standard Python editor idle, which
you can run with the command $SCHRODINGER/run idle.
Example 1. myfirst.py
In your editor, create the following script and name the file myfirst.py:
# myfirst.py
def myfirst():
print "Hello World from Maestro"
Take care to indent the second line. It indicates that print "Hello World from Maestro"
is part of the myfirst() function. Save your script and start Maestro from the directory in
which you saved your script.
Modules generally contain a number of functions. In this example, our module contains a
single function, also named myfirst(). There is, however, no reason why a function needs to
be named after the module: it can have any name you wish.
It is important to understand the distinction between functions and modules. When you ask
Maestro to run a script it is actually executing a single function inside a module; it does not, in
general, run the entire contents of the module. You can in fact place all the functions you write
into a single file, although this practice is not recommended.
A common practice is to place a set of related functions into a module. Some may not be
designed to be directly called from Maestro. You might, for example, call one function from
Maestro and that function in turn might call other functions in the same module.
We have provided a number of useful functions that you can use as starting points to create
your own. You will see more on this in the next chapter.
Note: We specified both the module and function as part of the pythonrun command.
In this example, we had direct access to our script by starting Maestro from the directory
containing the script, but this can quickly become restrictive. See Section 3.8 on page 13 for
details on where to store modules.
pythonimport myfirst
Maestro reloads the myfirst.py file so that the next time you execute the script using
pythonrun, the updated version is used.
Example 2. myfirst.py
# myfirst.py
def myfirst( param = "" ):
print "Hello World from Maestro " + param
Save your changes and then use pythonimport to reload myfirst.py. Now run the script:
pythonrun myfirst.myfirst
It displays the same output as the first version of myfirst.py: “Hello World from Maestro”.
This is because we added an empty string as the default parameter. If you do not supply a value
for param, the script supplies an empty string. If we had not added the default value (param =
""), then running myfirst without a value for the parameter results in an error.
This is a very significant feature of running Python from within Maestro. Unlike older Maestro
command scripts, where everything needed to be hard-coded into the script, Python functions
can take any number of parameters, some or all of which can have default values. This allows
Python scripts to be very general. For example, you can write a script that operates on a file-
name supplied by the user at the time the script is run.
Note: With the exception of quoted strings (""), which are treated as a single parameter, all
parameters entered from the Maestro command input area must be separated by white
space.
pythonrun myfirst.myfirst
from any directory in which you run Maestro.
Note: Maestro looks first in the current directory (the one from which you started Maestro)
first before looking in $HOME/.schrodinger/maestroversion/scripts. So if you
have a local copy of the script, the local copy is used instead.
You can achieve the same result as pythonrun but you must remember to first import the
module and use the Python style syntax to call any functions. So using the example for
myfirst.py as described above:
is equivalent to:
Chapter 4
The previous chapter showed how to run a very simple Python script from within Maestro and
introduced modules and functions. This chapter discusses how you can write scripts to issue
Maestro commands.
There are several variants of this. In our examples we use the from schrodinger import
maestro form. This means that in the example scripts we will always reference the functions
in this module in the fully qualified form as maestro.function(). However, if you look at
Python documentation or other scripts, you will see that there are other ways you can use the
import command. For example:
You can even go further and use an import expression such as:
from schrodinger.maestro.maestro import command as c
This means that instead of using maestro.command() you can simply use c().
Note: You can write your own modules and import them into other modules. The only
requirement is that your modules are in the module search path as described in the
previous chapter.
Note: You can review the commands that have been issued in the normal operation of
Maestro by choosing Command Script Editor from the Edit menu.
There are a couple of enhancements we can make to this script. In the previous example the
axis of rotation is hard coded into the Python function. We could create similar functions called
rotx() and rotz() to rotate around the other axes. Another option is to supply the axis of
rotation as a parameter and add that into the command string before issuing the command.
Here is the reworked example in which we have not only parameterized the axis of rotation but
also taken care to verify that the supplied argument is a valid axis:
# rot.py
from schrodinger.maestro import maestro
pythonrun rot.rot x 30
to get a rotation of 30 degrees around the X-axis.
Note: While this might be a good example of building your own function that uses parame-
ters, you will probably want to use the Maestro rotate command directly to rotate
structures in the Workspace:
rotate x=30
It is also possible to issue more than one command in a single call to maestro.command() by
using a triple quoted string with each command on a separate line. For example, the commands
above can also be issued as:
maestro.command("""
rotate x=5
rotate y=5
rotate z=5
""")
This is a useful way to issue a series of Maestro commands taken directly from the Maestro
command script editor. Note that each command must be on a new line.
Example 2. spin.py
Before we finish this introduction we will look at a more sophisticated example—a Python
function that does something not currently possible with a Maestro command. The following
example uses the form of maestro.command() where the options are specified separately
from the keyword:
#spin.py
from schrodinger.maestro import maestro
import time
def spin(axis="Y",step=10,slp=0.1):
total_rotate=0
return
When this is run as:
it spins the structure in the Workspace around the Y axis 360 degrees in increments of 20
degrees. The axis, increment, and the delay are all specified as parameters.
• In addition to including the maestro module we also include the standard Python mod-
ule time. This allows us access to the time.sleep() function, used in this script to
introduce a short delay after each rotation.
• Note how the total_rotate variable is used in the while loop. You can use any com-
bination of parameters and variables in combination with the functions in the maestro
module.
• In this particular function we need to use maestro.redraw() to force the contents of
the Workspace to be redrawn after each rotation.
If you want to experiment with issuing commands from your own scripts, this example is a
good place to start. As an exercise, create a script that rotates the structure first one way and
then another.
It is also possible to use Maestro's command alias function to create new commands. For
example, if you issue:
In general, if a task can be performed by issuing a Maestro command, that is the preferred way
to achieve it from a Python script. Not only does this generally result in the shortest possible
script, but Maestro commands also automatically update the internal state of Maestro,
redrawing the Workspace as needed, and so on.
As powerful as these techniques are, there are still things you cannot achieve using Maestro
commands alone. The following chapters describe how to control Maestro at a lower level of
detail.
Chapter 5
Previous chapters covered how to run Python scripts in the Maestro environment and how to
issue commands from those scripts. In this chapter we discuss more sophisticated and powerful
features that allow you to directly manipulate the structure in the Workspace.
To get the Workspace structure in a Python script, use a statement such as:
st = maestro.workspace_get()
Once you have a structure there are a number of operations you can perform on it. You should
however note the following:
• The structure that is returned from workspace_get() is the actual Workspace structure
as used by Maestro. Any changes made will be reflected in the Workspace.
• The structure object returned does not contain any properties, such as those associated
with the entry in the Project Table. You have to use the project module to get access to
properties.
• If you want to pass the structure to another operation we suggest you make a copy of it
with Structure.copy().
maestro.workspace_set(regenerate_markers=True)
Then you can use:
maestro.redraw_request()
to request that Maestro redraw the contents of the Workspace when your script has finished
executing. You can also use maestro.redraw() to redraw the contents of the Workspace
before your script has finished executing.
Setting regenerate_markers to True (the default) forces Maestro to update markers for
distances, labels, and so on, and to apply any changes to the Workspace structure. It should be
set to True in most circumstances.
Note: workspace_set() will not work correctly if the specified Structure is passed directly
from Project Table’s row.structure (or if it is managed by another C library). The
workaround is to pass a copy of the Structure.
Here x is the x coordinate for the atom, but any of the properties shown in Table 5.1 and
Table 5.2 can be used.
Property Description
x x coordinate
y y coordinate
z z coordinate
Property Description
entry_id Entry ID
entry_name Formerly entry name, now returns entry ID. Use entry_id
instead. Entry name should be obtained as a property from the
project via s_m_entry_name
molecule_number Number of the molecule to which this atom belongs
Atom properties can also be obtained using the data names as they appear in the Maestro file.
For example the following two ways of printing the x coordinate are equivalent.
total_order=0
for iatom in st.atom:
for ibond in iatom.bond:
total_order += ibond.order
This example uses the bond order property. The available bond properties are listed in
Table 5.3.
Property Description
st.deleteBond( 12, 17 )
deletes the bond between atom 12 and 17.
There is also the areBound() method which returns True if the two atoms specified have a
bond between them.
Note that the structureutil module contains a measure() method, which allows
measurements of any type to be peformed between different Structure objects and also
supports determining the angle between two planes (as defined between two sets of three
atoms).
to_del = [1,2,5]
st.deleteAtoms(to_del)
st = maestro.workspace_get()
print "There are %d molecules" % len(st.molecule)
imol=1
for mol in st.molecule:
print "There are %d atoms in the molecule" % len(mol.atom)
for atom in mol.atom:
# Do something for atoms in molecule
molst = mol.extractStructure()
molst.write( "molecule%d.mae" % imol )
imol += 1
See the HTML documentation for the structure module for more information on these iter-
ators. Note that the extractStructure() method provided for the chain, residue, molecule,
and ring objects does not inherit the title or any other structure-level properties of the structure
which it is extracted from.
Example 1. closecontact.py
This example illustrates how to issue Maestro commands and manipulate the Maestro Work-
space structure. We use a Python function to highlight close contacts between any atoms sepa-
rated by more than three bonds:
#closecontact.py
from schrodinger.maestro import maestro
from schrodinger import structure
from schrodinger import structureutil
def close_contacts(thresh=2.0) :
# Get the current Workspace structure and calculate the number of atoms:
st = maestro.workspace_get()
distance = 0
By now you recognize the import maestro statement. This example also imports two addi-
tional modules, structure and structureutil, that we provide to perform operations at a
lower level.
The first task in close_contacts() is to get the structure from the Workspace. Next we loop
over all atoms in the structure.
For each atom in the Workspace, we only want to calculate the distance away from the current
atom if it is more than three bonds away. An easy way to calculate this is with an ASL expres-
sion: not( withinbonds 3 atom.iatom).
Evaluating this returns a Python list containing the atoms that satisfy our expression. We use
that list in an inner loop to calculate the distance from the current atom. If a distance is less
than the given threshold, we use Maestro to mark the distance by issuing the distance
command.
This script contains common tasks: looping over all atoms, evaluating ASL expressions, and
making measurements on the structure. Note that since we did not make any changes to the
structure directly, we did not need to call maestro.workspace_set().
Example 2. rotH.py
The following script adds hydrogens to the structure in the Workspace, and attempts to rotate
the O-H and S-H groups on SER, THR, TYR, and CYS residues to place the hydrogen as close
as possible to an acceptor. This is also an example of using multiple functions in a Python
module. Only one of the functions is intended to be called from Maestro. The other simply
improves the readability of the script.
Note: This script is provided purely as an example of how to manipulate the workspace—it is
not intended to be a solution to the difficult problem of orienting hydrogens in
proteins!
#rotH.py
#Import the modules we need:
from schrodinger.maestro import maestro
from schrodinger import structure
from schrodinger import structureutil
# Now locate all the rotatable atoms in all the CYS, SER, TYR and THR.
# The simplest way to do this is using an ASL expression:
rotables = structureutil.evaluate_asl( st,
"(res.ptype SER, THR, TYR, CYS ) and (atom.ptype HG, HG1, HH )")
# rotables is now a list of the appropriate atoms. We can loop over that
# and do what we need to do with them:
for h in rotables:
# We need to locate a suitable dihedral angle to rotate:
dihedral = get_dihedral_atoms( st, h )
if not len(dihedral) == 4 :
# Under normal circumstances there shouldn't be any situations
# where we can't locate four atoms. However, there is the chance
# that we might get an incomplete residue in a PDB file, so we'll
# just ignore this residue if that's the case:
continue
asl = \
"( %s ) and ( sidechain or res. HOH ) and not fillres( atom.num %d )" \
% ( asl, dihedral[1] )
acceptors = structureutil.evaluate_asl( st, asl )
ang = 0.0
min_dist = 1000.0
best_ang = 0.0
# If it actually found some suitable acceptors, we can
# then scan the C-C-O-H dihedral see which dihedral gives the
# closest contact to an acceptor. The scan is done in increments
# of 5 degrees:
while len(acceptors) > 0 and ang < 360.0 :
# Set to the current angle:
# Note that we pass the dihedral
# as C2-C1-X-H as specifying it in this way indicates
# we want to rotate the hydrogen:
st.adjust( ang, dihedral[3], dihedral[2],
dihedral[1], dihedral[0] )
# For each acceptor atom measure the distance and find out
# whether we've actually found a closer match than we've found
# before:
for acc in acceptors:
dist = st.measure( h, acc )
if dist < min_dist:
min_dist = dist
best_ang = ang
best_acc = acc
# We've tried all the angles now. Reset it back to the one
# which gave us the closest contact with an acceptor:
if len(acceptors) > 0:
st.adjust( best_ang, dihedral[3], dihedral[2],
dihedral[1], dihedral[0] )
return
##########################################################################
def get_dihedral_atoms( st, h ):
""" For atom number h in the structure st, find four atoms to
be used to scan the C-C-X-H dihedral. These are returned as a list.
This function illustrates how to traverse the bonds of a structure """
ret_list = []
# Now find the O or S attached to the H. Note that bonds (like everything
# associated with structures) are indexed from "1" so we are
# looking for the first bond:
Xatom = st.atom[h].bond[1].atom2
ret_list.append( int(Xatom) )
return ret_list
Now that we have seen how to manipulate the structure from the Workspace, next we will look
at how to access the Project Table directly.
Chapter 6
While using Python scripts to manipulate the structure in the Workspace is useful for extending
the functionality of Maestro, you can also automate Maestro by operating on structures in the
Project Table. This chapter provides an overview of the possibilities.
pt = maestro.project_table_get()
# Loop over the selected entries
for sel_entry in pt:
# Do something for selected entries only using "sel"
only operates on the selected entries whereas:
pt = maestro.project_table_get()
# Loop over the selected entries
for irow in xrange(1,len(pt)+1):
# Do something for all entries using pt[irow]
operates on all entries.
Example 1. saveimage.py
Here is an example that saves a .jpg image for each selected entry:
#saveimage.py
def save_image():
pt = maestro.project_table_get()
If your project synchronization preferences are set to Automatic, each time you initiate an
entrywsincludeonly on one entry in the Workspace, all entries in the project are updated.
(To set these preferences, choose Preferences from the Maestro menu, then select the Project
folder.) You can use this to make a change to all the selected entries.
Example 2. meth.py
The following script methylates amides in every selected structure by combining Maestro
commands with direct manipulation of the Workspace structure for each selected entry:
#methylate.py
def methylate():
pt = maestro.project_table_get()
maestro.command("fragment organic Methyl")
You can use practically any combination of Maestro commands and direct manipulations in the
Workspace structure to achieve the results you want. We have also provided a similar function,
project.getPropertyNames() that returns the property names.
If your Python script needs to make selections in the Project Table that are not possible by
issuing a entryselectonly command, the preferred method is a selection filter and the
project.selectRows() function. To do this, write a simple Python function that is called
for every entry in the project. If your function returns True the entry is selected, if it returns
False the entry is not selected. Your function should accept two parameters: the CT corre-
sponding to the current entry and a Python dictionary with an entry for each property in the
Project Table. Note that it’s also possible to use project.selectRows() with a list of row
numbers to be selected. The following example demonstrates both these approaches:
Example 3. selring.py
This example selects entries that contain rings of a specified size:
#select_ring.py
pt = maestro.project_table_get()
matches = []
# The following two functions show how to select using a callback function
def select_ring2( ring_size ):
pt = maestro.project_table_get()
pt.selectRows(project.REPLACE, ring_size, function=myfunc)
"""
Example callback function to select based on a property
Return True if it should be selected
Return False if it should be deselected
"""
ring_size = args[0]
rings = structureutil.find_rings(ct)
for ring in rings:
if( len(ring) == ring_size ):
return True
return False
pythonrun select_ring.select_ring 4
only those entries in the project with four-membered rings are selected.
Example 4. selprop.py
This example selects entries from the project based on a combination of the entry's structure
and properties:. In this case we use the property dictionary to find the value of the property we
are interested in.
#selprop.py
pt = maestro.project_table_get()
matches = []
Example 5. color_by_energy_gradient.py
Here is an example that sets the color of each entry, based on the relative molecular mechanics
energy
Note: Because we return True from the function, the entry structure will be updated in the
Project Table
#color_by_energy_gradient.py
def color_relative():
"""
pt = maestro.project_table_get()
num_atoms = pt[row].structure.atom_total
ct = pt[row].structure
for i in range(1, num_atoms+1):
ct.atom[i].color = col
Example 6. count_ch.py
The following example adds properties for the number of hydrogens and carbons to every entry
in the project.
#count_ch.py
def count_ch( ):
pt = maestro.project_table_get()
matches = []
ct = pt[row].structure
num_h_atoms = 0
for a in ct.atom:
if a.atomic_number == 1:
num_h_atoms += 1
num_c_atoms = 0
for a in ct.atom:
if a.atomic_number == 6:
num_c_atoms += 1
pt.refreshTable()
Example 7. supersel.py
This is another example of looping over all entries. In this case an operation (superposition) is
performed on the entries and the RMS deviation is added as a property in the Project Table.
#supersel.py
from schrodinger.maestro import maestro
from schrodinger import structureutil
from schrodinger import project
def superimpose_select():
pt = maestro.project_table_get()
matches = []
count = 1
count = count + 1
Chapter 7
Example 1. allfiles.py
# allfiles.py
from schrodinger.maestro import maestro
from schrodinger import structureutil
import commands
import glob
import os
# This is the one that should be executed from Maestro. It uses the
# glob.glob subroutine to apply the action to all PDB files:
def mini_all_pdb() :
# first clean up by removing any existing files:
commands.getoutput( "rm *-min.pdb" )
directory=os.getcwd()
path = os.path.join(directory,"*.pdb")
filelist = glob.glob( path )
for afile in filelist:
mini_pdb( afile )
maestro.redraw()
# Run and wait for the job:
maestro.command("energystart")
maestro.job_wait(True)
maestro.redraw()
#The job is now finished - export the structure to a new PDB file
out_file = pdb_code + "-min.pdb"
maestro.command("entryexport format=pdb source=selected %s" % out_file )
Once a Job object is created, the available keys can be listed with the keys() method and
their values can be obtained as attributes of the object. The database values retrieved at creation
time will never be updated automatically. They must be explicitly updated with the
readAgain method.
This example from the interactive prompt demonstrates how to read the database, access
attributes, and update your information:
>>> import schrodinger.job.jobcontrol as jobcontrol
>>> j = jobcontrol.Job("isabel-0-434ac660")
>>> j.keys()
[’BackendFifo’, ’BackendPid’, ’ChildPid’, ’Command’, ’Dir’, ’Envs’,
’Home’, ’Host’, ’HostEntry’, ’HostsFile’, ’InputFiles’, ’JobDB’,
’JobDir’, ’JobFifo’, ’JobHost’, ’JobId’, ’JobPid’, ’JobPort’,
’JobUser’, ’LaunchTime’, ’LogFiles’, ’MonitorInterval’, ’Name’,
’OutputFiles’, ’Processors’, ’Program’, ’StartTime’, ’Status’,
’StatusTime’, ’SubJobs’, ’User’]
>>> j.LaunchTime
’2005-10-10-15:52:00’
>>> j.LogFiles
[’counterpoise.1146.blog’]
>>> j.StatusTime
’2005-10-10-17:18:31’
>>> import time; time.sleep(1200) # wait a while
>>> j.StatusTime
’2005-10-10-17:18:31’
>>> j.readAgain()
Chapter 8
All the scripts we have used so far have been run from the Maestro command line using the
pythonrun command. It is, however, possible to write scripts that display their own graphical
panels, similar to those of Maestro itself.
8.1 Tkinter
We support the Tkinter graphical user interface (GUI) toolkit for Python. It is simple to use and
comes as a standard part of Python. This document describes how to use Tkinter with Maestro.
Learning to program a GUI takes a bit of practice, but Tkinter and PMW make it relatively
easy. For information on using Tkinter, see the recommended O’Reilly books on Python3 or
the following page on the Pythonware website3 which offers a step-by-step tutorial. For infor-
mation on PMW, see http://pmw.sourceforge.net3. You can also find more examples using
Maestro and Tkinter at:
$SCHRODINGER/python-vversion/scripts/maestro/
First and foremost, never call mainloop() on any widget in your Tkinter script. If you do,
your script will run, but Maestro will be inactive while your Tkinter widget is visible. This
probably is not what you want since the real power of creating your own GUI panels is to allow
them to interact with Maestro.
3. Please see the notice regarding third party programs and third party Web sites on the copyright page at the front
of this manual.
Note: Any number of Python/Tkinter scripts can be sharing events with Maestro.
Example 1. simple.py
Here is an example that displays a simple panel in Maestro:
# simple.py
from schrodinger.maestro import maestro
from Tkinter import *
simple_top = 0
def simple():
global simple_top
# Don't put the panel up twice:
if( simple_top != 0 ):
return
simple_top = Tk()
quit_button = Button( simple_top, text='Quit', command = simple_quit_com)
quit_button.pack()
maestro.tk_toplevel_add(simple_top)
We use maestro.tk_toplevel_add to let Maestro know when we are ready to display the
panel and maestro.tk_toplevel_remove when we are finished with it. Also notice the use
of simple_top as a check to see if the panel is currently displayed. Before putting up a panel,
it is generally good practice to see if the panel already exists. It can be confusing to have
multiple instances of the same panel floating around.
Note: If you request to receive selection information from Maestro, then no other panel
within Maestro (or any other Python/Tkinter script) can receive selection information.
By the same measure, if another panel starts receiving selection information (say it is
opened from the main Maestro menu) then your panel loses the ability to receive selec-
tion information. This is a natural consequence of the way atom selection operates in
the Maestro Workspace; picking information can only go to one place.
Example 2. simple_pick.py
Getting selections from the Maestro Workspace is simple. All you need to do is tell Maestro
the name of the Python function you want called when a selection is received. Here we have
extended the previous example by adding an additional function to receive atom selections.
Remember to tell Maestro you no longer want to receive events when the panel is closed.
# simple_pick.py
from schrodinger.maestro import maestro
from Tkinter import *
from schrodinger import structure
simple_top = 0
def simple_pick_cb( at ):
st = maestro.workspace_get()
pdb_res = st.atom[at].pdbres
print "Picked residue is: %s " % pdb_res
def simple_pick():
global simple_top
# Don't put the panel up twice:
if( simple_top != 0 ):
return
simple_top = Tk()
quit_button = Button( simple_top, text='Quit', command = simple_quit_com)
quit_button.pack()
maestro.tk_toplevel_add(simple_top)
The function that receives selections should be prepared to receive a single parameter, the atom
number of the selected atom in the Workspace. Once you have the atom number you can use
that to operate directly on the Workspace structure as shown in this example, or use it to issue
Maestro commands.
Example 3. simple.py
Here’s another version of Example 1, which uses the Schrödinger interface to Tkinter:
# simple.py
from schrodinger.maestro import maestro
import schrodinger.ui.widget as stk
simple_top = 0
def simple():
global simple_top
# Don’t put the panel up twice:
if( simple_top != 0 ):
return
simple_top = stk.Tk()
quit_button = stk.Button( simple_top, text=’Quit’,
command = simple_quit_com)
quit_button.pack()
maestro.tk_toplevel_add(simple_top)
There are a number of other facilities that are provided by these modules - we suggest you look
at the reference documentation for more details.
Chapter 9
Normally, the Python functions you write for use in Maestro will be called explicitly with the
pythonrun command. However, there are some situations in which you may want to supply a
Python function that will be called when particular events occur during the normal operation of
Maestro. These functions, sometimes known as callbacks, are an important method for
extending and modifying the default behavior of Maestro. You have already seen one example
of a callback function at work in the previous chapter, where we used
maestro.picking_atom_start() to register a Python function that was called by Maestro
when an atom was selected in the Workspace. There are two additional callback functions that
can be used to extend the capabilities of Maestro.
maestro.periodic_callback_add(your_callback_function_name)
to register a Python function that will be called approximately 20 times a second. If you would
like to perform the action at a lower frequency, maintain a counter in your function and only
perform the action once every N times your callback function is called (i.e. every 20 to get
about once a second).
Example 1. spin.py
Here is a simple example to show how this works:
#spin.py
# Register a periodic callback to spin the molecule
from schrodinger import maestro
def start_spin():
maestro.periodic_callback_add( "spin.spin_cb");
When you issue the pythonrun spin.start_spin command, the contents of the Work-
space will be rotated around the Y-axis. This will continue until you explicitly issue the
pythonrun spin.stop_spin command.
When you register a callback you should use the fully qualified module.function form. You
can register as many periodic callback functions as you like during a Maestro session. When
finished performing the periodic action, remember to unregister your callback function using:
maestro.periodic_callback_remove(your_callback_function_name)
Note: A registered callback function should not attempt to remove itself.
maestro.hover_callback_add(your_callback_function_name)
that will be called by Maestro whenever the pointer is paused (“hovers”) over an atom in the
Workspace. In this example we demonstrate the mouse hover callback by utilizing the Maestro
feature that allows atom-specific information to be displayed in the Workspace status bar.
Example 2. hover.py
The following example replaces the default string in the status bar with the atom number and
partial charge of the atom the pointer is paused over:
#hover.py
from schrodinger import maestro
from schrodinger import structure
_last_atom = -1;
def set_hover():
maestro.hover_callback_add( "hover.hover_cb" )
def clear_hover():
maestro.hover_callback_remove( "hover.hover_cb" )
def hover_cb( at ):
global _last_atom
if( at == _last_atom ):
return
if at > 0 :
st = maestro.workspace_get()
pcharge = st.atom[at].partial_charge
maestro.feedback_string_set("Atom: %d Charge = %5.3f" % (at, pcharge))
_last_atom = at
return;
Note: Register the callback function using the full module.function form. When you no
longer need the mouse hover callback, unregister your function using:
maestro.hover_callback_remove(your_callback_function_name)
The callback function in this case is called with a string parameter thatindicates exactly what
has been modified in the Workspace. It will be one of:
• everything
• color
• geometry
• visibility
• representation
• properties
• coordinates
• connectivity
• unknown
In this way a script that does not care about, say, changes in the representation of the structure
in the Workspace can choose to take no action in these cases.
Here is a very simple example to illustrate how these functions can be used together in order to
display a sphere at the centroid of each molecule present in the Workspace. In this instance the
centroid calculation is only performed when we are notified that the Workspace has changed
and not each time we draw. This allows the draw performance to be kept as fast as possible:
from schrodinger.maestro import maestro
from schrodinger.graphics3d import sphere
sphere_group=0
def run():
# Register the callbacks and setup the drawing with the initial
# contents of the workspace
update("everything")
maestro.workspace_changed_function_add("spherecent.update")
maestro.workspace_draw_function_add("spherecent.draw_cb")
maestro.redraw()
def stop():
# Turn off the drawing:
maestro.workspace_changed_function_remove("spherecent.update")
maestro.workspace_draw_function_remove("spherecent.draw_cb")
maestro.redraw()
def draw_cb():
global sphere_group
if sphere_group != 0:
sphere_group.draw()
if changed in ["everything","geometry","connectivity","unknown"]:
sphere_group = sphere.SphereGroup()
st = maestro.workspace_get()
for mol in st.molecule:
xcent = 0.0
ycent = 0.0
zcent = 0.0
sphere_group.add(sp)
Chapter 10
Even if you are an experienced script writer, you are going to occasionally make mistakes.
Now the task of debugging commences. This chapter discusses strategies for determining what
has gone wrong with your script.
Example 1. spin_debug.py
It is easy to use pdb with your scripts. Simply create a special version of your main function
that passes control to pdb, then call the modified function from Maestro. To illustrate, we are
going to extend one of our previous examples:
#spin_debug.py
import pdb
def spin(axis="y",step=10,slp=0.1):
total_rotate=0
4. Please see the notice regarding third party programs and third party Web sites on the copyright page at the front
of this manual.
return
Now, calling spin_debug from Maestro using pythonrun will run the script in the Python
debugger, giving you access to all its debugging tools. From the (Pdb) prompt at the main
window enter "b spin.spin" to set a breakpoint at the spin.spin() method. Next enter
"c" to continue execution up to the breakpoint.
(Pdb) b spin.spin
Breakpoint 1 at /home/user/.schrodinger/maestroversion/scripts/
spin.py:23
(Pdb) c
> /home/user/.schrodinger/maestroversion/scripts/spin.py(23)spin()
-> total_rotate=0
Once you are at the breakpoint, use "n" to step line by line through the function and "p" to print
current values of the variables. For a full list of available commands see the Python pdb
website5.
5. Please see the notice regarding third party programs and third party Web sites on the copyright page at the front
of this manual.
Chapter 11
To this point, all our Python scripts have been run from the Maestro command line using
pythonrun. While this works well during development and for occasional use, it can be quite
cumbersome for frequently used, mature scripts. The Maestro Scripts menu offers a readily
accessible home for all your frequently used scripts.
You can add scripts to this menu directly in Maestro (see Chapter 13 of the Maestro User
Manual for details). Or you can add scripts to the Scripts menu manually by editing the
scripts.mnu file as described below.
The format for the scripts.mnu file is simple. There is one entry for each menu item. Each
entry consists of two lines. The first describes the menu item itself. The second defines the
command that will be run when that menu item is selected. In the next example we make our
spin script available from the Scripts menu by creating a $HOME/.schrodinger/
maestroversion/scripts.mnu file containing the entry:
Spin
pythonrun spin.spin Y 30 0.1
The next time you start Maestro the Spin item will be available on the Scripts menu. Note how
we have supplied arguments to the spin.spin command. This mechanism is not limited to
running Python scripts. You can issue any Maestro command from the Scripts menu.
Depending on your personal preferences, this can be an effective alternative to pythonrun.
supplying different arguments. To define a cascading submenu, place a colon (:) in the first
line of the entry:
Spin:X
pythonrun spin.spin X 10
Spin:Y
pythonrun spin.spin Y 10
Spin:Z
pythonrun spin.spin Z 10
Now we have a single Spin item in the top level Scripts menu. Spin contains a cascading
submenu with items X, Y, and Z. Each item causes the contents of the Workspace to rotate
about a different axis.
Note: You can only specify one level of cascading submenu items. You can not create an
entry like the following:
Category:subcategory:item
Chapter 12
• The maestro module can only be used for scripts that are running inside Maestro. While
the other modules like structureutil and mm can be used for scripts run with Python
outside Maestro, the maestro module requires code that exists inside Maestro. (The next
section describes a method for creating modules that can be used in a variety of situa-
tions.)
• Atoms and bonds are indexed from 1 (not 0) in functions that manipulate the structure.
• Emacs and Idle both provide many useful functions for creating and validating Python
scripts. In Emacs, typing CTRL-C CTRL-C checks the syntax of the current file.
• A script that is executing within Maestro can be interrupted by typing CTRL+C in the ter-
minal window in which Maestro was started.
• Modules are a great way to organize your code. When you create a set of functions you
think are useful in multiple scripts, create a module to hold the scripts and place it in the
module search path (for example .schrodinger/maestroversion/scripts in your
home directory). This makes the functions available for use in all your scripts.
• Even though the maestro module cannot be used outside Maestro, it is possible to write
a module that includes maestro, and can be included as a module in another script or run
as a stand-alone script. Here is an example:
# pdbname.py
def assign_pdb_names( ct, res_names = True, atom_names = True ):
# Body not shown:
if __name__ == '__main__':
#Running as stand-alone script:
structureutil.for_all_structures( sys.argv[1], assign_pdb_names,
Chapter 13
The main difference between running a script inside or outside Maestro is how you gain access
to structures. When running a script inside Maestro, you can get structures from the Workspace
or the project. However, in your stand-alone scripts you need to read the structures directly
from the files. Fortunately this is easy. Short filter scripts can be created that read structures
from one file and write modified versions to another file.
.ent PDB
.pdb
.sd SD
.sdf
.mol
.mol2 Tripos mol2
.mae.gz Gzip compressed Maestro
.maegz
.mae Maestro
Once you have a structure object, it knows how to write or append itself to a specified file.
Again, the format can be PDB, SD, MOL2, or Maestro. If the format is not specified explicitly,
the file suffix is used.
os.remove(sys.argv[2])
Note: In this case, the format is specified explicitly as format=maestro so this only oper-
ates on Maestro files, regardless of the file suffix. Any number of operations could be
performed on the structure before it is written out again.
There is also a mechanism for examining the properties of the structures in the file. For
example, you could examine those quantities that are calculated and added to a file by
programs like Glide, QikProp, or MacroModel. The "property" field of the structure class can
be used as a Python dictionary to get and set properties associated with that structure.
Note: When operating from files, you must use the property name exactly as it appears in the
file. Also if you create a property name, it must conform to the following prefix
convention, based on the type of data you wish to add:
The following example creates a new integer property. When the output file is imported into
Maestro, this new property appears in the Project Table as "My Sum".
s.property['i_user_My_Sum']= sum
Note: No explicit format is given, so the suffix of the input and output file determine the
format. The receptor (first structure) is omitted and the remaining ligand poses are only
written to the input file if the GlideScore is < -4.5.
#glidekeep.py
from schrodinger import structure
import sys
import os
os.remove(sys.argv[2])
cnt = 1
for s in structure.StructureReader( sys.argv[1] ):
if cnt != 1:
if s.property['r_i_glide_gscore'] < -4.5 :
s.append(sys.argv[2])
cnt += 1
Appendix A
Module Description
application Application-specific functions
graphics3d Support for drawing in the Workspace
infra Low-level package with tools not intended for general script usage. We
reserve the right to change these modules as we feel necessary.
job Functions for launching and managing jobs
maestro Functions used for interaction with Maestro
project Functions used with Maestro projects, either inside or outside Maestro
protein Package for capping, analyzing, and adding hydrogens to proteins.
structure Functions to read, write, and manipulate structures
structureutil Functions that operate on structure objects: finding rings and matching
SMARTS expressions, etc.
structutils Package for modules of structure utilities
ui Graphical user interface tools. Fixes some bugs in Tkinter and PMW that
cause bad interactions with Maestro. Use these to ensure consistent look and
feel with Maestro. Lastly, provides reusable specialized components.
utils Utility functions - logging script operations and standardized processing of
command line options.
Getting Help
For help installing and setting up licenses for Schrödinger software and installing documenta-
tion, see the Installation Guide. For information on running jobs, see the Job Control Guide.
Maestro has automatic, context-sensitive help (Auto-Help and Balloon Help, or tooltips), and
an online help system. To get help, follow the steps below.
• Check the Auto-Help text box, which is located at the foot of the main window. If help is
available for the task you are performing, it is automatically displayed there. Auto-Help
contains a single line of information. For more detailed information, use the online help.
• If you want information about a GUI element, such as a button or option, there may be
Balloon Help for the item. Pause the cursor over the element. If the Balloon Help does
not appear, check that Show Balloon Help is selected in the Help menu of the main win-
dow. If there is Balloon Help for the element, it appears within a few seconds.
• For information about a panel or the tab that is displayed in a panel, click the Help button
in the panel. The help topic is displayed in your browser.
• For other information in the online help, open the default help topic by choosing Help
from the Help menu on the main menu bar or by pressing CTRL+H. This topic is dis-
played in your browser. You can navigate to topics in the navigation bar.
If you do not find the information you need in the Maestro help system, check the following
sources:
If you have questions that are not answered from any of the above sources, contact Schrödinger
using the information below.
E-mail: help@schrodinger.com
USPS: Schrödinger, 101 SW Main Street, Suite 1300, Portland, OR 97204
Phone: (503) 299-1150
Fax: (503) 299-4532
WWW: http://www.schrodinger.com
FTP: ftp://ftp.schrodinger.com
Generally, e-mail correspondence is best because you can send machine output, if necessary.
When sending e-mail messages, please include the following information:
SCHRÖDINGER ®