ARIAweb Documentation

Overview


ARIA (Ambiguous Restraints for Iterative Assignment) is a software for automated NOE assignment and NMR structure calculation. It speeds up and automatizes the assignment process through the use of an iterative structure calculation scheme. Additionally, a refinement in explicit water improves the quality of the calculated structures, validation tests help spectroscopists to judge the quality of the final structures, and the support of the CCPN data model simplifies the exchange of information with other NMR software packages.

More information about ARIA can be found on the ARIA website.

The ARIAweb.pasteur.fr server provides access to the main ARIA software functionalities (NMR data conversion and structure calculation). In addition, structure calculation results can be easily visualised, with a dedicated molecular viewer and graphical displays of various validation statistics.

Access

Access to the ARIAweb server is free. Users are encouraged to register (Signup now button on the login page) to easily manage your ARIA projects. Use of the ARIAweb server without registration is possible as an anonymous user. Still, anonymous users have the possibility to become fully registered using Create a full account in the User menu .

Citing ARIAweb

If you use this webserver, please cite:

  • Allain F, Mareuil F, Ménager H, Nilges M, Bardiaux B. ARIAweb: a server for automated NMR structure calculation. Nucleic Acids Research. 2020 May 8;. https://doi.org/10.1093/nar/gkaa362

  • Brünger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr D Biol Crystallogr. 1998 Sep 1;54(Pt 5):905-21. https://doi.org/10.1107/S0907444998003254

Projects


Calculations performed on the ARIAweb server are organized as Projects. Users can view a list of their projects or create a new project from the their homepage. To create a new projects, use the Create a new Project button on the homepage or use the button on the projects list page. On the project creation page, enter a name for your project. If you plan to use NMR data from CCPN project archive (version2 only), upload it and click the Save button. For any project, ARIAweb offers three services:

  1. Data Conversion: Convert NMR data from various formats into the internal ARIA XML format.
  2. Structure Calculation: Automated structure calculation from NMR data (using already converted NMR data or data from a CCPN project archive).
  3. Results: Access ARIAweb results for your project (data conversion or structure calculation).

The minimal input data to perform structure calculation using NMR data with ARIAweb are:

  • Sequence of the molecular chain(s) composing the molecule for which you want to calculate the structure
  • One (or more) list(s) of NOE cross-peaks
  • One (or more) list(s) of assigned chemical shifts

If those data are not already converted in the ARIA XML format, you have to first use the Data Conversion functionality (see below). Once the data are in the correct XML format, user can setup a new structure calculation (either by uploading XML file directly from their computer or by creating a new structure calculation from the data converted on the ARIAweb server (see below).

Alternatively, if your molecular and NMR data are already present in a CCPN project archive (version2 only) that you uploaded at the project creation, you can directly create a new structure calculation. CCPN project archive can be prepared from within CcpNmr Analysis (see the CcpNmr Analysis documentation for details).

Data Conversion


To convert molecular sequences and NMR data into the ARIA XML format (and later perform structure calculation), users must first create a new Data Conversion using the New button on the project page or use the button on the list of data conversion).

Users will be asked to enter a name for their conversion project and a name for the molecule. Next, for every chain in the molecule, use the Add a chain or button. The sequence file must be uploaded in the Input field. Accepted formats are SEQ (Xeasy 3 letters format) or PDB (Protein Data Bank). See the Example section for file format.

Similarly, users can create as many NOE Spectrum as needed using the Add a Spectrum or button. Two mandatory input files are required:

Click on a format name to view an excerpt of a correctly formatted file. Input files are automatically checked for format correctness. See the Example section for example files.

When all input data have been entered, click the Finish button. You will be redirected to the list of conversions for your project. To start the data conversion process, use the Start conversion button. You will be then redirected to the Results page for your project where you can follow the status of your conversion job (see the Job status section for explanations on the different status icons). Finished jobs will be listed here. Registered users will receive a-email upon job submission with a link to the results page.

When a conversion job is finished, users can:

  • Download an archive containing the converted (in ARIA XML format) data using
  • Delete converted data using (This action cannot be undone!)
  • Start a new structure calculation from the converted data using

Need more details on ARIA conversion ? Go the ARIA documentation.

Structure calculation


This process is identical to the use GUI of the ARIA standalone program. Users must first specified some parameters such as input data, NOE assignment criteria, number of structures to generate or parameters for the water refinement. When initiating a structure calculation from a previous conversion job on ARIAweb, mandatory input data are already pre-filled and shown next the Currently: showing the name of the file currently selected. When using data from a CCPN project, a list of possible data entries are shown for each appropriate input data type. See the Example section for file format.

The Structure Calculation form on ARIAweb is designed to guide the user through the main categories of parameters that need to be checked (if default values are used) or specified (if a user wants to change with customized values). The 4 main categories of parameters are:

  1. SETUP: name of the structure calculation job and other generic info
  2. DATA: all input data must be selected/uploaded here.
  3. PROTOCOL: parameters related to the ARIA iterative protocols
  4. STRUCTURE CALCULATION: parameters related to generation of structures with CNS

In the DATA category, accepted data format for Molecule, Cross-peaks and Chemical shifts are: XML or CCPN. See the Example section for file format. Additional input data consists in restraints files formatted in CNS TBL format (or CCPN for Hydrogen bonds, distances and dihedral angle restraints). See the Example section for file format and the CNS documentation for more details.

In each category, classes of parameters are listed on the left menu. To show the corresponding form, activate it using the slider    . The slider color code is as follows: inactive, current, validated, not validated.

By default, only basic options are displayed in each form. Enable the Expert Mode to see more options (User menu ).

To navigate between categories, use the Back or Next buttons below each form (this will validate the form and save what you have just entered). When you attain the end of the form and all categories icons are green, click the Save button. You will be redirected to the list of structure calculations for your project. To start the structure calculation process, use the Start ARIA job button. You will be then redirected to the Results page for your project where you can follow the status of your structure calculation job (see the Job status section for explanations on the different status icons). Finished jobs will be listed here. Registered users will receive a-email upon job submission with a link to the results page.

When a structure job is finished, users can:

  • Download an archive containing the full ARIA run data using
  • Delete the structure calculation results (This action cannot be undone!)
  • Go to the visualization of the structure calculation results using

Need more details on ARIA structure calculation ? Read the ARIA documentation.

Results


As mentioned above, results for data conversion and structure calculation jobs are listed in the Results page of a project. The next figure is a screenshot of a typical Results page

Results screenshot

The status of a job is indicated as follows:

Icon Status
Building (data are uploading to the cluster)
Pending (waiting to be submitted on the cluster)
Running (job is running on the cluster)
Success (job terminated without error)
Error (job terminated with an error, check the log )

Results archive can be downloaded using .

For structure calculation jobs, clicking the button will show the Visualisation for the job.

Visualisation


The visualisation page allows to view an analysed the results of a structure calculation job. Final structure ensemble generated by ARIAweb are displayed interactively, with various representations. Final restraints statistics, structure quality checks and bundle RMSD are shown to help the user interprets the reliability of the results. On top of that, more graphs, restraints validation and PDB files can be downloaded directly.

Click here to view a live demo of the Visualisation page for structure calculation results.

The next figure is a screenshot of a typical Visualisation page, where main components are marked in red.

Dashboard screenshot

Results for all ARIA iterations can be shown by selecting an iteration in the Iterations tab. The main component of the Visualisation page is the NGL viewer. By default, several representations of the structure ensemble generated by ARIA are shown:

  • Sec. structure is a cartoon representation with secondary structure elements (helices and strands).
  • RMS NOE viol shows side-chains colored by the average RMS (Root Mean Square) of NOE violations (per residue) from low RMS (good) to high RMS (bad) (not available for Refine iteration).
  • RMSF bundle is a sausage representation with a radius proportional to the Root Mean Square Fluctuation (per residue) in the ensemble (hidden by default). More representations can be added with the add_circle_outline button.

The Representation tab (below the NGL viewer) allows to change the visibility, styling and coloring of a selected representation. By default, all structures in the bundle are shown; use the filter_frames button to select individual models for display).

The add_circle button allows to upload another PDB file to superimpose on the ARIA structure ensemble (use the layers button to superimpose an a selected structure).

Below, Restraints statistics recapitulate the number of restraints and the trend since the previous iteration for:

  1. Restraints used for structure calculation (the more, the better)
  2. Unsatisfied ("violated") restraints (less is better)
  3. Merged (identical) restraints

The Contribution histogram gives count of restraints (from each input spectra) that have 1 (i.e. unambiguous) or more (i.e. ambiguous) assignment possibilities (or "contributions"). Ultimately, ARIA tries to reduce the ambiguity in NOE assignments, producing more unambiguous restraints. Truly ambiguous assignments can remain due to spectral overlap or chemical shifts degeneracy.

The Ensemble RMSD graph shows, for each iteration, the RMSD of the structure ensemble generated by ARIA ("bundle"), computed as the mean RMSD when superimposing of the ensemble average (after iterative superimposition). Low RMSD (< 1-2 ) is good indicator of convergence of the NOE assignment and structure calculation process by ARIA.

The Quality checks panel summarizes the results of 3 main structural quality validators: WHAT-IF, Procheck and Molprobity. Quality scores are shown on a slider from bad to good values. Bad quality score values indicate that the input data may contain errors/inconsistencies and that ARIA was not able to produce a high quality model. We provide here some indicators on how to judge the quality checks:

  1. Procheck Ramachandran percentage: for typical NMR structures deposited in the PDB, 80% of the dihedral angles lie within the preferred region of the Ramachandran plot. For high-resolution NMR structures, a higher percentage is expected (90%).

  2. WHAT-IF Z-scores: WHAT-IF results are presented in the form of overall Z-scores. In general, structures with Z-scores between -2 and +2 are considered to be within a normal range and are thus good structures, while structures with Z-scores lower than -2 should be inspected further. Useful indicators of good quality are Backbone conformation and Packing quality. The bump-score also reports the number of van der Waals violations per 100 residues.

  3. WHAT-IF profiles: recently, some studies have stressed that global structural indicators are not sufficient to detect errors in structures and suggested examining parameters on a per-residue basis. Such profiles for the WHAT-IF scores are produced by ARIA in the form of a PDF file. Thus, poor quality regions can be precisely identified.

  4. Molprobity clashscore. this reports the number of overlaps >0.4 per thousand atoms. For typical NMR structures deposited in the PDB, this score is generally high (>10). From our experience, the application of the log-harmonic potential along with automated weight estimation significantly improves this situation (see the More results panels).

The More results panel provides links to:

  • PDB file of the structure ensemble
  • graph of NOE restraints violations RMS (PDF)
  • lists of NOE restraints assignments and violations (text)
  • graph of WHATCHECK scores per residue (PDF, it8 and refine)
  • a full ARIA run archive (ZIP)
  • standard summary table of structural and restraints statistics (it8 and refine)

We invite users to read the following book chapter to learn more about ARIA and on how to judge the quality and reliability of structures determined with ARIA from NMR data.

Bardiaux B, Malliavin T, Nilges M. ARIA for solution and solid-state NMR. Methods Mol Biol. 2012;831:453-83. https://doi.org/10.1007/978-1-61779-480-3_23

Demo


In the demo, we will calculate the structure of the HRDC domain using 2 NOE crosspeaks lists.

To perform the demo:

  1. Click on in your homepage

  2. Give a name to your project (e.g. demo) and click

  3. In the Data conversion panel, click on to automatically load sample data (sequence and cross-peaks/chemical shifts lists)

  4. Click on to save your conversion project.

  5. In the Data Conversion projects list, click on to submit your Data Conversion job.

  6. Wait until the end of the job (job status )

  7. Click on to start a Structure calculation with the converted data.

  8. Give a name to your Structure calculation project (e.g. demo) and click on

  9. Click on until the last form, then click to save your Structure calculation project.

  10. In the Structure Calculation projects list, click on to submit your Structure Calculation.

  11. Wait until the end of the job (job status )

  12. When the job is finished, click on to start the job analysis and visualize the results.

Click here to view a live demo of the Visualisation page for Structure Calculation results on the HRDC domain.

Help & more


Excerpts of accepted file formats for data conversion

Data type Format
XEasy View
NMRView View
ANSIG View
Chem/TBL View
Sparky View

Example data files

Tool Data type Format File
Project CCPN project tgz ccpn_example.tgz
Conversion Sequence SEQ seq_example.seq
Conversion Cross-peaks list XEasy noesy_xeasy.peaks
Conversion Chemical shifts list XEasy shifts_xeasy.prot
Structure calculation Molecule ARIA XML sequence.xml
Structure calculation Cross-peaks list ARIA XML noesy_peaks.xml
Structure calculation Chemical shifts list ARIA XML noesy_shifts.xml
Structure calculation Unambiguous distances restraints TBL unambig_example.tbl
Structure calculation Ambiguous distances restraints TBL ambig_example.tbl
Structure calculation H-Bond restraints TBL hbonds_example.tbl
Structure calculation Dihedral angle restraints TBL dihedrals_example.tbl
Structure calculation Scalar couplings restraints TBL karplus_example.tbl
Structure calculation RDC restraints TBL rdcs_example.tbl
Structure calculation Disulfide bridge restraints TBL ssbonds_example.tbl
Structure calculation DNA planarity restraints TBL planarity_example.tbl
Structure calculation Initial structure ensemble PDB template_iupac.pdb
Structure calculation Initial Structure For Minimization PDB werner_iupac.pdb

ARIA documentation

Detailed documentation about the methods and usage of the ARIA software can be found on the ARIA website.

Support

We invite user to send enquiries specific to the use the ARIAweb server using the Contact Us link . Questions about the ARIA software must be sent to the ARIA discussion group.

Limitations

  • The number of simultaneous running jobs per user is 2.
  • No more than 50 structures per iteration can be calculated.
  • The number of MD steps in the simulated-annealing is limited to 30000 for each stage (high temperature, cooling 1 and cooling 2).
  • Currently, ARIAweb only supports standard amino acid residues or DNA/RNA bases definitions and incorporation of Zinc ions in tetrahedral coordination. We recommend to use the standalone version of ARIA when modified residues or other organic ligands have to be included. See ARIA documentation for details.

Browser compatibility

OS Version Chrome Firefox MS Edge Safari
MacOS 10.14 78.0 70.0 n/a 12.1
Linux CentOS 7 78.0 70.0 n/a n/a
Windows 10 78.0 70.0 44.18362 n/a

Computations

ARIAweb works together with the Institut Pasteur Galaxy instance to:

  1. Manage ARIA tools and workflows.
  2. Run calculations on the underlying computing cluster.
  3. Keep track of run status and histories.

Typical execution times for Data conversion and Structure calculation jobs are around 5 min and 90 min, respectively.

Note: Execution time of jobs submitted via ARIAweb depends on the computing cluster load and the number of Pending/Running jobs as shown on the users's homepage.

Authors

ARIAweb is developed and maintained by F. Allain, F. Mareuil, T. Huynh and B. Bardiaux from the Structural Bioinformatics Unit and Bioinformatics and Biostatistics HUB at Institut Pasteur, Paris.

The development of ARIAweb is supported by the French Institute of Bioinformatics (IFB).

Elixir LogoARIA is part of the ELIXIR infrastructure. ARIA is an Elixir service. Read more.