Contents

Calculating the sum formula

Various properties can be derived from the sum formula. The simplest way of calculating the sum formula is to type the sequence (e.g. AGCT) into an online calculator such as this one.

If this kind of tool is not available or does not include the modifications (non-standard nucleotides) present in the molecule of interest, it is also possible to add the unit formulae of each base, then subtract PO3H (because there is no phosphate linkage at the 3'-end), and add H2O (to account for the ends) to give the formula of the oligonucleotide:

Sum formula=ini×iPO3H+H2O=nA×A+nG×G+...PO3H+H2O,\footnotesize\begin{equation*} \textrm{Sum formula} = \sum\limits_i n_i \times i - \textrm{PO}_3\textrm H + \textrm H_2\textrm O = n_\textrm{A} \times \textrm A + n_\textrm G \times G + ... - \textrm{PO}_3\textrm H + \textrm H_2\textrm O, \end{equation*}

where i\footnotesize\sum\limits_i indicates a sum over the unit sum formulae of nucleotides i. These are given in Table 1.

BaseFormula (DNA)Formula (RNA)
AC10H12O5N5PC10H12O6N5P
GC10H12O6N5PC10H12O7N5P
CC9H12O6N3PC9H12O7N3P
TC10H13O7N2P(C10H13O8N2P)
U(C9H11O7N2P)C9H11O8N2P

The structures of the DNA molecule dAGCT and the RNA molecule AGCU are shown in Fig. 1. Under physiological conditions, the phosphate oxygen atoms are deprotonated; as shown for AGCU.

Chemical structure of the DNA oligonucleotide dAGCT (left) and the RNA oligonucleotide AGCU (right)
Figure 1
Chemical structure of the DNA oligonucleotide dAGCT (left) and the RNA oligonucleotide AGCU (right)

For dAGCT, Eqn. (1) becomes

Sum formula=nA×A+nG×G+...PO3H+H2O                                                  \footnotesize{ \textrm{Sum formula} = n_\textrm{A} \times \textrm A + n_\textrm G \times \textrm G + ... - \textrm{PO}_3\textrm H + \textrm H_2\textrm O~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ }
=1×A+1×G+1×C+1×TPO3H+H2O                                                          \footnotesize{ = 1 \times \textrm A + 1 \times \textrm G + 1 \times \textrm C + 1 \times \textrm T - \textrm{PO}_3\textrm H + \textrm H_2\textrm O~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ }
=C10H12O5N5P+C10H12O6N5P+C9H12O6N3P+C10H13O7N2PPO3H+H2O\footnotesize{ = \textrm C_{10}\textrm H_{12}\textrm O_5\textrm N_5\textrm P + \textrm C_{10}\textrm H_{12}\textrm O_6\textrm N_5\textrm P + \textrm C_9\textrm H_{12}\textrm O_6\textrm N_3\textrm P + \textrm C_{10}\textrm H_{13}\textrm O_7\textrm N_2\textrm P - \textrm{PO}_3\textrm H + \textrm H_2\textrm O }
=C39H50O22N15P3.                                                                                                                 \footnotesize{ = \textrm C_{39}\textrm H_{50}\textrm O_{22}\textrm N_{15}\textrm P_3.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ }

The sum formula of AGCU is C38H48O26N15P3. The deprotonated formulae are C39H47O22N15P33− (dAGCT) and C38H45O26N15P33− (AGCU).

Molecular weight

The molecular weight (MW), or, more correctly, molar weight (see the IUPAC Gold Book for commonly accepted definitions), is defined as the average weight of one mole (6.0221 ×\footnotesize\times 1023 particles) of a given atom or molecule relative to 12C, the weight of which is defined as exactly 12 grams per mole (g/mol).

Calculating the molecular weight

The molecular weight can be calculated from the sum formula by using standard values of the molar weight of each atom. For example, each chlorine atom contributes approximately 35.5 to the molecular weight. These values are typically not whole numbers, partly because of isotopes (chlorine occurs naturally as roughly 75% 35Cl and 25% 37Cl).

There are online calculators that convert sum formulae to molecular weights, such as this one from Lenntech.

Molecular weight can be used to calculate concentration and yield, as well as convert between masses and amounts:

c=nV,\footnotesize\begin{equation*} c = \frac{n}{V}, \end{equation*}

where c is the concentration, n is the amount and V is the volume;

m=n×MW,\footnotesize\begin{equation*} m = n \times \textrm{MW}, \end{equation*}

where m is the mass. A reaction yield is calculated via

Yield=nproductnstarting material×100%,\footnotesize\begin{equation*} \textrm{Yield} = \frac{n_\textrm{product}}{n_\textrm{starting material}} \times 100\%, \end{equation*}

and n can be obtained from the experimentally measurable mass via Eqn. (3).

Molecular mass

The molecular mass is defined as the weight of a molecule relative to 12C, which is defined as exactly 12 Dalton (Da) or atomic mass units (amu). (The analogous property of an atom is called mass number.)

Monoisotopic mass

When calculating the molecular mass, individual isotopes are considered separately. The monoisotopic mass is calculated using the mass of the most common isotope of each element. This is often also the lightest isotope.

Mass spectrometry

Mass spectrometry is the only experimental technique that can distinguish between isotopes, and therefore the only technique with which the mass is more important than the molecular weight.

Comparing molecular weight and molecular mass

For small molecules, the molecular weight is often the same as the monoisotopic mass (when rounded to the nearest integer). As the molecule gets larger, the two properties diverge.