MicroED Smallmoleculer
The fundamental concept of chirality

Chiral drugs refer to a pair of enantiomers that are mirror images of each other when a chiral center is introduced into the molecular structure. Each enantiomeric pair, when chemically pure, exhibits distinct physical and chemical properties (not just limited to optical activity) and can be named as R or S, D or L, P or M, or left-handed or right-handed, depending on different nomenclature conventions.

Currently, 50% of clinically used drugs are chiral drugs, and among the top ten globally sold drugs, 9 are chiral drugs. The prevalence of chiral drugs is due to the fact that the basic building blocks of drug targets, such as amino acids and nucleotides, are chiral structures.

Chirality is a crucial factor in molecular configuration. Organic compounds are categorized into various types of isomers in stereochemistry, including configurational isomers, geometric isomers, and optical isomers (enantiomers). The key to the formation of optical isomers lies in the molecule's asymmetry (chirality). Chiral molecules are organic compounds that cannot overlap with their mirror-image molecules.

Chiral molecules fall into three main categories:

Chiral molecules containing chiral atoms;

Axially chiral compounds (rotational isomers containing a chiral axis);

Optical isomers containing a chiral plane.

Common Naming Conventions for Chiral Molecules

R/S Notation:

The R/S system is a crucial nomenclature system for enantiomers. This method is based on the priority rules of atomic numbers. Each chiral center is assigned an R or S configuration based on the priority of substituents assigned by the system. When the orientation of the center places the lowest-priority substituent farthest from the observer, there are two possibilities: if the priorities of the remaining three substituents decrease in a clockwise direction, it is labeled as R (Rectus, Latin for right); if they decrease counterclockwise, it is labeled as S (Sinister, Latin for left). (R) or (S) is written in italics and enclosed in parentheses. If there are multiple chiral carbons, for example, (1R, 4S), the numbers specify the position of each configuration before the respective carbon.

Rules for determining R/S configuration:

a. Assign priorities to the groups connected to the chiral carbon in order.

b. Place the smallest group farthest from the observer.

c. Rotate the other three groups in a direction from largest to smallest. If the rotation is clockwise, the chiral carbon is in the R configuration; if counterclockwise, it is in the S configuration.

When comparing multi-atom groups, start by comparing the first atom. The atom with the higher atomic number comes first. If the first atoms are the same, compare the other atoms connected to the first atom. Start with the largest, then the middle, and finally the smallest until determining the order.

 

Figure 1: Absolute configuration of (S)-lactic acid

 

As shown in Figure 1, according to the rules of the R/S labeling method, the chiral center is connected to four groups: -H, -CH3, -OH, and -COOH. The first atoms connected to these groups are H, O, C, and C, respectively. Among them, the oxygen atom has the highest atomic number and is ranked first, while the hydrogen atom has the lowest atomic number and is ranked fourth. The remaining two are both carbon atoms, and their sizes cannot be compared directly. Therefore, we continue to compare down the chain. For the carbon in -CH3, the three atoms connected below, in descending order, are H, H, H. For the carbon in -COOH, the two atoms connected below, in descending order, are O, H. Starting with the largest atom, comparing -CH3's H with -COOH's O, O has a higher atomic number than H. Thus, -COOH has a higher priority than -CH3, and their priorities are 2 and 3, respectively.

Therefore, placing the smallest group farthest from the eyes, we can observe that the other three groups rotate in a counterclockwise direction, indicating an S configuration for the chiral carbon.

D/L Labeling Method:

Optical isomers can be named based on their spatial configuration of atoms. The D/L system (named after the Latin words dexter and laevus, meaning right and left) is implemented by associating the molecule with glyceraldehyde. Glyceraldehyde itself is chiral, and its two isomers are labeled as D and L. In this system, the naming of compounds is similar to glyceraldehyde, generating explicit names. The D/L label indicates the stereochemistry of a compound relative to the right- or left-handed enantiomer of glyceraldehyde. The right-handed isomer of glyceraldehyde is actually the D-isomer. Among the nineteen common L-amino acids in proteins, nine are right-handed (with a wavelength of 589 nm), and D-fructose, also known as levulose, is called a left-handed sugar because it is left-handed. An empirical rule for determining the D/L isomeric form of amino acids is the "CORN" rule.

Judgment rules for the D/L labeling method:

-COOH, -R, -NH2, and -H (where R is the side chain) are arranged around the chiral center carbon atom.

Hydrogen atoms are positioned away from the observer. If the groups CO → R → N are arranged counterclockwise around the central carbon atom, it is labeled as L-type. If arranged clockwise, it is labeled as D-type.

 

Figure 2: Absolute configuration of L-alanine

 

As shown in Figure 2, according to the D/L labeling method, the chiral center is connected to the -H, -COOH, -R, and -NH2 groups. Placing the smallest group (-H) farthest from the observer, we can see that the other three groups rotate around the carbon atom in the order of CO → R → N, and the rotation direction is counterclockwise, indicating an L configuration for the chiral carbon.

P/M Labeling Method:

For compounds containing a chiral axis, there is a lack of chiral centers corresponding to the use of the RS labeling method. We see that the chiral axis can be associated with a helix, and these molecules can be considered helical or helical propeller-like structures. The P/M labeling method is commonly used to label their configurations.

P/M Labeling Method Determination Rules:

Look from any end of the chiral axis and divide the compound into near and far groups along the chiral axis.

Determine the highest priority near group and the highest priority far group.

If moving from the highest priority near group to the corresponding far group requires a clockwise rotation, the helix is a right-handed helix and is described as P (or positive). Counterclockwise rotation means a left-handed helix and is specified as M (or negative).

 

Figure 3: P/M Labeling Method for Three Chiral Axis Compounds

 

As shown in Figure 3, according to the P/M labeling method, for compound A, the highest priority in the near group is the C atom, and in the far group, it is the Cl atom. By rotating in the C → Cl direction, the rotation is counterclockwise, indicating an M configuration. For compound B, the highest priority in the near group is the N atom, and in the far group, it is also the N atom. By rotating in the N → N direction, the rotation is clockwise, indicating a P configuration. For compound C, the highest priority in the near group is the C atom, and in the far group, it is also the C atom. By rotating in the C → C direction, the rotation is clockwise, indicating a P configuration.

 

Principles and Examples of Chiral Molecular Dynamics Refinement

Single crystal X-ray diffraction determines the absolute configuration of molecular structures through anomalous scattering. When the electrons on atoms are treated as free electrons, the scattering factor for X-rays is denoted as f0, and the phase angle difference between the scattering wave and the incident wave is a fixed value (π). However, different atoms exhibit varying abilities to bind electrons, leading to differences in the scattering capability of outer electrons compared to free electrons and resulting in a certain drift in the scattering phase angle. This phenomenon is wavelength-dependent and known as anomalous scattering.

On the other hand, MicroED (Micro Electron Diffraction) determines the absolute configuration of molecular structures through dynamic refinement. Compared to X-rays, electrons have a shorter wavelength and interact with matter many times stronger than X-rays, making them suitable for studying crystal materials at the nanometer level that X-ray diffraction cannot reach. Even in extremely small regions, strong electron diffraction can be obtained, making it particularly suitable for the analysis and identification of crystal phases in micro and nano regions. However, this advantage of electron diffraction also poses a challenge, as the diffraction patterns collected under TEM (Transmission Electron Microscopy) exhibit stronger dynamic effects, which can be a hindrance to structure resolution, as shown in Figure 4.

 

Figure 4 Significant Dynamic Effects Caused by Samples of Different Thicknesses

 

For chiral structures, there is a significant difference in the dynamic effects produced by the same compound in opposite chirality. When resolving the relative configuration of chiral compounds, we can use the differences in their dynamic effects to confirm the absolute configuration of chiral compounds through dynamic refinement.

 

Firstly, during data collection, we incorporate Precession Electron Diffraction (PED), which can average the impact of dynamic effects on diffraction patterns, as shown in Figure 5;

 

Figure 5 Schematic Diagram of the Working Principle of PED

1.Next, due to factors such as crystal thickness, mosaicity, bending, defects, etc., which are crucial for dynamic effects, during data processing, diffraction data suitable for dynamic refinement are selected based on the integration of diffraction point intensities in a double-hump diagram;

2.Finally, using the selected diffraction data, refine the compounds with different chiralities separately. The difference between the two makes it easy to determine the absolute configuration of unknown chiral compounds. Figure six illustrates 75 successful cases where the absolute configuration was determined through dynamic refinement.

 

CAS Number

Name

Molecular Formula

Non-hydrogen Atoms

68-41-7

D-Cycloserine

C3H6N2O2

7

339-72-0

L-Cycloserine

C3H6N2O2

7

73-32-5

L-Isoleucine

C6H13NO2

9

56-85-9

L-Glutamine

C5H10N2O3

10

464-49-3

D(+)-Camphor

C10H16O

11

464-45-9

(-)-Borneol

C10H18O

11

2216-51-5

(1R,2S,5R)-(-)-Menthol

C10H20O

11

616-91-1

N-Acetyl-L-Cysteine

C5H9NO3S

10

26016-98-8

Fosfomycin calcium

C3H5O4PCa

9

26016-99-9

Fosfomycin Disodium Salt

C3H5Na2O4P

10

66-84-2

D(+)-Glucosamine hydrochloride

C6H13NO5·HCl

13

13189-98-5

Fudosteine

C6H13NO3S

11

87-89-8

Inositol

C6H12O6

12

6284-40-8

N-Methyl-D-glucamine

C7H17NO5

13

61-76-7

(R)-(-)-Phenylephrine hydrochloride

C9H13NO2·HCl

13

73-22-3

L-Tryptophan

C11H12N2O2

16

81-13-0

D-Panthenol

C9H19NO4

14

51146-56-6

(S)-(+)-Ibuprofen

C13H18O2

15

72432-03-2

Miglitol

C8H17NO5

14

1119-34-2

L-Arginine hydrochloride

C6H14N4O2·HCl

13

555-30-6

3-(3,4-Dihydroxyphenyl)-2-methyl-L-alanine Sesquihydrate

C10H13NO4

15

7481-89-2

2′,3′-Dideoxycytidine

C9H13N3O3

15

104632-26-0

Pramipexole

C10H17N3S

14

69308-37-8

(R)-Baclofen

C10H12ClNO2

14

62571-86-2

Captopril

C9H15NO3S

14

527-07-1

D-Gluconic acid sodium salt

C6H11NaO7

14

3056-17-5

Stavudine

C10H12N2O4

16

6902-77-8

Genipin

C11H14O5

16

68373-14-8

Sulbactam

C8H11NO5S

15

69655-05-6

Didanosine

C10H12N4O3

17

96020-91-6

Eflornithine hydrochloride hydrate

C6H12F2N2O2·HCl·H2O

14

5080-50-2

O-Acetyl-L-carnitine hydrochloride

C9H17NO4·HCl

15

16595-80-5

Levamisol hydrochloride

C11H12N2S·HCl

15

867-81-2

Sodium D-Pantothenate

C9H16NNaO5

16

532-03-6

Methocarbamol

C11H15NO5

17

3424-98-4

Telbivudine

C10H14N2O5

17

102518-79-6

(-)-Huperzine A

C15H18N2O

18

147-94-4

Cytosine β-D-arabinofuranoside

C9H13N3O5

17

320-67-2

5-Azacytidine

C8H12N4O5

17

36791-04-5

Ribavirin

C8H12N4O5

17

121808-62-6

Pidotimod

C9H12N2O4S

16

33125-97-2

Etomidate

C14H16N2O2

18

609799-22-6

tasimelteon

C15H19NO2

18

50-91-9

5-Fluoro-2′-deoxyuridine

C9H11FN2O5

17

143491-57-0

Emtricitabine

C8H10FN3O3S

16

123441-03-2

Rivastigmine

C14H22N2O2

19

148553-50-8

Pregabalin

C8H17NO2

11

274901-16-5

Vildagliptin

C17H25N3O2

22

80433-71-2

Calcium Levofolinate

C20H21N7O7.Ca

35

797-63-7

D(-)-Norgestrel

C21H28O2

23

366789-02-8

Rivaroxaban

C19H18ClN3O5S

29

162359-55-9

Fingolimod

C19H33NO2

22

864070-44-0

Empagliflozin

C23H27ClO7

31

530-43-8

chloramphenicol palmitate

C27H42Cl2N2O6

37

486460-32-6

sitagliptin

C16H15F6N5O

28

697761-98-1

Elvitegravir

C23H23ClFNO5

31

137281-23-3

Pemetrexed

C20H21N5O6

31

641571-10-0

Nilotinib

C28H22F3N7O

39

668270-12-0

Linagliptin

C25H28N8O2

35

80621-81-4

Rifaximin

C43H51N3O11

57

66108-95-0

Iodoethanol

C19H26I3N3O9

34

114-07-8

Erythromycin

C37H67NO13

52

155213-67-5

Lopinavir

C37H48N6O5S2

50

1809249-37-3

Remdesivir

C27H35N6O8P

42

84625-61-6

Itraconazole

C35H38Cl2N8O4

49

117467-28-4

Cefuroxime Axetil

C25H28N6O7S3

41

50-55-5

Nifedipine

C33H40N2O9

44

220127-57-1

Glimepiride

C29H31N7O.CH4O3S

42

97682-44-5

Iletin

C33H38N4O6

43

231277-92-2

Lapatinib

C29H26ClFN4O4S

40

2252403-56-6

Sotorasib

C30H30F2N6O3

41

64-86-8

Colchicine

C22H25NO6

29

18531-94-7

R-1,1''-linked-2-naphthol (R)-(+)-BINOL

C20H14O2

22

602-09-5

(+/-)-1,1'-linked-2-naphthol, (+/-)-BINOL

C20H14O2

22

76189-55-4

(R)-(+)-2,2'-Bis(diphenylphosphino)-1,1'-binaphthyl, (R)-BINAP

C44H32P2

46

Figure 6: Absolute Configuration Refinement Cases