Acessibilidade / Reportar erro

Core electron binding energy (CEBE) shifts applied to structure activity relationship (SAR) analysis of neolignans

Abstracts

Core electron binding energy shifts (DCEBE's) and CEBE of carbon atoms calculated with the semi empirical HAM/3 method were shown to serve as a useful descriptor for SAR analysis of six neolignans studied. Using five selected DCEBE's of carbon atoms in the two phenyl rings of the compounds, the compounds were well separated by HCA, PCA, KNN and SIMCA methods.

DCEBE; descriptor; neolignans; HAM/3; SAR; PCA


O deslocamento da energia de ligação do elétron do caroço (DCEBE) e o CEBE de átomos de carbono, calculados com o método semi-empírico HAM/3, foram utilizados como descritores no estudo das relações estrutura-atividade (SAR) de seis neolignanas. Os resultados obtidos demonstraram a eficiência deste tipo de descritores, nas análises de SAR. Usando-se cinco valores selecionados de CEBE, dos carbonos presentes nos anéis fenílicos das neolignanas, foi possível classificá-las nas duas categorias, ativa e inativa, usando-se os métodos HCA, PCA, KNN e SIMCA.


Article

Core Electron Binding Energy (CEBE) Shifts Applied to Structure Activity Relationship (SAR) Analysis of Neolignans

M. Cristina A. Costa and Yuji Takahata* * e-mail: taka@iqm.unicamp.br

Instituto de Química, Universidade Estadual de Campinas, CP 6154, 13083-862 Campinas - SP, Brazil

O deslocamento da energia de ligação do elétron do caroço (DCEBE) e o CEBE de átomos de carbono, calculados com o método semi-empírico HAM/3, foram utilizados como descritores no estudo das relações estrutura-atividade (SAR) de seis neolignanas. Os resultados obtidos demonstraram a eficiência deste tipo de descritores, nas análises de SAR. Usando-se cinco valores selecionados de CEBE, dos carbonos presentes nos anéis fenílicos das neolignanas, foi possível classificá-las nas duas categorias, ativa e inativa, usando-se os métodos HCA, PCA, KNN e SIMCA.

Core electron binding energy shifts (DCEBE's) and CEBE of carbon atoms calculated with the semi empirical HAM/3 method were shown to serve as a useful descriptor for SAR analysis of six neolignans studied. Using five selected DCEBE's of carbon atoms in the two phenyl rings of the compounds, the compounds were well separated by HCA, PCA, KNN and SIMCA methods.

Keywords: DCEBE, descriptor, neolignans, HAM/3, SAR, PCA

Introduction

The choice of molecular descriptors is one of the most crucial parts in the work of SAR/QSAR. Many descriptors have been suggested and employed. Many of them are successful and well accepted. Some descriptors that can be calculated by quantum mechanical methods have been recognized as useful in QSAR. However there still remains room to search for alternative and/or better descriptors than those in use, especially those descriptors that can be evaluated theoretically. One of our objectives is to look for more useful descriptors than thus far used, employing mainly quantum mechanical and/or other theoretical and computational methods. Lindberg et al. had shown that core electron binding energy correlate linearly to the Hammett sigma constants (s) in substituted benzene derivatives.1 Let us consider electrophilic aromatic substitution at para position of mono substituted benzene Ph-X as an example. X in Ph-X is a substituent such as OH, NH2, NO2 etc. Linderberg et al.1 demonstrated the validity of an equation similar to equation 1:

Left hand side of equation 1 is the difference between core electron binding energy (CEBE) of para carbon of Ph-X and CEBE of carbon atom of benzene Ph-H, which is the reference molecule. The left hand side of equation 1 is called CEBE shift or DCEBE. The right hand side of equation 1 is a product between a constant k and a Hammett sigma constant at para position, sXp, of Ph-X. Equation 1 is an approximate equation. There are equations similar to equation 1 at other carbon atoms in the molecule such as ortho and meta positions in Ph-X. Linear relationship between DCEBE and Hammett sigma constants is not limited to monosubstituted benzenes. There are corresponding equations for multiply substituted benzenes. Straight lines were obtained by plotting experimentally observed CEBE values of Ph-X with respect to corresponding Hammett sigma constants sX.1 This is a demonstration of the validity of equation 1. Recently we reconfirmed this2 by calculating accurate DCEBE of the ring carbon in mono substituted benzene (Ph-X) in relation to the ring carbon in Ph-H. Density functional theory (DFT) was employed for the calculation of the DCEBE . Good agreement between the calculated CEBE and the Hammett s constant3 of the corresponding atom was obtained.2 Since Hammett sigma constant is one of the important descriptors in QSAR analysis,4 we can expect that DCEBE calculated theoretically can be also a useful descriptor in QSAR. Hammett sigma constants are usually determined experimentally. However in many drug molecules, Hammett sigma constant is not available. The object of the present work is firstly, to calculate DCEBE of a set of selected molecules whose Hammett sigma constants are not known, and secondly, to investigate whether or not the DCEBE is related to the biological activity of the molecules.

We chose six neolignans in which three of them are inactive and the other three are active against leishmaniasis. Figure 1 shows a skeleton of the neolignans and Table 1 list the six selected molecules and classes of biological response (active or inactive). They were taken from our previous publication.5 All the six molecules have the common basic skeleton.


Method of Calculation

We used the molecular geometry calculated by MM2 method previously.6 The semi-empirical HAM/3 (Hydrogenic Atoms in Molecules, version 3)7 method was used to calculate CEBE of the compounds. As far as we know, HAM/3 is the only semi-emprical method that can calculate CEBE's of a molecule. The widely used and well known semi-empirical method such as AM1 and ZINDO are not capable of calculating CEBE of a molecule. From our previous experience, average absolute deviation of CEBE's calculated by HAM/3 is expected to be about 1.50 eV.8 This is much larger than 0.3 eV that was attained by non-empirical DFT.2 Only advantage of HAM/3 is its much higher speed of calculation in comparison to non-empirical DFT. Since neolignan is fairly large molecule and calculation of CEBE has to be done one atom at a time, HAM/3 is a method of choice. We calculated CEBE's of 14 carbon atoms, C1-C14, in each molecule, that comprise of all the 12 carbon atoms in the two benzene rings plus two carbons that bridge the two benzene rings (see Figure 1). DCEBE's were calculated by equation 2, taking the difference between the calculated CEBE of each of the molecules and the value of 286.20 eV which is the CEBE of a carbon atom in an isolated benzene molecule calculated by HAM/3.

Then, Fisher's weights of the DCEBE's were calculated. Some top greatest values of the weights were selected as useful descriptors for SAR analysis. Pattern recognition methods9 such as principal component analysis (PCA), hierarchical clustering analysis (HCA), K-nearest neighbors (KNN) and SIMCA were employed to study relation between the selected descriptors and the biological activity (SAR). The data were preprocessed by the method of autoscaling, then they were employed in the pattern recognition methods.

Results and Discussions

Table 2 and Table 3 list calculated CEBE's and DCEBE's respectively of five carbon atoms C1, C2, C4, C9 and C11 (See Figure 1) selected. These carbon atoms have the top five greatest Fishers' weights among the whole set of the 14 carbon atoms. The first three atoms, C1, C2, C4, belong to the A-Ring; the other two, C9 and C11, belong to the B-Ring of neolignan. Figure 2 shows a dendogram produced with HCA using the DCEBE's in Table 3. The linkage method used was that of single link. The scale numbers on top part of the figure indicate similarity. The active compounds (5, 4 and 6) are grouped together upper part of the figure, while inactive ones (1, 2 and 3) are grouped together lower part of the figure. The two groups are well separated. Figure 3 shows score plot produced by PCA using the DCEBE's in Table 3. The x-axis represents the first principal component (PC1), while y-axis represents the second principal component (PC2). All the three active compounds (4, 5, 6) are located extreme right hand side on x-axis, while all the three inactive compounds (1, 2, 3) are located extreme left hand side of x-axis. The active group is well separated from the inactive group. The two principal components (PC1 and PC2) are given in equations 3 and 4.



PC1 explains 79.0% of variance and PC2 explains 15.0%. Cumulate variance up to PC2 is, therefore, 94.0%. Equation 3 indicates that all the five selected carbon atoms contribute more or less the same magnitude. The outstanding descriptors in equation 4 are C9 and C1. Figure 4 shows the loading graph for the five descriptors. The DCEBE's at C2 and C4 are mainly responsible for pulling the active group (4, 5 and 6) towards right hand side in the score graphics. The DCEBE's of the three inactive compounds (1, 2 and 3) are generally smaller than those of active compounds. This is especially true at C2 (Table 3). These are the reasons why the three inactive compounds (1, 2 and 3) are located extreme left in Figure 3. We also used KNN, and SIMCA methods using the five selected descriptors. All of the 6 compounds were correctly classified by both KNN and SIMCA.


Instead of DCEBE (Table 3), we also used CEBE values themselves (Table 2) to see if they work as useful descriptors in SAR analysis with the pattern recognition methods. The results were completely identical to those obtained with DCEBE. This is due to the fact that the difference between CEBE and DCEBE is the constant (equations 1 and 2). After preprocessing of the data sets, the processed CEBE and DCEBE data sets become identical. Equation 1 can be rewritten in the form of equation 5,

In equation 5, CEBE (Ph-H) is a constant because it is the CEBE of benzene which is the reference molecule. CEBE (Ph-X) is a linear function of variable sX with slope k. The linearity of equation 5 was shown in figures in the literatures.1,2 Equation 5 shows that if DCEBE's work as descriptors in SAR analysis, CEBE(Ph-X)'s themselves equally work as descriptors in SAR analysis. This situation is what we have confirmed numerically. We used mono substituted benzene (Ph-X) to discuss equations 1 and 5. But we can extend the discussions to multi substituted benzenes without loss of generality.

The Hammett equation, equation 6, correlates the equilibrium (or rate) constants (K) with the substituent constants sX for a system concerned X :

Here the subscript 0 denotes a reference system, and r, the reaction constant, is specific for the reaction considered. In case of chemical equilibrium under constant temperature, left hand side of equation 6 is linearly proportional to the difference of the change of free energy of Gibbs (DDG) in chemical/biological reactions between the system concerned and its reference system. DDG is directly related to the relative affinity of interaction between the ligand and the biological target in the system concerned. This is the reason why Hammett sigma constants s are so widely employed in the area where chemical and/or biological reactions are concerned. Comparison between equation 1 and equation 6 immediately reveals that DCEBE is a quantity that is linearly proportional to DDG. DCEBE has similar interpretability to the Hammett sigma constant s.

Equation 5 indicates that CEBE (Ph-X) itself has a similar interpretability as DCEBE. Since in SAR studies, it is the relative quantity of DDG that is important. Absolute value of DDG is not necessary for the most of the cases. Both DCEBE and CEBE are approximately proportional to DDG. This is the reason why they work in SAR analysis.

Number of compounds we worked in the present work is only six. This number is very small. The first reason why we worked with such a small set of molecules is that we wanted a quick and preliminary test if DCEBE (and CEBE) calculated with HAM/3 would serve as useful descriptor for SAR. Secondly, PCA works well even number of compounds are as small as six. This was demonstrated in our previous publication.10 We are currently working with a large number of compounds in order to see if DCEBE can really be one of useful descriptors in SAR/QSAR.

Conclusion

DCEBE (and CEBE) calculated with HAM/3 method was shown to serve as useful descriptor for SAR analysis of the six neolignans studied. Using five selected DCEBE's, the compounds were well separated by HCA, PCA, KNN and SIMCA methods. CEBE and its shift (DCEBE) of an atom in a molecule reflect faithfully its chemical environment. Since DCEBE is linearly proportional to Hammett sigma constant, there is no surprise that DCEBE (and CEBE) demonstrated its usefulness in SAR. The conclusion thus far described is of a temporary nature, because the number of samples treated is very limited. Definite and general conclusion can be drawn only when a large number of samples with different types of molecules are treated.

Acknowledgements

MCAC thanks FAPESP for a post-doctoral fellowship and YT thanks CNPq for a research fellowship.

Received: June 7, 2002

Published on the web: December 4, 2002

FAPESP helped in meeting the publication costs of this article.

  • 1. Linderberg, B.; Svensson, S.; Malmquist, P. A.; Basilier,E.; Gelius U.; Siegbahn, K.; Chem. Phys. Lett., 1976, 40, 175.
  • 2. Takahata, Y.; Chong, D.P.; Bull. Chem. Soc. Jpn., 2000, 73, 2453.
  • 3. Hammett, L. P.; J. Am. Chem. Soc, 1937, 59, 96.
  • 4. Kubinyi, H. In QSAR: Hansch Analysis and Related Approaches; Manhold, R., Krogsgaard-Larsen, Timmermman, T.; eds., VCH: Weinhheim, 1993, vol. 1, p.4.
  • 5. Costa, M.C.A.; Barata, L.E.; Takahata, Y.; J Mol. Struct. (THEOCHEM), 1995, 340, 185.
  • 6. Costa, M.C.A.; Takahata, Y.; J. Comput. Chem, 1997, 18, 712.
  • 7. Åsbrink, L.; Fridh, C.; Lindholm, E.; Chem. Phys. Lett 1977, 52: 63; ibid, 1977, 52, 69; ibid, 1977, 52, 72.
  • 8. Takahata, Y.; J. Mol. Struct. (THEOCHEM) 1987, 150, 309.
  • 9. Beebe, K. R.; Pell, R. J.; Seasholtz, M. B.; Chemometrics: A Practical Guide, John Wiley & Sons, Inc.: New York,1998, pp. 56-182.
  • 10. Vendrame R.; Ferreira , M. M. C.; Collins C. H.; Takahata Y.; J. Mol. Graph. Model. 2002, 20, 345.
  • *
    e-mail:
  • Publication Dates

    • Publication in this collection
      11 Feb 2003
    • Date of issue
      Nov 2002

    History

    • Accepted
      04 Dec 2002
    • Received
      07 June 2002
    Sociedade Brasileira de Química Instituto de Química - UNICAMP, Caixa Postal 6154, 13083-970 Campinas SP - Brazil, Tel./FAX.: +55 19 3521-3151 - São Paulo - SP - Brazil
    E-mail: office@jbcs.sbq.org.br