RAMP 08 Drug classification from molecular spectra

PROTO 204 ()


204 Rue André Ampère, Campus de l'Université Paris-Sud
Balázs Kégl (LAL)

Chemotherapy is one of the most used treatment against cancer. It uses chemical substances (chemotherapeutic agents) which kill cells that divide too quickly. These chemical substances are often diluted in a particular solution and packaged in bags, diffusers, or syringes, before being administered. Wrong medication (wrong chemotherapeutic agent or wrong concentration) can have major impacts for patients. To prevent wrong medication, some recent French regulations impose the verification of anti-cancer drugs before their administration. The goal is to check that they contain the good chemotherapeutic agent with the good dosage.

Raman spectroscopy could be used to make this check, since, theoretically, i) each molecule has a specific spectral fingerprint by which the molecule can be identified; and ii) the Raman intensity increases with the concentration of the molecule. The main advantage of spectroscopy above other methods (for example, liquid chromatography) is that it is non-destructive and non-invasive (measures are made without opening the drug containers). However, this method is rarely used in hospital environment because of the complexity of the spectral signals to analyze. Automating the analysis of these spectral signals could significantly help. Eventually, a complete analytical system (from measuring Raman spectra to identifying the chemotherapeutic agent and its concentration) could be designed, which would be easy to use and would prevent wrong medication.

In this context, the goal of this project is to develop prediction models able to identify and quantify chemotherapeutic agents from their Raman spectra.

The Lip(Sys)² laboratory measured Raman spectra of 4 types of chemotherapeutic agents (called molecule) in 3 different packages (called vial), diluted in 9 different solutions (called solute gammes), and having different concentrations. A total of 360 spectra were measured for each agent, except for one (348 spectra).

To sum up, there are too objectives:

  • classification: predict which molecule it corresponds to given the spectrum.

  • regression: predict the concentration of a molecule. The prediction should not depend on the vial or the solute group. The error metric is relative error.


  • Alexandre Bredimas
  • Alexandre Gramfort
  • Ali TFAYLI
  • Balázs Kégl
  • Bartosz Telenczuk
  • Camille Marini
  • Carl Levasseur
  • Damien Mourot
  • Djalel Benbouzid
  • Edwige Lelièvre
  • Florence Drouet
  • Frédéric Schmidt
  • Gabriel Rovina
  • Harizo Rajaona
  • Hervé Bertin
  • Issam Benabid
  • Julien Nauroy
  • Laurent Ribiere
  • Loïc Estève
  • Marc Evrard
  • Mehdi Cherti
  • Rime Michael-Jubeli
  • Robin Monnier
  • Soobash Daiboo
  • Sourava Prasad Mishra
  • Thomas Moreau
  • Tom Dupré la Tour
  • Victor Estrade
  • Yohann Sitruk
    • 09:00 09:30
      Welcome of participants & Coffee
    • 09:30 10:00
      Introduction talk
    • 10:00 12:00
      Session 1
    • 12:00 13:30
      Buffet & presentation of the first results
    • 13:30 15:00
      Session 2
    • 15:00 15:30
    • 15:30 17:00
      Session 3
    • 17:00 18:00
      Debriefing and closing