Blood Journal
Leading the way in experimental and clinical research in hematology

Consensus recommendations for risk stratification in multiple myeloma: report of the International Myeloma Workshop Consensus Panel 2

  1. Nikhil C. Munshi1,2,
  2. Kenneth C. Anderson1,
  3. P. Leif Bergsagel3,
  4. John Shaughnessy4,
  5. Antonio Palumbo5,
  6. Brian Durie6,
  7. Rafael Fonseca3,
  8. A. Keith Stewart3,
  9. Jean-Luc Harousseau7,
  10. Meletios Dimopoulos8,
  11. Sundar Jagannath9,
  12. Roman Hajek10,
  13. Orhan Sezer11,
  14. Robert Kyle12,
  15. Pieter Sonneveld13,
  16. Michele Cavo14,
  17. S. Vincent Rajkumar12,
  18. Jesus San Miguel15,
  19. John Crowley16,
  20. Hervé Avet-Loiseau7, and
  21. on behalf of the International Myeloma Workshop Consensus Panel 2
  1. 1Dana-Farber Cancer Institute, Boston, MA;
  2. 2Boston Veterans Administration Healthcare System, West Roxbury, MA;
  3. 3Mayo Clinic, Scottsdale, AZ;
  4. 4University of Arkansas for Medical Sciences, Little Rock, AR;
  5. 5University of Torino, Torino, Italy;
  6. 6Cedars-Sinai Medical Center, Los Angeles, CA;
  7. 7Universitaire de Nantes, Nantes, France;
  8. 8University of Athens, Athens, Greece;
  9. 9Mt Sinai Medical Center, New York, NY;
  10. 10University of Brno, Brno, Czech Republic;
  11. 11University Medical Center, Hamburg, Germany;
  12. 12Mayo Clinic, Rochester, MN;
  13. 13Erasmus MC, Rotterdam, The Netherlands;
  14. 14University of Bologna, Bologna, Italy;
  15. 15University of Salamanca, Salamanca, Spain; and
  16. 16Cancer Research & Biostatistics, Seattle, WA

Abstract

A panel of members of the 2009 International Myeloma Workshop developed guidelines for risk stratification in multiple myeloma. The purpose of risk stratification is not to decide time of therapy but to prognosticate. There is general consensus that risk stratification is applicable to newly diagnosed patients; however, some genetic abnormalities characteristic of poor outcome at diagnosis may suggest poor outcome if only detected at the time of relapse. Thus, in good-risk patients, it is necessary to evaluate for high-risk features at relapse. Although detection of any cytogenetic abnormality is considered to suggest higher-risk disease, the specific abnormalities considered as poor risk are cytogenetically detected chromosomal 13 or 13q deletion, t(4;14) and del17p, and detection by fluorescence in situ hybridization of t(4;14), t(14;16), and del17p. Detection of 13q deletion by fluorescence in situ hybridization only, in absence of other abnormalities, is not considered a high-risk feature. High serum β2-microglobulin level and International Staging System stages II and III, incorporating high β2-microglobulin and low albumin, are considered to predict higher risk disease. There was a consensus that the high-risk features will change in the future, with introduction of other new agents or possibly new combinations.

Introduction

Multiple myeloma is a heterogeneous disease with variable disease courses, response to therapy, and survival outcome that ranges from less than 1 year in patients with aggressive disease to more than 10 years in patients with indolent disease presentation. Various patient-, disease-, and therapy-related characteristics have been identified to predict the disease course as well as outcome in patients with myeloma. Such evaluation of prognostic factors and risk stratification is important to define treatment strategies, compare outcome of therapeutic trials, and predict survival from diagnosis. This consensus panel report has evaluated various aspects of risk stratification, including its purpose and timing, especially at diagnosis and at relapse, its relationship to therapy, and defined clinical and laboratory features, including genomic changes, that may be used to stratify patients and predict outcome at present.

Purpose of risk stratification

The general purpose of risk stratification is not to decide whether or not to treat a patient but to prognosticate.

The decision to treat is based on the criteria set for in the diagnosis of symptomatic myeloma, which includes the hypercalcemia, renal dysfunction, anemia, and bone lesions criteria.1 Patients with clearly defined monoclonal gammopathy of undetermined significance or SMM do not need initiation of therapy, irrespective of any associated risk factors, except on specifically targeted protocols. For example, if a patient with clear diagnosis of SMM has 17p− by fluorescence in situ hybridization (FISH) or del13 on cytogenetics analysis, it does not constitute an indication to start therapy. The risk stratification being described here is only for determining prognosis and stratification of treatment, rather than to decide whether to initiate treatment.

There has been general consensus on the risk factors that help stratify patients receiving conventional therapeutic modalities. However, there are studies that suggest that bortezomib and, to an extent, lenalidomide may be able to overcome some of the poor-risk features and achieve clinical benefit.28 Further studies are needed to decide on the ability of these agents to overcome the poor-risk features. At the present time, it is important to stratify, but the available information does not indicate selection of therapies—for example, if patient has t(4:14)—it does not suggest that we should use a specific therapy or novel agent.

Currently, to mandate definitive treatment according to cytogenetic abnormalities is premature, although there are emerging data suggesting that some of the novel agents could overcome the negative prognosis of the cytogenetic abnormalities.9

It is important to continue to assess the impact of risk factors with novel therapies and combinations. Clinical trials should be done based on risk stratification, to try and test whether certain patients benefit more or less from certain therapeutic agents or strategies.

Timing of risk stratification

At diagnosis

There is consensus that the current risk stratification is applicable to newly diagnosed patients. All current efforts are directed at stratification at diagnosis using the parameters obtained at diagnosis. The suggested investigations are listed in Table 1.

View this table:
Table 1

Investigation for risk stratification

At relapse

There has been documentation of change in risk factors at relapse. For example, in one study, the mean labeling index increased from 1% at diagnosis to 2.5% at relapse. If patients are followed individually, it is always higher at relapse than at diagnosis. Similar data exist for detection of p53 deletion at relapse or disease progression compared with time of diagnosis.

The evolving consensus is that, if a patient acquires high-risk features at relapse or progression, then that patient should be reclassified as having high-risk disease. For example, if a patient was not detected to have del(17p) at diagnosis but at relapse 60% cells show this change, then it is our consensus that this patient now should be reclassified as having high-risk disease.

There is general consensus that the same genetic abnormalities characteristic of poor outcome at diagnosis may suggest poor outcome if detected at the time of relapse.

Among the nongenetic risk factors, redetermination of factors, such as β2-microglobulin or International Staging System (ISS) at relapse or at follow-up, is not currently considered as predictive of change in risk stratification. The role of level of serum lactate dehydrogenase (LDH) at relapse is less clear. A very high LDH is considered to represent proliferative disease. High LDH levels are uncommon in myeloma but carry poor prognosis at diagnosis or relapse.1012

Moving forward, an important goal for relapse trials would be to evaluate these and other risk factors at relapse, keeping in consideration the type of therapy used at relapse. This is important as novel therapies are becoming available and patients are living longer.

In patients with relapsed disease, additional risk stratification criteria include type of response and length of response to prior therapy. Therapy-related poor-risk features include progression while on therapy and short duration of response.6,13 Unlike in the past, speed of response does not suggest overall outcome with newer agents.

If a patient already has an identified high-risk feature at diagnosis, then there is no need to look for the same feature again. For example, if a patient at diagnosis has t(4;14), then one does not need to look for it again at relapse with the same FISH probe. However, cytogenetic and FISH investigation should be performed at relapse to look for additional changes. If a patient is in a low-risk group, then it is recommended that cytogenetics and FISH studies be performed at relapse for risk restratification as if not detected at the presentation, at relapse, because of selection of a preexisting clone they may attain a detectable level.

Is risk stratification specific for specific treatment?

There is general agreement that the risk stratification should be a global stratification, and not stratification for old versus new therapy or risk stratification for particular one treatment. We recognize that the risk features may be relevant to a given therapy. For example, when patients with del13 are considered to have poor prognosis, it is based on a large number of studies focused on outcome after high-dose therapy and transplantation. However, with the use of novel agents, for example bortezomib, del13 does not seem to be predictive of high risk.14,15 Thus, risk factors for individual novel therapies are to be determined on an ongoing trial.

There was a consensus that the high-risk features will change in the future, with introduction of other new agents or possibly new combinations.

It is unclear whether risk stratification should change after patients receive certain treatments. For example, bortezomib is able to overcome, at least in part, the poor risk associated with t(4;14)2; do we need to identify different risk factors for patients after bortezomib treatment? The general opinion was that there is not adequate information to implement such a recommendation.

What risk factors to look for

There is a consensus that both cytogenetics and FISH play an important and independent role in risk stratification. Both FISH with specific markers and cytogenetics with specific abnormalities need to be performed on bone marrow samples.

Although detection of any cytogenetic abnormality is considered to suggest higher-risk disease, the specific abnormalities considered as poor risk are: cytogenetically detected chromosomal 13 or 13q deletion, t(4;14), and del17p; and detection by FISH of t(4;14), t(14;16), and del17p.16

High serum β2-microglobulin level and ISS stage II and III incorporating high β2-microglobulin and low albumin are considered to predict higher-risk disease.17

What additional risk factors to look for

A number of individual risk factors have been identified. However, there is, in general, emphasis to use a system that combines multiple factors, such as ISS. Some of these factors were considered in developing the ISS risk stratification systems.

Because of lack of uniform availability of the data for analysis, which led to proposal of the ISS, there are a number of factors that still may have a significant role in risk stratification as individual factors (eg, LDH was not available for all patients and was not considered in developing ISS). However, in the limited patients who had this information, LDH was found to have significant influence in identifying risk.

Some of the features considered significant as individual factors are LDH, IgA, extramedullary disease, renal failure, high serum-free light chain, and serum-free κ/serum-free λ ratio, plasmablastic disease, and plasma cell leukemia.12,1822 These features are useful under some circumstances, but their general applicability is unknown. In addition, it is very much a constellation of features that often determine high risk, rather than a single factor that may make it intermediate risk. Unlike FISH/cytogenetics, which may suggest a change in therapeutic approach to more aggressive treatment, no change in treatment approach is currently indicated based on such single higher-risk features.

Consensus for evaluation of genomic changes

As described in “What risk factors to look for,” there is a consensus that both cytogenetics and FISH have some adverse risk features. Both highlight different disease parameters, and both preferably should be performed to have a better understanding of the behavior and biology of the disease.

FISH data should be reported specifically for clonal plasma cells determined by surface marker or cytoplasmic immunoglobulin light chain expression, and not all cells. The positivity is to be determined by the percentage of positive cells that are above the individual laboratories' standard.

No specific global cutoff should be applied. It is unclear whether the number of positive cells carries any different risk. For example, if a patient has 7% versus 57% cells positive for a specific FISH abnormality, the relative risk for both patients is considered the same at present. This is not true for del(17p). In a report, del(17p) is prognostic only if present in at least 60% of the plasma cells.2

There is consensus that (1) detection of t(4;14), t (14;16), or17p by FISH suggests higher-risk disease; (2) del13 or 13q− detected only by FISH independently in the absence of other abnormality does not carry significant higher risk, whereas t(11;14) does not predict superior outcome; and (3) there are some reports that 1q+, del1p may have clinical significance as a poor-risk feature; however, the consensus is that the data are not yet adequate to suggest routine use of these FISH markers to predict prognosis.

ISS

The ISS, incorporating serum albumin and β2-microglobulin, is applicable as a prognostic system in the majority of settings. ISS is validated for conventional treatments as well as high-dose therapy.17 However, its validity with combination novel agent therapy still needs to be confirmed.

The method used for measurement should be standard. The ISS, although extremely convenient to use, requires incorporation of additional myeloma-specific features to make it more robust or more applicable using the newer generation of drugs and studies.

The ISS is a baseline lowest common denominator, to be supplemented and not necessarily supplanted. There is a clear need and consensus to add cytogenetics/FISH or other markers to ISS.

DS classification system

A clinical staging system at diagnosis using standard laboratory measurement, developed by Durie and Salmon, was predictive of clinical outcome after standard-dose chemotherapy. However, with the use of high-dose therapy and novel agents, the Durie-Salmon (DS) system is less predictive of outcome.17,23 This may be explained by the fact that the DS system is focused predominantly on tumor burden; and as these newer therapies are able to better reduce tumor burden, its significance has changed. There is increasing importance of tumor biology–related factors. DS system is still considered a means to measure tumor mass.

There is general agreement that the DS system can supplement the diagnostic criteria for myeloma, such as hypercalcemia, renal dysfunction, anemia, and bone disease; however, if a patient has already been diagnosed as having symptomatic myeloma based on current criteria, then there is no need to use the DS system in regular practice for diagnosis. As only patients with symptomatic disease should be placed in clinical trials, reporting of DS system is not considered essential. As stage I represents early stage of disease, description of patients in clinical studies by DS staging system is encouraged. However, its routine clinical use is unclear.

Incorporation of imaging results

The number of bone lytic lesions, per DS system, is not considered of any prognostic significance. Although there are small single-institution studies indicating that achieving magnetic resonance imaging–directed complete remission has prognostic significance, this observation requires further studies to include imaging parameters in risk stratification or response definition.24 Similarly, a recent study has pointed to the presence of more than 3 fluorodeoxyglucose-avid focal lesions as the leading independent parameter associated with inferior overall and event-free survival.25 However, these results require further independent confirmation before they are widely applied.

None of the imaging studies or results are currently recommended for inclusion in risk stratification.

Inclusion of expression/genomic profile

Expression profile data generated by a number of groups have been very helpful in identifying an expression signature that may identify a poor-risk group. Shaughnessy et al26 investigated the expression profile of myeloma cells in 532 newly diagnosed myeloma patients treated on 2 protocols incorporating tandem autologous transplantation. Using log-rank tests of expression quartiles, 70 genes linked to shorter durations of complete remission, event-free survival, and overall survival were identified. The ratio of mean expression levels of up-regulated to down-regulated genes defined a high-risk score, which was an independent predictor of outcome endpoints in multivariate analysis (P < .001) that included the ISS and high-risk translocations. A subset of patients with a high-risk score had a 3-year continuous complete remission rate of only 20%, as opposed to a 5-year continuous complete remission rate of 60% in the absence of a high-risk score. Interestingly, multivariate discriminant analysis identified a 17-gene subset that performed as well as the 70-gene model.26

A second large study published by Decaux et al27 from the Intergroupe Francophone du Myelome studied gene expression profiles of myeloma cells obtained at diagnosis in 182 patients and identified the 15 strongest genes to calculate a risk score associated with the length of survival. This analysis divided patients into high-risk group, characterized by the overexpression of genes involved in cell cycle progression and its surveillance, and low-risk patients, with hyperdiploid signature and heterogeneous gene expression. The results were confirmed in a test set, as well as independent cohorts composed of 853 patients with multiple myeloma. Overall survival at 3 years in low-risk and high-risk groups was 91% and 47%, respectively. These results were independent of traditional prognostic factors.27

It is interesting to note that, although both these studies have included patients undergoing high-dose therapy, the 15 and 17 gene models do not share common genes. This highlights the complexity of biologic behavior of the tumor and the fact that ultimate use of such expression data will require significantly more work. Functional commonality or functional association between these various genes needs to be considered in developing a more composite model. It also highlights the molecular redundancy in tumor cells driving their clinical behavior.

The factors that require standardization are method used to assess expression profile, the data analysis technique, consensus and validation of genes to be considered important for risk stratification, and standardization of method to apply this definition to expression profile for a single patient.

A more robust and comprehensive analysis is needed to analyze significance of stratification using comparative genomic hybridization/single nucleotide polymorphism array.

In the future, a specific polymorphism may help identify patients with differential response profile and/or higher risk of toxicity. However, currently, there is lack of data to propose any specific single nucleotide polymorphisms that can be used for such decision.

Consideration of risk factors in special therapeutic scenario

There are emerging data that allogeneic transplantation may have beneficial outcomes in high-risk patients defined by cytogenetics/FISH. These data are limited and require further confirmation.28 However, the group feels that allogeneic transplantation could be considered in this group of patients.

The current level of evidence does not provide direction in deciding whether patients with a specific risk group will benefit from maintenance therapy.

Authorship

Contribution: N.C.M., K.C.A., P.L.B., J.S., A.P., B.D., R.F., A.K.S., J.-L.H., M.D., S.J., R.H., O.S., R.K., P.S., M.C., S.V.R., J.S.M., J.C., and H.A.-L. developed the consensus, provided critical review and edits to the manuscript, gave approval to the final manuscript, and participated significantly in the development of the consensus and the writing of the manuscript.

Conflict-of-interest disclosure: N.C.M. is a consultant/advisory board member of Millenium, Celegene, Novartis, and Onyx. K.C.A. is a consultant/advisory board member of Millenium, Celegene, Novartis, Merck, BMS, Signalgenetics, and Onyx and cofounded and owns stock in Acytelon. R.F. has received a patent for the prognostication of MM based on genetic categorization of the disease and received consulting fees from Medtronic, Otsuka, Celgene, Genzyme, BMS, and Amgen and research support from Cylene and Onyx. J.S. has been awarded patents, or has submitted patent applications on molecular diagnostics and therapeutics in cancer medicine; receives royalties related to patent licenses from Genzyme, Novartis, and Signal Genetics LLC; cofounded and owns stock in Signal Genetics LLC, a molecular diagnostics company; consulted/advised Signal Genetics LLC, Array Biopharma, Celgene, Genzyme, Millennium, Centocor Ortho Biotech, and Novartis; and received honoraria from Array Biopharma, Celgene, Centocor Ortho Biotech, Genzyme, Millennium, and Novartis. A.K.S. is an advisor for Signalgenetics. O.S. has received honoraria from Amgen, Celgene, Janssen, and Novartis. M.C. has received honoraria from Janssen-Cilag, Millennium Pharmac, and Celgene. The remaining authors declare no competing financial interests.

Correspondence: Nikhil C. Munshi, Dana-Farber Cancer Institute, 44 Binney St, Boston, MA 02115; e-mail: nikhil_munshi{at}dfci.harvard.edu; and Hervé Avet-Loiseau, 42 rue Jules Simon, 44000 Nantes, France; e-mail: herve.avetloiseau{at}chu-nantes.fr.

Acknowledgments

The authors thank the following colleagues for their participation on the Consensus Panel: Drs Shaji Kumar (Rochester, MN), David S. Siegel (Hackensack, NJ), Philippe Moreau (Nantes, France), Hermann Einsele and Seema Singhal (Chicago, IL), Irene M. Ghobrial (Boston, MA), Gösta Gharton (Stockholm, Sweden), James R. Berenson (West Hollywood, CA), Jayesh Mehta (Chicago, IL), Angela Dispenzieri (Rochester, NY), Sagar Lonial (Atlanta, GA), Ruben Niesvizky (New York, NY), Robert Schlossman (Boston, MA), David H. Vesole (New York, NY), Asher Chanan-Khan (Buffalo, NY), Jeffrey Wolf (San Francisco, CA), Michael Kuehl (Bethesda, MD), Johannes Drach (Vienna, Austria), Rik Schots (Brussels, Belgium), Giampaolo Merlini (Pavia, Italy), Maurizio Zangari (Salt Lake City, UT), Jeffrey Zonder (Detroit, MI), L. Thompson Heffner Jr (Atlanta, GA), Ramon Garcia Sanz (Valencia, Spain), Philip R. Greipp (Rochester, MN), and Vinod Raina (New Delhi, India).

Footnotes

  • This work was developed as part of the 12th International Myeloma Workshop, Washington, DC, February 26–March 1, 2009.

  • Submitted October 14, 2010.
  • Accepted January 6, 2011.

References

View Abstract