John Snow Labs Announces State-of-the-Art Enhancements to its Spark NLP Technology, Resulting in 2.5M Downloads and 9x Growth in 2020


John Snow Labs, the AI and NLP for healthcare company and developer of the Spark NLP library, today announced that it has crossed the 2.5 million download mark, experiencing 9x growth of its Spark NLP technology since January 2020. Used by 54% of healthcare organizations, Spark NLP has secured the spot as the world’s most widely used natural language processing (NLP) library in the enterprise, after only four years on the market. This rapid growth can be attributed to a long series of enhancements made to Spark NLP’s state-of-the-art library, which now comes with more than 300 production-grade pre-trained models and pipelines, used by leading healthcare and life sciences companies.

John Snow Labs released 26 new versions of Spark NLP in 2019 and another 26 in 2020, with the most recent being Spark NLP for Healthcare 2.7. The most significant feature in this latest release is Text to SQL, and other upgrades include more accurate entity resolution and clinical named entity recognizers, new PICO classifier for evidence-based medicine, new biomedical named entity recognizers, and new clinical and traffic accident NER models in German. These models are pre-trained with clinical BioBERT based embeddings, the most powerful contextual language model in the clinical domain today, making it an easy-to-use, best-in-class solution for healthcare NLP projects.

This news comes on the heels of John Snow Labs’ release of its new named entity recognition (NER) model and classifier for Adverse Drug Events (ADE), announced in October. While the ADE NER model helps extract ADE and drug entities from a given text, the new ADE Classifier is trained on various ADE datasets, including academic texts, social media, and clinical notes. By combining ADE NER and Classifier, pre-trained pipelines are already fitted using certain annotators and transformers according to various use cases, saving users from building it from scratch.

“The enhancements to Spark NLP for Healthcare beat state-of-the-art benchmarks in relation extraction, named entity recognition, and adverse event detection, and are successfully being used now by leading healthcare and life science companies,” said David Talby, CTO at John Snow Labs. “We are humbled by the rapid growth of our community, and expect to continue on this trajectory as we improve our NLP technology even further in the new year.”

Not only is John Snow Labs committed to improving how companies from Kaiser Permanente to Roche realize the value of AI and NLP, but also providing education and resources to help further NLP knowledge and adoption. In addition to hosting the first-ever NLP Summit in October, the company, in partnership with Gradient Flow, issued new research exploring the state of NLP in 2020. The research found that Spark NLP was cited as the most popular NLP library across all industries, with more than half of healthcare-affiliated respondents indicating this is what their organization currently uses. In response to this, John Snow Labs will host the first NLP for Healthcare Summit, being held from April 6-9.

To learn more about Spark NLP for Healthcare, or start your free trial, visit:

About John Snow Labs

John Snow Labs, the AI and NLP for healthcare company, provides state-of-the-art software, models, and data to help healthcare and life science organizations

build, deploy, and operate AI projects. Developer of Spark NLP, the world’s most widely used NLP library in the enterprise, John Snow Labs’ award-winning clinical NLP software powers leading healthcare and pharmaceutical companies including Kaiser Permanente, McKesson, Merck, and Roche. The company is the creator and host of The NLP Summit, further educating and advancing the NLP community.