Using Spark NLP in Python to identify named entities in texts at scale. Named Entity Recognition with Python TL;DR: Named Entity Recognition (NER) is a Natural Language Processing (NLP) technique...
Using machine learning and regex patterns to identify and extract date information in Spark NLP TL; DR: Dates extraction from a text is a common Natural Language Processing (NLP) task...
Creating visualizations for analysis and reporting using Spark NLP and Spark NLP display. TL;DR: Visualizing Named Entities is a technique for representing the results of named entity recognition (NER) in...
Use pretrained models, segment texts into words, and train custom word segmenter models with Python. TL; DR: Some Asian languages don’t separate words by white space like English, and NLP...
De-Identification is a process that needs to be applied to de-identify (anonymize) or obfuscate (replace with fake entities) PHI (protected health information) data from clinical notes. Obfuscation of PHI entities...