2,300
views
0
recommends
+1 Recommend
1 collections
    4
    shares

      UK Computing Summit 2025: Navigating change (surviving and beyond) - 29-30 April @ Sheffield Hallam University - Register here.

      scite_
       
      • Record: found
      • Abstract: found
      • Conference Proceedings: found
      Is Open Access

      Enterprise Master Patient Index Entity Recognition by Long Short-Term Memory Network in Electronic Health Systems

      Published
      proceedings-article
      , , ,
      Proceedings of the 32nd International BCS Human Computer Interaction Conference (HCI)
      Human Computer Interaction Conference
      4 - 6 July 2018
      Entity recognition, Long short-term memory (LSTM), Deep learning, Machine learning

            Abstract

            Named-entity recognition (NER) is the application of information extraction by artificial intelligence (AI) to locate and classify conceptual entities from natural language into pre-defined categories. In this study, we apply the Long Short-Term Memory network (LSTM) networks to identify the patient entities from the Enterprise Master Patient Index (EMPI). A sample dataset with 300,000 deidentified patient records is used to test the LSTM performance for EMPI entity recognition. The data entries are firstly converted into strings and represented by a Word2Vec model with 200 dimensions. Two LSTM models are developed for the NER recognition problem. The first LSTM model uses a multi-classifier with a softmax function, the second LSTM model uses a two-step classification procedure by binary logistic function. To evaluate the LSTM performance, we use a conventional deep neural network model for comparison, where the Levenshtein distance is used to represent the training data patterns. The classification performance is evaluated by ten-fold cross-validation. The two-step LSTM model has the classification accuracy of 99.82%, which is superior to both the multi-classification LSTM classifier at 61.08% and to the conventional deep neural network at 95.08%. Therefore, we conclude that the new two-step LSTM model provides an accurate and reliable solution to recognize the EMPI patient entities when it is properly configured and trained.

            Content

            Author and article information

            Contributors
            Conference
            July 2018
            July 2018
            : 1-4
            Affiliations
            [0001]York University

            4700 Keele St., Toronto, Canada
            [0002]Guangzhou Univ Chinese Med

            111 Dade Rd, Guangzhou, China
            [0003]Dapasoft INC

            111 Gordon Baker Rd, Toronto, Canada
            Article
            10.14236/ewic/HCI2018.181
            6bd048cc-0bc4-49d2-835f-a158c92e3d60
            © Liang et al. Published by BCS Learning and Development Ltd. Proceedings of British HCI 2018. Belfast, UK.

            This work is licensed under a Creative Commons Attribution 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

            Proceedings of the 32nd International BCS Human Computer Interaction Conference
            HCI
            32
            Belfast, UK
            4 - 6 July 2018
            Electronic Workshops in Computing (eWiC)
            Human Computer Interaction Conference
            History
            Product

            1477-9358 BCS Learning & Development

            Self URI (article page): https://www.scienceopen.com/hosted-document?doi=10.14236/ewic/HCI2018.181
            Self URI (journal page): https://ewic.bcs.org/
            Categories
            Electronic Workshops in Computing

            Applied computer science,Computer science,Security & Cryptology,Graphics & Multimedia design,General computer science,Human-computer-interaction
            Machine learning,Deep learning,Entity recognition,Long short-term memory (LSTM)

            REFERENCES

            1. 2016 Why Patient Matching Is a Challenge: Research on Master Patient Index (MPI) Data Discrepancies in Key Identifying Fields Perspectives in Health Information Management 13:1e. eCollection 2016

            2. 2015 Developing a common health information exchange platform to implement a nationwide health information network in South Korea Health Informatics Research 21 1 21 29

            3. 2012 Named entity recognition in tweets: an experimental study Conference on Empirical Methods in Natural Language Processing Jeju Island, Korea July 12-14 1524 1534 Association for Computational Linguistics Stroudsburg PA, USA

            4. 2010 Word representations: a simple and general method for semi-supervised learning The 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden July 11-16 384 394 Association for Computational Linguistics Stroudsburg PA, USA

            5. 2017 Entity recognition in the biomedical domain using hybrid approach Journal of Biomedical Semantics 8 1 51

            6. Gilleland M: Levenshtein distance, in three flavors 2009 Merriam Park Software url: http://www.merriampark.com/ld.htm. (April 15, 2018)

            Comments

            Comment on this article