Article
An S., Bae K., Choi E., Choi S. J., Choi Y., Hong S., et al. (2024), EXAONE 3.0 7.8B Instruction Tuned Language Model, arXiv e-prints, arXiv-2408.
Asimopoulos D., Siniosoglou I., Argyriou V., Karamitsou T., Fountoukidis E., Goudos S. K., et al. (2024), Benchmarking Advanced Text Anonymisation Methods: A Comparative Study on Novel and Traditional Approaches, In 2024 13th International Conference on Modern Circuits and Systems Technologies (MOCAST), 1-6, IEEE. https://doi.org/10.1109/MOCAST61810.2024.10615642
Bae K., Choi E., Choi K., Choi S. J., Choi Y., Hong S., et al. (2025), EXAONE Deep: Reasoning Enhanced Language Models, arXiv e-prints, arXiv-2503.
Bhati B. S., Ivanchev J., Bojic I., Datta A., Eckhoff D. (2021), Utility-Driven K-Anonymization of Public Transport User Data, IEEE Access, 9, 23608-23623. https://doi.org/10.1109/ACCESS.2021.3055505
Chen H., Xiong C., Xie J. M., Cai M. (2020), Privacy Protection Method for Vehicle Trajectory Based on VLPR Data, J. Adv. Transp., 2020(1), 6026140. https://doi.org/10.1155/2020/6026140
Chibelushi C., Sharp B., Salter A. (2005), Transcript Segmentation Using Utterance Cosine Similarity Measure, In Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science (NLUCS 2005), 78-90. https://doi.org/10.5220/0002560900780090
Conneau A., Khandelwal K., Goyal N., Chaudhary V., Wenzek G., Guzmán F., et al. (2019), Unsupervised Cross-Lingual Representation Learning at Scale, arXiv preprint, arXiv:1911.02116. https://doi.org/10.18653/v1/2020.acl-main.747
Ghasemzadeh M., Fung B. C., Chen R., Awasthi A. (2014), Anonymizing Trajectory Data for Passenger Flow Analysis, Transp. Res. Part C: Emerg. Technol., 39, 63-79. https://doi.org/10.1016/j.trc.2013.12.003
Hwang H. (2020), Issues and Challenges in the Use of Pseudonymized Information: Focusing on the Use of Pseudonymized Sensitive Data, Korea Insurance Research Institute.
Jang S., Cho Y., Seong H., Kim T., Woo H. (2024), The Development of a Named Entity Recognizer for Detecting Personal Information Using a Korean Pretrained Language Model, Appl. Sci., 14(13), 5682. https://doi.org/10.3390/app14135682
Kim B., Yoo M., Park K. C., Lee K. R., Kim J. H. (2021), A Value of Civic Voices for Smart City: A Big Data Analysis of Civic Queries Posed by Seoul Citizens, Cities, 108, 102941. https://doi.org/10.1016/j.cities.2020.102941
Kim H. T., Jang G. Y. (2023), Current and Alternative Methods of Data Alias Processing (Pseudonymization) Techniques, KIRI Research Report, 2023(7), Korea Insurance Research Institute, 1-85.
Kim J. S. (2024), Development and Evaluation of De-Identification Pipeline for Korean Clinical Notes Using Natural Language Processing, Master's Thesis, Seoul National University, Seoul, South Korea.
Kim M., Lee S. (2014), Measures of Abnormal User Activities in Online Comments Based on Cosine Similarity, J. Korea Inst. Inf. Secur. Cryptol., 24(2), 335-343. https://doi.org/10.13089/JKIISC.2014.24.2.335
Korea Institute of Science and Technology Information (KISTI) (2017), Final Report on the Field Application Project for Creation and Distribution of De-Identified Personal Information.
Lee H. G., Yoo G. D. (2024), Integrated Data Safe Zone Prototype for Efficient Processing and Utilization of Pseudonymous Information in the Transportation Sector, J. Korean Soc. Intell. Transp. Syst., 23(3), 48-66. https://doi.org/10.12815/kits.2024.23.3.48
Lee H. S., Song J. H. (2016), A Research on De-Identification Technique for Personal Identifiable Information, Software Policy & Research Institute, 8.
Ministry of the Interior and Safety (2025), 2025 Public Data Provision and Data-Driven Administration Evaluation Guidelines.
Nahid M. M. H., Hasan S. B. (2024), SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy, arXiv preprint, arXiv:2412.20641. https://doi.org/10.32388/PJIL3E
Nergiz M. E., Atzori M., Saygin Y. (2008), Towards Trajectory Anonymization: A Generalization-Based Approach, In Proceedings of the SIGSPATIAL ACM GIS 2008 International Workshop on Security and Privacy in GIS and LBS, 52-61. https://doi.org/10.1145/1503402.1503413
Park G. C. (2020), Big Data Analysis of Civil Complaints Using Text Mining Techniques: Gangnam-gu Office, Seoul Digital Foundation.
Park J. H., Kang M. G. (2024), Spatial Information Extraction and Basic Analysis from 120 Dasan Call Civil Complaint Texts Through Named Entity Recognition Modeling, J. Korea Plan. Assoc., 59(7), 169-180. https://doi.org/10.17208/jkpa.2024.12.59.7.169
Personal Information Protection Commission, Korea Internet & Security Agency (KISA) (2024), Guideline for Pseudonym Information Processing.
Rakhimzhanov D., Belginova S., Yedilkhan D. (2025), Automated Classification of Public Transport Complaints via Text Mining Using LLMs and Embeddings, Infor., 16(8), 644. https://doi.org/10.3390/info16080644
Sang E. F., De Meulder F. (2003), Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition, arXiv preprint, cs/0306050.
Seo D. K., Kim K. W., Kim J. Y., Lee D. H. (2020), Personal Information Detection and De-Identification System Using Sentence Intent Classification and Named Entity Recognition, KIPS Annual Conference Proceedings, 27(2), 1018-1021.
Shafaeipour N., Stanciu V. D., van Steen M., Wang M. (2024), Understanding the Protection of Privacy when Counting Subway Travelers Through Anonymization, Comput. Environ. Urban Syst., 110, 102091. https://doi.org/10.1016/j.compenvurbsys.2024.102091
Thetbanthad P., Sathanarugsawait B., Praneetpolgrang P. (2025), Automated Redaction of Personally Identifiable Information on Drug Labels Using Optical Character Recognition and Large Language Models for Compliance with Thailand’s Personal Data Protection Act, Appl. Sci., 15(9), 4923. https://doi.org/10.3390/app15094923
Vakili T., Henriksson A., Dalianis H. (2024), End-to-end Pseudonymization of Fine-Tuned Clinical BERT Models: Privacy Preservation with Maintained Data Utility, BMC Med. Inform. Decis. Mak., 24(1), 162. https://doi.org/10.1186/s12911-024-02546-8, PMID: 38915012, PMCID: PMC11197357
Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., et al. (2017), Attention is All You Need, In Advances in Neural Information Processing Systems (Vol. 30).
Walter M., Beskorovajnov W., Lieberwirth F., Sürmeli J., Zwick P., Heinrich R. (2023), Mobility Data Anonymization - A Literature Review and an Industry-Driven Survey, Karlsruhe Reports in Informatics, 2023(3), Karlsruhe Institute of Technology (KIT).
Wandelt S., Zheng C., Wang S., Liu Y., Sun X. (2024), Large Language Models for Intelligent Transportation: A Review of the State of the Art and Challenges, Appl. Sci., 14(17), 7455. https://doi.org/10.3390/app14177455
Wang S., Sun X., Li X., Ouyang R., Wu F., Zhang T., et al. (2025), GPT-NER: Named Entity Recognition via Large Language Models, In Findings of the Association for Computational Linguistics: NAACL 2025, 4257-4275. https://doi.org/10.18653/v1/2025.findings-naacl.239
- Publisher: Korean Society of Transportation
- Publisher (Ko): 대한교통학회
- Journal Title: Journal of Korean Society of Transportation
- Journal Title (Ko): 대한교통학회지
- Volume: 44
- No: 1
- Pages: 99-113
- Received Date: 2025-11-24
- Revised Date: 2025-12-17
- Accepted Date: 2025-12-23
- DOI: https://doi.org/10.7470/jkst.2026.44.1.099








