Publications
Publications are listed in reverse chronological order.
2024
- [arXiv] Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models. arXiv preprint arXiv:2409.14785, 2024
- [arXiv] Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey. arXiv preprint arXiv:2409.11564, 2024
- [EMNLP] SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages. arXiv preprint arXiv:2406.10118, 2024
- [arXiv] ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models. arXiv preprint arXiv:2406.09334, 2024
- [EMNLP Findings] MINERS: Multilingual Language Models as Semantic Retrievers. arXiv preprint arXiv:2406.07424, 2024
- [arXiv] Lessons from the Trenches on Reproducible Evaluation of Language Models. arXiv preprint arXiv:2405.14782, 2024
- [ACL Findings] SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages. arXiv preprint arXiv:2402.08638, 2024
- [ACL] Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages. arXiv preprint arXiv:2404.06138, 2024
- [EMNLP Findings] LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization. arXiv preprint arXiv:2401.06034, 2024
2023
- [arXiv] Bloom: A 176b-parameter open-access multilingual language model. arXiv preprint arXiv:2211.05100, 2023
- [SEALP] IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems. In Proceedings of the First Workshop in South East Asian Language Processing, 2023
- [AACL] Efficient Zero-Shot Cross-lingual Inference via Retrieval. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
- [AACL] NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
- [Machine Learning] Transfer learning application of self-supervised learning in ARPES. Machine Learning: Science and Technology, 2023
- [arXiv] Multilingual Few-Shot Learning via Language Model Retrieval. arXiv preprint arXiv:2306.10964, 2023
- [EMNLP] GlobalBench: A Benchmark for Global Progress in Natural Language Processing. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
- [EMNLP] Multilingual Large Language Models Are Not (Yet) Code-Switchers. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
- [CALCS] Prompting multilingual large language models to generate code-mixed texts: The case of South East Asian languages. In Proceedings of the 6th Workshop on Computational Approaches to Linguistic Code-Switching, 2023
- [ACL Findings] NusaCrowd: Open source initiative for Indonesian NLP resources. In Findings of the Association for Computational Linguistics: ACL 2023, 2023
- [ACL Findings] Multi-lingual and Multi-cultural Figurative Language Understanding. In Findings of the Association for Computational Linguistics: ACL 2023, 2023
- [ACL Findings] Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning. In Findings of the Association for Computational Linguistics: ACL 2023, 2023
- [ACL] On “Scientific Debt” in NLP: A Case for More Rigour in Language Model Pre-Training Research. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
- [ICAICTA] Implementing Quantization to Indonesian BERT Language Model. In 2023 10th International Conference on Advanced Informatics: Concept, Theory and Application (ICAICTA), 2023
- [EACL] Towards a Unified Multi-Domain Multilingual Named Entity Recognition Model. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
- [ACL Findings] The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges. In Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
- [SumEval] IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages. SumEval 2022, 2022
- [ACL] BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting. arXiv preprint arXiv:2212.09535, 2022
- [AACL] Cross-lingual Few-Shot Learning on Unseen Languages. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022
- [SumEval] IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages. In Proceedings of the First Workshop on Scaling Up Multilingual Evaluation, 2022
- [arXiv] Transfer Learning Application of Self-supervised Learning in ARPES. arXiv preprint arXiv:2208.10893, 2022
- [arXiv] NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages. arXiv preprint arXiv:2207.10524, 2022
- [EMNLP Demo] GEMv2: Multilingual NLG Benchmarking in a Single Line of Code. arXiv preprint arXiv:2206.11249, 2022
- [Accepted at TMLR] Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. 2022
- [ACL] One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia. Accepted at ACL, 2022
- [LREC] CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition. Accepted at LREC, 2022
- [LREC] ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation. Accepted at LREC, 2022
2021
- [arXiv] NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation. arXiv preprint arXiv:2112.02721, 2021
- [arXiv] Few-Shot Bot: Prompt-Based Learning for Dialogue Systems. arXiv preprint arXiv:2110.08118, 2021
- [arXiv] Greenformer: Factorization toolkit for efficient deep neural networks. arXiv preprint arXiv:2109.06762, 2021
- [Interspeech] Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition. INTERSPEECH, Aug 2021
- [SIGDIAL]
- [RepL4NLP] Exploring Fine-tuning Techniques for Pre-trained Cross-lingual Models via Continual Learning. RepL4NLP, Aug 2021
- [arXiv]
2020
- [AACL-IJCNLP] IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding. In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, Aug 2020
- [arXiv] Emograph: Capturing emotion correlations using graph networks. arXiv preprint arXiv:2008.09378, Aug 2020
2019
- [MRQA] Generalizing Question Answering System with Pre-trained Language Model Fine-tuning. In EMNLP 2019 MRQA Workshop, Aug 2019
- [FinNLP] Learning to learn sales prediction with social media sentiment. In Proceedings of the First Workshop on Financial Technology and Natural Language Processing, Aug 2019
2018
- [arXiv] Towards end-to-end automatic code-switching speech recognition. arXiv preprint arXiv:1810.12620, Aug 2018
- [arXiv] Learn to code-switch: Data augmentation using copy mechanism on language modeling. arXiv preprint arXiv:1810.10254, Aug 2018
- [ICASSP] End-to-End Dynamic Query Memory Network for Entity-Value Independent Task-oriented Dialog. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Aug 2018
2017
- [DSTC6] End-to-end recurrent entity network for entity-value independent goal-oriented dialog learning. Wu, Chien-Sheng, et al. In Dialog System Technology Challenges Workshop, DSTC6, Aug 2017
- [Interspeech]