mesolitica/synthetic-jawi-conversation
收藏Synthetic Jawi Conversation 数据集概述
基本信息
- 任务类别: 文本生成
- 语言: 马来语 (ms)
数据集来源
- 原始数据集来自: mesolitica/mixtral-malaysian-general-qa
- 转换为Jawi文字
数据文件
- 文件名:
synthetic-jawi-conversation.jsonl - 行数: 509,375 行
- 大小: 13 GB
示例数据
- 角色: context, user, assistant
- 内容: 包含马来语和Jawi文字的对话内容
示例
json [ {"role": "context", "content": "din Venue: DK7, Block J, 3rd Floor, Faculty of Medicine 14 21.06.2023 9.00am – 12.00pm (SDL): to turn in Reflection write up (online, synchronous) Lecturer: Prof. Dr. Fung Shin Yee / Dr. Muhamad Alfakri B. Mat Noh 4 Links to the online platforms (Zoom, GoogleMeet, Microsoft Teams and others) will be shared via the class rep a few days before the class. All students registered for this course must join the class Whatsapp group for notifications. Slides and pre-recorded videos will be uploaded on Spectrum for your reference. Online synchronous: Be present to join the online class at the designated time as stated in the timetable. Attendance will be taken up to 30 mins after class has begun. Class material will be shared on Spectrum and the students can access the information in their own time. Self-directed learning: Students use this time to prepare for presentations or assignments. Lecturers: Name Affiliation Adams, Peter J University of Auckland, New Zealand Anwar Norazit Biomedical Science Chang Li Yen Medical Microbiology Noran Naqiah Binti Mohd Hairi Social and Preventative Medicine Fung Shin Yee Molecular Medicine Haryanti Azura bt Mohammad Wali Animal Experimental Unit, FOM Leo Bey Fen Central Unit of Advanced Research Imaging, CENTUARI Mahmood Danee Social and Preventative Medicine Mohd Shafiq Azanan Paediatrics Department Muhamad Alfakri B. Mat Noh Anatomy Ng Kwan Hoong Biomedical Imaging Nur Azah Hamzaid Head, Research Training Unit, ADeC Phan Chia Wei Clinical Investigation Centre Rozaida Poh Biomedical Science Tan Ai Huey Research Management Unit, FOM Thamil Selvee Ramasamy Molecular Medicine Zanaria Saupi Udin UM Main Library For any enquiries: Prof. Dr. Fung Shin Yee - Programme Coordinator 03-7967 5745 (syfung@ummc.edu.my) Dr. Muhamad Alfakri B. Mat Noh - Programme Coordinator 03-7967 4731 (alfakri@um.edu.my) Muhd Shahrul Harris, The Dean’s Office, Faculty of Medicine 03-7967 7503 (muhdharris@um.edu.my) Updated 04.01.2023 mailto:syfung@ummc.edu.my mailto:alfakri@um.edu.my mailto:muhdharris@um.edu.my Yanti Idaya, AMK., et al. (Eds.): ICOLIS 2021, Kuala Lumpur: DLIS, FASS, 2021 24 Digital Repository for English Language Teaching and Learning Resources: A Panacea in the Time of the Pandemic Grace Hui Lin Tan and Yanti Idaya Aspura M.K. Department of Library and Information Science, Faculty of Arts and Social Sciences, Universiti Malaya, Kuala Lumpur, MALAYSIA e-mail: tangracehl@gmail.com; yanti@um.edu.my ABSTRACT The Coronavirus disease 2019 (COVID-19) pandemic poses a huge challenge to the education system in Malaysia. During this critical time, schools were closed and teachers were required to conduct lessons via the online platform. Thus, the use of digital teaching resources increased exponentially as these resources made lessons more effective and also more interesting. As these resources had to be systematically stored for easy retrieval, a digital repository prototype was developed specially to support English Language instruction at SMK Canossian Convent (SMKCC), Kluang, Johor. The repository facilitates the systematic storage and easy retrieval of the image, text and video collections of the English Language teaching and learning resources using the DSpace system. This was implemented using the ADDIE model which consists of five phases, namely analysis, design, development, implementation and evaluation. A total of 75 teaching and learning resources were uploaded into the system, accompanied by selected Dublin Core metadata fields which include Title, Author, Keyword, Issue Date, Publisher, Description, Uniform Resource Identifier (URI), Type and Language. A survey to evaluate the level of satisfaction was then carried out among users of this digital repository prototype. The results showed that 100% of the respondents found the digital repository useful, easy to use, easy to learn, and were satisfied in using it to browse, search and retrieve any particular item stored in the collection. This project has, indeed, delivered a good prototype of the digital repository service to the targeted users. Suggestions from users were noted for project improvement and future development of the digital repository. Keywords: Digital repositories; DSpace; ADDIE model; Dublin Core metadata; English Language instruction INTRODUCTION On 11 March 2020, the World Health Organisation (WHO) declared the Coronavirus disease 2019 (COVID-19) outbreak a global pandemic (Ghebreyesus 2020). Due to a sharp rise in COVID-19 cases in the country, the Malaysian Prime Minister, Tan Sri Mahiaddin Yassin officially implemented the ‘Movement Control Order’ (MCO) on 18 March 2020 (Sukumaran 2020). During this pandemic, schools in Malaysia were closed indefinitely as a way to protect students from COVID-19 infections and to prevent the disease from Yanti Idaya, AMK., et al. (Eds.): ICOLIS 2021, Kuala Lumpur: DLIS, FASS, 2021 25 spreading in the community (Soo 2020). With the closure of all schools, teachers had to use available technologies and conduct teaching via the online platform (Razak 2020). They had to be creative and innovative in overcoming the limitations of online teaching and in ensuring that e-learning was effective (Othman et al. 2020; Pokhrel and Chhetri 2021). Prior to the COVID-19 pandemic, the English Language teachers of SMK Canossian Convent (SMKCC), Kluang conducted face-to-face lessons and occasionally used digital resources available on several platforms, websites or portals such as the Johor English Digital Learning Resources (JEDLR), British Council, Teach-This.com, All Things Grammar and YouTube. These resources had to be carefully selected for lessons to be effective and interesting so as to achieve their learning objectives. During the MCO period, the use of these available digital resources has become even more popular among teachers conducting online teaching as it helps overcome the problem of having limited time to prepare individual online learning content (Huang et al. 2020). Every English Language teacher has their collection of resources stored in electronic folders in the computer or on external hard disk for future use. The English Language teachers faced various technical problems trying to locate the resources that were stored earlier. Firstly, these resources were not localised; they were scattered, and not well- organised. Secondly, no standard metadata was used when storing these resources. Information about the resources was either not stated or was too brief, that is, only the folder name or the file title was available. Thirdly, item retrieval was time-consuming as the teachers had to select the folder, and then browse through the files according to the level and title. When these titles were renamed, the time taken for the retrieval of the items was much longer. Fourthly, redundancy due to multiple storages occurred when teachers saved the resources which already existed in their computers under a different file name. Lastly, the problem of accessibility arose when teachers, while working in school, wanted to access the image, text and video collection of resources stored earlier but did not bring their computer or external hard disk to school. It was apparent that there was a need for a simple yet centralised and integrated system where good and effective teaching and learning resources could be pooled together, systematically stored in a specific location for easy retrieval, and shared among the teachers. Discussions with their English Language Department Head led to the decision to establish a digital repository as a database to store and manage the digital collection of teaching and learning resources. The resources to be included in this repository were Examination Questions, Fun Learning Activities and Writing Genres. Therefore, this research project aims to develop a digital repository prototype using DSpace for the image, text and video collections of teaching and learning resources of the English Language Department of SMKCC Kluang. The development of the digital repository prototype in this research project is to enhance item retrieval of the English Language teaching and learning resources. Yanti Idaya, AMK., et al. (Eds.): ICOLIS 2021, Kuala Lumpur: DLIS, FASS, 2021 2}, {"role": "user", "content_ms": "Bagaimanakah wabak itu membawa kepada pembangunan prototaip repositori digital untuk pengajaran Bahasa Inggeris di SMK Canossian Convent (SMKCC), Kluang, Johor?", "content_jawi": "باڬايماناكه وابق ايت ممباوا كڤد ڤمباڠونن ڤروتوتايڤ ريڤوسيتوري ديڬيتل اونتوق ڤڠاجرن بهاس ايڠڬريس د ايس عيم كي چانوسسين چونۏينت ﴿ايس عيم كيچچ﴾، قلواڠ، جوهر؟"}, {"role": "assistant", "content_ms": "Pandemik COVID-19 membawa kepada pembangunan prototaip repositori digital untuk pengajaran Bahasa Inggeris di SMK Canossian Convent (SMKCC), Kluang, Johor kerana keperluan untuk menutup sekolah selama-lamanya sebagai satu cara untuk melindungi pelajar daripada jangkitan COVID-19 dan mencegah penyakit itu daripada terus merebak dalam masyarakat. Dengan penutupan semua sekolah, guru terpaksa menjalankan pengajaran melalui platform dalam talian, yang memerlukan mereka kreatif dan inovatif dalam mengatasi batasan pengajaran dalam talian dan memastikan e-pembelajaran berkesan.
Sebelum wabak, guru Bahasa Inggeris SMKCC kadangkala menggunakan sumber digital yang terdapat di beberapa platform, laman web atau portal untuk pengajaran bersemuka. Walau bagaimanapun, dalam tempoh PKP, penggunaan sumber digital yang ada ini menjadi lebih popular dalam kalangan guru yang menjalankan pengajaran dalam talian. Setiap guru Bahasa Inggeris mempunyai koleksi sumber mereka yang disimpan dalam folder elektronik dalam komputer atau pada cakera keras luaran untuk kegunaan masa hadapan. Walau bagaimanapun, mereka menghadapi pelbagai masalah teknikal semasa cuba mencari sumber yang disimpan lebih awal, seperti sumber tidak disetempat, tidak tersusun dengan baik, tiada metadata standard digunakan semasa menyimpan sumber ini, pengambilan item memakan masa, lebihan disebabkan pelbagai storan, dan masalah kebolehaksesan.
Untuk menangani isu ini, Ketua Jabatan Bahasa Inggeris dan guru Bahasa Inggeris memutuskan untuk mewujudkan repositori digital sebagai pangkalan data untuk menyimpan dan mengurus koleksi digital sumber pengajaran dan pembelajaran. Sumber yang akan dimasukkan ke dalam repositori ini ialah Soalan Peperiksaan, Aktiviti Pembelajaran Seronok dan Genre Penulisan. Oleh itu, projek penyelidikan ini bertujuan untuk membangunkan prototaip repositori digital menggunakan DSpace untuk koleksi imej, teks dan video sumber pengajaran dan pembelajaran Jabatan Bahasa Inggeris SMKCC Kluang. Pembangunan prototaip repositori digital dalam projek penyelidikan ini adalah untuk meningkatkan perolehan semula item sumber pengajaran dan pembelajaran Bahasa Inggeris, memudahkan guru mengakses dan menggunakan sumber untuk pengajaran dalam talian.", "content_jawi": "ڤندميق چوۏيد-١٩ ممباوا كڤد ڤمباڠونن ڤروتوتايڤ ريڤوسيتوري ديڬيتل اونتوق ڤڠاجرن بهاس ايڠڬريس د ايس عيم كي چانوسسين چونۏينت ﴿ايس عيم كيچچ﴾، قلواڠ، جوهر كران كڤرلوان اونتوق منوتوڤ سكوله سلاما-لاماڽ سباڬاي ساتو چارا اونتوق مليندوڠي ڤلاجر درڤد جڠكيتن چوۏيد-١٩ دان منچڬه ڤڽاكيت ايت درڤد تروس مربق دالم مشاركت. دڠن ڤنوتوڤن سموا سكوله، ڬورو ترڤقسا منجالنكن ڤڠاجرن ملالوءي ڤلاتفورم دالم تالين، يڠ ممرلوكن مريك كرياتيف دان اينوۏاتيف دالم مڠاتسي باتسن ڤڠاجرن دالم تالين دان ممستيكن اي-ڤمبلاجرن بركسن.
سبلوم وابق، ڬورو بهاس ايڠڬريس ايس عيم كيچچ كادڠكالا مڠڬوناكن سومبر ديڬيتل يڠ ترداڤت د ببراڤ ڤلاتفورم، لامن ويب اتاو ڤورتل اونتوق ڤڠاجرن برسموكا. والاو باڬايماناڤون، دالم تيمڤوه ڤقڤ، ڤڠڬوناان سومبر ديڬيتل يڠ اد اين منجادي لبيه ڤوڤولر دالم كالڠن ڬورو يڠ منجالنكن ڤڠاجرن دالم تالين. ستياڤ ڬورو بهاس ايڠڬريس ممڤوڽاءي كوليكسي سومبر مريك يڠ دسيمڤن دالم فولدر ايليكترونيك دالم كومڤوتر اتاو ڤد چاكرا كراس لواران اونتوق كڬونأن ماس هادڤن. والاو باڬايماناڤون، مريك مڠهادڤي ڤلباڬاي مسئله




