{"id":2913,"date":"2023-07-20T03:12:14","date_gmt":"2023-07-20T03:12:14","guid":{"rendered":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/"},"modified":"2023-07-20T03:12:14","modified_gmt":"2023-07-20T03:12:14","slug":"pencocokan-fuzzy-di-r","status":"publish","type":"post","link":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/","title":{"rendered":"Cara melakukan pencocokan fuzzy di r (dengan contoh)"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\">Seringkali Anda mungkin ingin menggabungkan dua kumpulan data di R berdasarkan string yang tidak cocok secara sempurna. Hal ini kadang-kadang disebut <strong>pencocokan fuzzy<\/strong> .<\/span><\/p>\n<p> <span style=\"color: #000000;\">Cara termudah untuk melakukan pencocokan fuzzy di R adalah dengan menggunakan fungsi <strong>stringdist_join()<\/strong> dari paket <strong>fuzzyjoin<\/strong> .<\/span><\/p>\n<p> <span style=\"color: #000000;\">Contoh berikut menunjukkan cara menggunakan fungsi ini dalam praktiknya.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Contoh: pencocokan fuzzy di R<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\"><span style=\"color: #000000;\">Misalkan kita memiliki dua bingkai data berikut di R yang berisi informasi tentang berbagai tim bola basket:<\/span><\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#create data frames<\/span>\ndf1 &lt;- data. <span style=\"color: #3366ff;\">frame<\/span> (team=c('Mavericks', 'Nets', 'Warriors', 'Heat', 'Lakers'),\n                  points=c(99, 90, 104, 117, 100))\ndf2 &lt;- data. <span style=\"color: #3366ff;\">frame<\/span> (team=c('Mavricks', 'Warrors', 'Heat', 'Netts', 'Kings', 'Lakes'),\n                  assists=c(22, 29, 17, 40, 32, 30))\n\n<span style=\"color: #008080;\">#view data frames\n<\/span><span style=\"color: #008000;\">print<\/span> (df1)\n\n       team points\n1 Mavericks 99\n2 Nets 90\n3 Warriors 104\n4 Heat 117\n5 Lakers 100\n\n<span style=\"color: #008000;\">print<\/span> (df2)\n\n      team assists\n1 Mavricks 22\n2 Warriors 29\n3 Heat 17\n4 Netts 40\n5 Kings 32\n6 Lakes 30\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Sekarang katakanlah kita ingin melakukan <strong>penggabungan kiri<\/strong> di mana kita menyimpan semua baris dari bingkai data pertama dan menggabungkannya berdasarkan nama tim yang paling cocok dengan bingkai data kedua.<\/span><\/p>\n<p> <span style=\"color: #000000;\">Kita dapat menggunakan kode berikut untuk melakukan ini:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong><span style=\"color: #008000;\">library<\/span> (fuzzyjoin)\n<span style=\"color: #008000;\">library<\/span> (dplyr)\n\n<span style=\"color: #008080;\">#perform fuzzy matching left join\n<\/span>stringdist_join(df1, df2, \n                by=' <span style=\"color: #ff0000;\">team<\/span> ', <span style=\"color: #008080;\">#match based on team<\/span>\n                mode=' <span style=\"color: #ff0000;\">left<\/span> ', <span style=\"color: #008080;\">#use left join<\/span>\n                method = \" <span style=\"color: #ff0000;\">jw<\/span> \", <span style=\"color: #008080;\">#use jw distance metric<\/span>\n                max_dist=99, \n                distance_col=' <span style=\"color: #ff0000;\">dist<\/span> ') %&gt;%\n  group_by(team.x) %&gt;%\n  slice_min(order_by=dist, n= <span style=\"color: #008000;\">1<\/span> )\n\n# A tibble: 5 x 5\n# Groups: team.x [5]\n  team.x points team.y assists dist\n                 \n1 Heat 117 Heat 17 0     \n2 Lakers 100 Lakes 30 0.0556\n3 Mavericks 99 Mavricks 22 0.0370\n4 Nets 90 Nets 40 0.0667\n5 Warriors 104 Warriors 29 0.0417\n<\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\">Hasilnya adalah bingkai data yang berisi masing-masing dari lima nama tim asli dari bingkai data pertama serta tim yang paling cocok dengan bingkai data kedua.<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>Catatan #1<\/strong> : Kami memilih untuk menggunakan metrik jarak <strong>jw<\/strong> untuk pencocokan. Ini adalah kependekan dari <a href=\"https:\/\/en.wikipedia.org\/wiki\/Jaro%E2%80%93Winkler_distance\" target=\"_blank\" rel=\"noopener\">jarak Jaro-Winkler<\/a> , yang merupakan metrik yang mengukur perbedaan antara dua string.<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>Catatan #2:<\/strong> Kami menggunakan fungsi <strong>slice_min()<\/strong> dari paket dplyr untuk hanya menampilkan nama tim dari bingkai data kedua yang paling cocok dengan nama tim dari bingkai data pertama.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Sumber daya tambahan<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Tutorial berikut menjelaskan cara melakukan tugas umum lainnya di R:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/id\/menggabungkan-beberapa-bingkai-data-di-r\/\" target=\"_blank\" rel=\"noopener\">Cara menggabungkan beberapa frame data di R<\/a><br \/> Cara menggabungkan bingkai data berdasarkan beberapa kolom di R<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Seringkali Anda mungkin ingin menggabungkan dua kumpulan data di R berdasarkan string yang tidak cocok secara sempurna. Hal ini kadang-kadang disebut pencocokan fuzzy . Cara termudah untuk melakukan pencocokan fuzzy di R adalah dengan menggunakan fungsi stringdist_join() dari paket fuzzyjoin . Contoh berikut menunjukkan cara menggunakan fungsi ini dalam praktiknya. Contoh: pencocokan fuzzy di R [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Cara Melakukan Pencocokan Fuzzy di R (dengan Contoh) - Statorial<\/title>\n<meta name=\"description\" content=\"Tutorial ini menjelaskan cara melakukan pencocokan fuzzy antara dua dataset di R, dengan sebuah contoh.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/\" \/>\n<meta property=\"og:locale\" content=\"id_ID\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cara Melakukan Pencocokan Fuzzy di R (dengan Contoh) - Statorial\" \/>\n<meta property=\"og:description\" content=\"Tutorial ini menjelaskan cara melakukan pencocokan fuzzy antara dua dataset di R, dengan sebuah contoh.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-20T03:12:14+00:00\" \/>\n<meta name=\"author\" content=\"Benjamin anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Ditulis oleh\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimasi waktu membaca\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 menit\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/\",\"url\":\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/\",\"name\":\"Cara Melakukan Pencocokan Fuzzy di R (dengan Contoh) - Statorial\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/id\/#website\"},\"datePublished\":\"2023-07-20T03:12:14+00:00\",\"dateModified\":\"2023-07-20T03:12:14+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\"},\"description\":\"Tutorial ini menjelaskan cara melakukan pencocokan fuzzy antara dua dataset di R, dengan sebuah contoh.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/#breadcrumb\"},\"inLanguage\":\"id\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/statorials.org\/id\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cara melakukan pencocokan fuzzy di r (dengan contoh)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/id\/#website\",\"url\":\"https:\/\/statorials.org\/id\/\",\"name\":\"Statorials\",\"description\":\"Panduan anda untuk kompetensi statistik!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/id\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"id\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\",\"name\":\"Benjamin anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"id\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin anderson\"},\"description\":\"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya\",\"sameAs\":[\"http:\/\/statorials.org\/id\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Cara Melakukan Pencocokan Fuzzy di R (dengan Contoh) - Statorial","description":"Tutorial ini menjelaskan cara melakukan pencocokan fuzzy antara dua dataset di R, dengan sebuah contoh.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/","og_locale":"id_ID","og_type":"article","og_title":"Cara Melakukan Pencocokan Fuzzy di R (dengan Contoh) - Statorial","og_description":"Tutorial ini menjelaskan cara melakukan pencocokan fuzzy antara dua dataset di R, dengan sebuah contoh.","og_url":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/","og_site_name":"Statorials","article_published_time":"2023-07-20T03:12:14+00:00","author":"Benjamin anderson","twitter_card":"summary_large_image","twitter_misc":{"Ditulis oleh":"Benjamin anderson","Estimasi waktu membaca":"2 menit"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/","url":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/","name":"Cara Melakukan Pencocokan Fuzzy di R (dengan Contoh) - Statorial","isPartOf":{"@id":"https:\/\/statorials.org\/id\/#website"},"datePublished":"2023-07-20T03:12:14+00:00","dateModified":"2023-07-20T03:12:14+00:00","author":{"@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81"},"description":"Tutorial ini menjelaskan cara melakukan pencocokan fuzzy antara dua dataset di R, dengan sebuah contoh.","breadcrumb":{"@id":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/#breadcrumb"},"inLanguage":"id","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/id\/pencocokan-fuzzy-di-r\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/statorials.org\/id\/"},{"@type":"ListItem","position":2,"name":"Cara melakukan pencocokan fuzzy di r (dengan contoh)"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/id\/#website","url":"https:\/\/statorials.org\/id\/","name":"Statorials","description":"Panduan anda untuk kompetensi statistik!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/id\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"id"},{"@type":"Person","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81","name":"Benjamin anderson","image":{"@type":"ImageObject","inLanguage":"id","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Benjamin anderson"},"description":"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya","sameAs":["http:\/\/statorials.org\/id"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2913"}],"collection":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/comments?post=2913"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2913\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/media?parent=2913"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/categories?post=2913"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/tags?post=2913"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}