{"id":3315,"date":"2023-07-18T03:25:46","date_gmt":"2023-07-18T03:25:46","guid":{"rendered":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/"},"modified":"2023-07-18T03:25:46","modified_gmt":"2023-07-18T03:25:46","slug":"panda-menghitung-duplikat","status":"publish","type":"post","link":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/","title":{"rendered":"Cara menghitung duplikat di panda: dengan contoh"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\">Anda dapat menggunakan metode berikut untuk menghitung duplikat di pandas DataFrame:<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>Metode 1: Hitung nilai duplikat dalam kolom<\/strong><\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">len<\/span> (df[' <span style=\"color: #ff0000;\">my_column<\/span> ']) - <span style=\"color: #008000;\">len<\/span> (df[' <span style=\"color: #ff0000;\">my_column<\/span> ']. <span style=\"color: #3366ff;\">drop_duplicates<\/span> ())\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\"><strong>Metode 2: Hitung baris duplikat<\/strong><\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong><span style=\"color: #008000;\">len<\/span> (df) <span style=\"color: #008000;\">-len<\/span> ( <span style=\"color: #3366ff;\">df.drop_duplicates<\/span> ())<\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\"><strong>Metode 3: Hitung duplikat untuk setiap baris unik<\/strong><\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong>df. <span style=\"color: #3366ff;\">groupby<\/span> ( <span style=\"color: #3366ff;\">df.columns.tolist<\/span> (), as_index= <span style=\"color: #008000;\">False<\/span> ) <span style=\"color: #3366ff;\">.<\/span> <span style=\"color: #3366ff;\">size<\/span> ()<\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\">Contoh berikut menunjukkan cara menggunakan setiap metode dalam praktik dengan pandas DataFrame berikut:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> pandas <span style=\"color: #008000;\">as<\/span> pd\n<span style=\"color: #008080;\">\n#createDataFrame\n<span style=\"color: #000000;\">df = pd. <span style=\"color: #3366ff;\">DataFrame<\/span> ({' <span style=\"color: #ff0000;\">team<\/span> ': ['A', 'A', 'A', 'A', 'B', 'B', 'B', 'B'],\n                   ' <span style=\"color: #ff0000;\">position<\/span> ': ['G', 'G', 'G', 'F', 'G', 'G', 'F', 'F'],\n                   ' <span style=\"color: #ff0000;\">points<\/span> ': [5, 5, 8, 10, 5, 7, 10, 10]})\n\n<span style=\"color: #008080;\">#view DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span> (df)\n\n  team position points\n0 AG 5\n1 GA 5\n2 AG 8\n3 AF 10\n4 BG 5\n5 BG 7\n6 BF 10\n7 BF 10<\/span><\/span><\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Contoh 1: Hitung nilai duplikat dalam sebuah kolom<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">Kode berikut menunjukkan cara menghitung jumlah nilai duplikat pada kolom <strong>poin<\/strong> :<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#count duplicate values in points column\n<\/span><span style=\"color: #008000;\">len<\/span> (df[' <span style=\"color: #ff0000;\">points<\/span> '])- <span style=\"color: #008000;\">len<\/span> (df[' <span style=\"color: #ff0000;\">points<\/span> ']. <span style=\"color: #3366ff;\">drop_duplicates<\/span> ())\n\n4<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Terlihat ada <strong>4<\/strong> nilai duplikat pada kolom <strong>poin<\/strong> .<\/span><\/p>\n<h2> <span style=\"color: #000000;\"><strong>Contoh 2: Hitung baris duplikat<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">Kode berikut menunjukkan cara menghitung jumlah baris duplikat di DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong><span style=\"color: #008000;\"><span style=\"color: #008080;\">#count number of duplicate rows<\/span>\nlen<\/span> (df) <span style=\"color: #008000;\">-len<\/span> ( <span style=\"color: #3366ff;\">df.drop_duplicates<\/span> ())\n\n2<\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\">Kita dapat melihat ada <strong>2<\/strong> baris duplikat di DataFrame.<\/span><\/p>\n<p> <span style=\"color: #000000;\">Kita dapat menggunakan sintaks berikut untuk menampilkan 2 baris duplikat ini:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong><span style=\"color: #008000;\"><span style=\"color: #008080;\">#display duplicated rows<\/span>\n<span style=\"color: #000000;\">df[df. <span style=\"color: #3366ff;\">duplicated<\/span> ()]\n\n        team position points\n1 A G 5\n7 B F 10<\/span>\n<\/span><\/strong><\/span><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Contoh 3: Hitung duplikat untuk setiap baris unik<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">Kode berikut menunjukkan cara menghitung jumlah duplikat untuk setiap baris unik di DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong><span style=\"color: #008000;\"><span style=\"color: #008080;\">#display number of duplicates for each unique row\n<span style=\"color: #000000;\">df. <span style=\"color: #3366ff;\">groupby<\/span> ( <span style=\"color: #3366ff;\">df.columns.tolist<\/span> (), as_index= <span style=\"color: #008000;\">False<\/span> ) <span style=\"color: #3366ff;\">.<\/span> <span style=\"color: #3366ff;\">size<\/span> ()\n\n        team position points size\n0 A F 10 1\n1 A G 5 2\n2 A G 8 1\n3 B F 10 2\n4 B G 5 1\n5 B G 7 1\n<\/span><\/span><\/span><\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\"><span style=\"color: #000000;\">Kolom <strong>Ukuran<\/strong> menampilkan jumlah duplikat untuk setiap baris unik.<\/span><\/span><\/p>\n<h2> <span style=\"color: #000000;\"><strong>Sumber daya tambahan<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">Tutorial berikut menjelaskan cara melakukan operasi umum lainnya di panda:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/id\/panda-menjatuhkan-duplikat\/\" target=\"_blank\" rel=\"noopener\">Cara menghapus baris duplikat di Pandas<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/panda-menjatuhkan-kolom-duplikat\/\" target=\"_blank\" rel=\"noopener\">Cara menghapus kolom duplikat di Pandas<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/panda-pilih-kolom-berdasarkan-indeks\/\" target=\"_blank\" rel=\"noopener\">Cara memilih kolom berdasarkan indeks di Pandas<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Anda dapat menggunakan metode berikut untuk menghitung duplikat di pandas DataFrame: Metode 1: Hitung nilai duplikat dalam kolom len (df[&#8216; my_column &#8216;]) &#8211; len (df[&#8216; my_column &#8216;]. drop_duplicates ()) Metode 2: Hitung baris duplikat len (df) -len ( df.drop_duplicates ()) Metode 3: Hitung duplikat untuk setiap baris unik df. groupby ( df.columns.tolist (), as_index= False [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Cara menghitung duplikat di panda (dengan contoh) \u2013 Statologi<\/title>\n<meta name=\"description\" content=\"Tutorial ini menjelaskan cara menghitung duplikat di pandas DataFrame, dengan beberapa contoh.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/\" \/>\n<meta property=\"og:locale\" content=\"id_ID\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cara menghitung duplikat di panda (dengan contoh) \u2013 Statologi\" \/>\n<meta property=\"og:description\" content=\"Tutorial ini menjelaskan cara menghitung duplikat di pandas DataFrame, dengan beberapa contoh.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-18T03:25:46+00:00\" \/>\n<meta name=\"author\" content=\"Benjamin anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Ditulis oleh\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimasi waktu membaca\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 menit\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/\",\"url\":\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/\",\"name\":\"Cara menghitung duplikat di panda (dengan contoh) \u2013 Statologi\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/id\/#website\"},\"datePublished\":\"2023-07-18T03:25:46+00:00\",\"dateModified\":\"2023-07-18T03:25:46+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\"},\"description\":\"Tutorial ini menjelaskan cara menghitung duplikat di pandas DataFrame, dengan beberapa contoh.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/#breadcrumb\"},\"inLanguage\":\"id\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/statorials.org\/id\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cara menghitung duplikat di panda: dengan contoh\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/id\/#website\",\"url\":\"https:\/\/statorials.org\/id\/\",\"name\":\"Statorials\",\"description\":\"Panduan anda untuk kompetensi statistik!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/id\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"id\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\",\"name\":\"Benjamin anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"id\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin anderson\"},\"description\":\"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya\",\"sameAs\":[\"http:\/\/statorials.org\/id\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Cara menghitung duplikat di panda (dengan contoh) \u2013 Statologi","description":"Tutorial ini menjelaskan cara menghitung duplikat di pandas DataFrame, dengan beberapa contoh.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/","og_locale":"id_ID","og_type":"article","og_title":"Cara menghitung duplikat di panda (dengan contoh) \u2013 Statologi","og_description":"Tutorial ini menjelaskan cara menghitung duplikat di pandas DataFrame, dengan beberapa contoh.","og_url":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/","og_site_name":"Statorials","article_published_time":"2023-07-18T03:25:46+00:00","author":"Benjamin anderson","twitter_card":"summary_large_image","twitter_misc":{"Ditulis oleh":"Benjamin anderson","Estimasi waktu membaca":"2 menit"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/","url":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/","name":"Cara menghitung duplikat di panda (dengan contoh) \u2013 Statologi","isPartOf":{"@id":"https:\/\/statorials.org\/id\/#website"},"datePublished":"2023-07-18T03:25:46+00:00","dateModified":"2023-07-18T03:25:46+00:00","author":{"@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81"},"description":"Tutorial ini menjelaskan cara menghitung duplikat di pandas DataFrame, dengan beberapa contoh.","breadcrumb":{"@id":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/#breadcrumb"},"inLanguage":"id","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/id\/panda-menghitung-duplikat\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/statorials.org\/id\/"},{"@type":"ListItem","position":2,"name":"Cara menghitung duplikat di panda: dengan contoh"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/id\/#website","url":"https:\/\/statorials.org\/id\/","name":"Statorials","description":"Panduan anda untuk kompetensi statistik!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/id\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"id"},{"@type":"Person","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81","name":"Benjamin anderson","image":{"@type":"ImageObject","inLanguage":"id","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Benjamin anderson"},"description":"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya","sameAs":["http:\/\/statorials.org\/id"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/3315"}],"collection":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/comments?post=3315"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/3315\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/media?parent=3315"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/categories?post=3315"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/tags?post=3315"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}