{"id":2522,"date":"2023-07-21T22:03:56","date_gmt":"2023-07-21T22:03:56","guid":{"rendered":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/"},"modified":"2023-07-21T22:03:56","modified_gmt":"2023-07-21T22:03:56","slug":"panda-menemukan-duplikat","status":"publish","type":"post","link":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/","title":{"rendered":"Cara menemukan duplikat di pandas dataframe (dengan contoh)"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\">Anda dapat menggunakan fungsi <a href=\"https:\/\/pandas.pydata.org\/docs\/reference\/api\/pandas.DataFrame.duplicated.html\" target=\"_blank\" rel=\"noopener\">duplikat()<\/a> untuk menemukan nilai duplikat di DataFrame pandas.<\/span><\/p>\n<p> <span style=\"color: #000000;\">Fungsi ini menggunakan sintaks dasar berikut:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#find duplicate rows across all columns<\/span>\nduplicateRows = df[df. <span style=\"color: #3366ff;\">duplicated<\/span> ()]\n\n<span style=\"color: #008080;\">#find duplicate rows across specific columns<\/span>\nduplicateRows = df[df. <span style=\"color: #3366ff;\">duplicated<\/span> ([' <span style=\"color: #ff0000;\">col1<\/span> ',' <span style=\"color: #ff0000;\">col2<\/span> '])]\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Contoh berikut menunjukkan cara menggunakan fungsi ini dalam praktik dengan pandas DataFrame berikut:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> pandas <span style=\"color: #008000;\">as<\/span> pd\n\n<span style=\"color: #008080;\">#createDataFrame\n<span style=\"color: #000000;\">df = pd. <span style=\"color: #3366ff;\">DataFrame<\/span> ({' <span style=\"color: #ff0000;\">team<\/span> ': ['A', 'A', 'A', 'A', 'B', 'B', 'B', 'B'],\n                   ' <span style=\"color: #ff0000;\">points<\/span> ': [10, 10, 12, 12, 15, 17, 20, 20],\n                   ' <span style=\"color: #ff0000;\">assists<\/span> ': [5, 5, 7, 9, 12, 9, 6, 6]})\n\n<span style=\"color: #008080;\">#view DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span> (df)\n\n  team points assists\n0 to 10 5\n1 to 10 5\n2 to 12 7\n3 to 12 9\n4 B 15 12\n5 B 17 9\n6 B 20 6\n7 B 20 6\n<\/span><\/span><\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Contoh 1: Temukan baris duplikat di semua kolom<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Kode berikut menunjukkan cara menemukan baris duplikat di semua kolom DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#identify duplicate rows\n<\/span>duplicateRows = df[df. <span style=\"color: #3366ff;\">duplicated<\/span> ()]\n\n<span style=\"color: #008080;\">#view duplicate rows\n<\/span>duplicateRows\n\n        team points assists\n1 to 10 5\n7 B 20 6\n<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Ada dua baris yang merupakan duplikat persis dari baris lain di DataFrame.<\/span><\/p>\n<p> <span style=\"color: #000000;\">Perhatikan bahwa kita juga dapat menggunakan argumen <strong>keep=&#8217;last&#8217;<\/strong> untuk menampilkan baris duplikat pertama, bukan baris terakhir:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#identify duplicate rows\n<\/span>duplicateRows = df[df. <span style=\"color: #3366ff;\">duplicated<\/span> (keep=' <span style=\"color: #ff0000;\">last<\/span> ')]\n\n<span style=\"color: #008080;\">#view duplicate rows\n<\/span><span style=\"color: #008000;\">print<\/span> (duplicateRows)\n\n\tteam points assists\n0 to 10 5\n6 B 20 6\n<\/span><\/span><\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Contoh 2: Temukan baris duplikat di kolom tertentu<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Kode berikut menunjukkan cara menemukan baris duplikat hanya di kolom &#8220;tim&#8221; dan &#8220;poin&#8221; pada DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#identify duplicate rows across 'team' and 'points' columns\n<\/span>duplicateRows = df[df. <span style=\"color: #3366ff;\">duplicated<\/span> ([' <span style=\"color: #ff0000;\">team<\/span> ',' <span style=\"color: #ff0000;\">points<\/span> '])]\n\n<span style=\"color: #008080;\">#view duplicate rows\n<\/span><span style=\"color: #008000;\">print<\/span> (duplicateRows)\n\n        team points assists\n1 to 10 5\n3 to 12 9\n7 B 20 6\n<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Ada tiga baris yang nilai pada kolom \u201ctim\u201d dan \u201cpoin\u201d sama persis dengan baris sebelumnya.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Contoh 3: Temukan baris duplikat dalam sebuah kolom<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Kode berikut menunjukkan cara menemukan baris duplikat hanya di kolom &#8220;tim&#8221; DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#identify duplicate rows in 'team' column\n<\/span>duplicateRows = df[df. <span style=\"color: #3366ff;\">duplicated<\/span> ([' <span style=\"color: #ff0000;\">team<\/span> '])]\n\n<span style=\"color: #008080;\">#view duplicate rows\n<\/span><span style=\"color: #008000;\">print<\/span> (duplicateRows)\n\n\tteam points assists\n1 to 10 5\n2 to 12 7\n3 to 12 9\n5 B 17 9\n6 B 20 6\n7 B 20 6\n<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Ada total enam baris yang nilai pada kolom &#8220;tim&#8221; sama persis dengan baris sebelumnya.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Sumber daya tambahan<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Tutorial berikut menjelaskan cara melakukan operasi umum lainnya di panda:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/id\/panda-menjatuhkan-duplikat\/\" target=\"_blank\" rel=\"noopener\">Cara menghapus baris duplikat di Pandas<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/panda-menjatuhkan-kolom-duplikat\/\" target=\"_blank\" rel=\"noopener\">Cara menghapus kolom duplikat di Pandas<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/panda-pilih-kolom-berdasarkan-indeks\/\" target=\"_blank\" rel=\"noopener\">Cara memilih kolom berdasarkan indeks di Pandas<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Anda dapat menggunakan fungsi duplikat() untuk menemukan nilai duplikat di DataFrame pandas. Fungsi ini menggunakan sintaks dasar berikut: #find duplicate rows across all columns duplicateRows = df[df. duplicated ()] #find duplicate rows across specific columns duplicateRows = df[df. duplicated ([&#8216; col1 &#8216;,&#8217; col2 &#8216;])] Contoh berikut menunjukkan cara menggunakan fungsi ini dalam praktik dengan pandas [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Cara menemukan duplikat di Pandas DataFrame (dengan contoh) \u2013 Statorials<\/title>\n<meta name=\"description\" content=\"Tutorial ini menjelaskan cara menemukan elemen duplikat di pandas DataFrame, dengan beberapa contoh.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/\" \/>\n<meta property=\"og:locale\" content=\"id_ID\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cara menemukan duplikat di Pandas DataFrame (dengan contoh) \u2013 Statorials\" \/>\n<meta property=\"og:description\" content=\"Tutorial ini menjelaskan cara menemukan elemen duplikat di pandas DataFrame, dengan beberapa contoh.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-21T22:03:56+00:00\" \/>\n<meta name=\"author\" content=\"Benjamin anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Ditulis oleh\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimasi waktu membaca\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 menit\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/\",\"url\":\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/\",\"name\":\"Cara menemukan duplikat di Pandas DataFrame (dengan contoh) \u2013 Statorials\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/id\/#website\"},\"datePublished\":\"2023-07-21T22:03:56+00:00\",\"dateModified\":\"2023-07-21T22:03:56+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\"},\"description\":\"Tutorial ini menjelaskan cara menemukan elemen duplikat di pandas DataFrame, dengan beberapa contoh.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/#breadcrumb\"},\"inLanguage\":\"id\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/statorials.org\/id\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cara menemukan duplikat di pandas dataframe (dengan contoh)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/id\/#website\",\"url\":\"https:\/\/statorials.org\/id\/\",\"name\":\"Statorials\",\"description\":\"Panduan anda untuk kompetensi statistik!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/id\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"id\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\",\"name\":\"Benjamin anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"id\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin anderson\"},\"description\":\"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya\",\"sameAs\":[\"http:\/\/statorials.org\/id\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Cara menemukan duplikat di Pandas DataFrame (dengan contoh) \u2013 Statorials","description":"Tutorial ini menjelaskan cara menemukan elemen duplikat di pandas DataFrame, dengan beberapa contoh.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/","og_locale":"id_ID","og_type":"article","og_title":"Cara menemukan duplikat di Pandas DataFrame (dengan contoh) \u2013 Statorials","og_description":"Tutorial ini menjelaskan cara menemukan elemen duplikat di pandas DataFrame, dengan beberapa contoh.","og_url":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/","og_site_name":"Statorials","article_published_time":"2023-07-21T22:03:56+00:00","author":"Benjamin anderson","twitter_card":"summary_large_image","twitter_misc":{"Ditulis oleh":"Benjamin anderson","Estimasi waktu membaca":"2 menit"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/","url":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/","name":"Cara menemukan duplikat di Pandas DataFrame (dengan contoh) \u2013 Statorials","isPartOf":{"@id":"https:\/\/statorials.org\/id\/#website"},"datePublished":"2023-07-21T22:03:56+00:00","dateModified":"2023-07-21T22:03:56+00:00","author":{"@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81"},"description":"Tutorial ini menjelaskan cara menemukan elemen duplikat di pandas DataFrame, dengan beberapa contoh.","breadcrumb":{"@id":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/#breadcrumb"},"inLanguage":"id","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/id\/panda-menemukan-duplikat\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/statorials.org\/id\/"},{"@type":"ListItem","position":2,"name":"Cara menemukan duplikat di pandas dataframe (dengan contoh)"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/id\/#website","url":"https:\/\/statorials.org\/id\/","name":"Statorials","description":"Panduan anda untuk kompetensi statistik!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/id\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"id"},{"@type":"Person","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81","name":"Benjamin anderson","image":{"@type":"ImageObject","inLanguage":"id","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Benjamin anderson"},"description":"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya","sameAs":["http:\/\/statorials.org\/id"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2522"}],"collection":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/comments?post=2522"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2522\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/media?parent=2522"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/categories?post=2522"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/tags?post=2522"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}