{"id":2517,"date":"2023-07-21T22:38:17","date_gmt":"2023-07-21T22:38:17","guid":{"rendered":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/"},"modified":"2023-07-21T22:38:17","modified_gmt":"2023-07-21T22:38:17","slug":"pengelompokan-data-dengan-python","status":"publish","type":"post","link":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/","title":{"rendered":"How to Perform Data Binning in Python (With Examples)"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\">You can use the following basic syntax to perform data binning on a pandas DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> pandas <span style=\"color: #008000;\">as<\/span> pd\n\n<span style=\"color: #008080;\">#perform binning with 3 bins<\/span>\ndf['<span style=\"color: #ff0000;\">new_bin<\/span>'] = pd.<span style=\"color: #3366ff;\">qcut<\/span>(df['<span style=\"color: #ff0000;\">variable_name<\/span>'], q=<span style=\"color: #008000;\">3<\/span>)\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">The following examples show how to use this syntax in practice with the following pandas DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008000;\">import<\/span> pandas <span style=\"color: #008000;\">as<\/span> pd\n\n<span style=\"color: #008080;\">#create DataFrame\n<\/span>df = pd. 
<span style=\"color: #3366ff;\">DataFrame<\/span>({'<span style=\"color: #ff0000;\">points<\/span>': [4, 4, 7, 8, 12, 13, 15, 18, 22, 23, 23, 25],\n                   '<span style=\"color: #ff0000;\">assists<\/span>': [2, 5, 4, 7, 7, 8, 5, 4, 5, 11, 13, 8],\n                   '<span style=\"color: #ff0000;\">rebounds<\/span>': [7, 7, 4, 6, 3, 8, 9, 9, 12, 11, 8, 9]})\n\n<span style=\"color: #008080;\">#view DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(df)\n\n    points  assists  rebounds\n0        4        2         7\n1        4        5         7\n2        7        4         4\n3        8        7         6\n4       12        7         3\n5       13        8         8\n6       15        5         9\n7       18        4         9\n8       22        5        12\n9       23       11        11\n10      23       13         8\n11      25        8         9\n<\/span><\/span><\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Example 1: Perform Basic Data Binning<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">The following code shows how to perform data binning on the <strong>points<\/strong> variable using the <a href=\"https:\/\/pandas.pydata.org\/docs\/reference\/api\/pandas.qcut.html\" target=\"_blank\" rel=\"noopener\">qcut()<\/a> function with <strong>q=3<\/strong>, which splits the values into three quantile-based bins of roughly equal size:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#perform data binning on variable <em>points<\/em>\n<\/span>df['<span style=\"color: #ff0000;\">points_bin<\/span>'] = pd. 
<span style=\"color: #3366ff;\">qcut<\/span>(df['<span style=\"color: #ff0000;\">points<\/span>'], q=<span style=\"color: #008000;\">3<\/span>)\n\n<span style=\"color: #008080;\">#view updated DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(df)\n\n    points  assists  rebounds        points_bin\n0        4        2         7   (3.999, 10.667]\n1        4        5         7   (3.999, 10.667]\n2        7        4         4   (3.999, 10.667]\n3        8        7         6   (3.999, 10.667]\n4       12        7         3  (10.667, 19.333]\n5       13        8         8  (10.667, 19.333]\n6       15        5         9  (10.667, 19.333]\n7       18        4         9  (10.667, 19.333]\n8       22        5        12    (19.333, 25.0]\n9       23       11        11    (19.333, 25.0]\n10      23       13         8    (19.333, 25.0]\n11      25        8         9    (19.333, 25.0]\n<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Notice that each row of the DataFrame has been placed into one of three bins based on the value in the points column.<\/span><\/p>\n<p> <span style=\"color: #000000;\">We can use the <strong>value_counts()<\/strong> function to find how many rows were placed in each bin:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#count frequency of each bin\n<\/span>df['<span style=\"color: #ff0000;\">points_bin<\/span>']. 
<span style=\"color: #3366ff;\">value_counts<\/span>()\n\n(3.999, 10.667]     4\n(10.667, 19.333]    4\n(19.333, 25.0]      4\nName: points_bin, dtype: int64\n<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">We can see that each bin contains 4 observations.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Example 2: Perform Data Binning with Specific Quantiles<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">We can also perform data binning by using specific quantiles:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#perform data binning on variable <em>points<\/em> with specific quantiles\n<\/span>df['<span style=\"color: #ff0000;\">points_bin<\/span>'] = pd.<span style=\"color: #3366ff;\">qcut<\/span>(df['<span style=\"color: #ff0000;\">points<\/span>'], q=[0, .2, .4, .6, .8, 1])\n\n<span style=\"color: #008080;\">#view updated DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(df)\n\n    points  assists  rebounds    points_bin\n0        4        2         7  (3.999, 7.2]\n1        4        5         7  (3.999, 7.2]\n2        7        4         4  (3.999, 7.2]\n3        8        7         6   (7.2, 12.4]\n4       12        7         3   (7.2, 12.4]\n5       13        8         8  (12.4, 16.8]\n6       15        5         9  (12.4, 16.8]\n7       18        4         9  (16.8, 22.8]\n8       22        5        12  (16.8, 22.8]\n9       23       11        11  (22.8, 25.0]\n10      23       13         8  (22.8, 25.0]\n11      25        8         9  (22.8, 25.0]\n<\/span><\/span><\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Example 3:<\/strong><\/span> <span style=\"color: #000000;\"><strong>Perform Data Binning with Labels<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">We can also perform data binning by using specific quantiles and specific labels:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#perform data binning on <em>points<\/em> variable with 
specific quantiles and labels\n<\/span>df['<span style=\"color: #ff0000;\">points_bin<\/span>'] = pd.<span style=\"color: #3366ff;\">qcut<\/span>(df['<span style=\"color: #ff0000;\">points<\/span>'],\n                           q=[0, .2, .4, .6, .8, 1],\n                           labels=['<span style=\"color: #ff0000;\">A<\/span>', '<span style=\"color: #ff0000;\">B<\/span>', '<span style=\"color: #ff0000;\">C<\/span>', '<span style=\"color: #ff0000;\">D<\/span>', '<span style=\"color: #ff0000;\">E<\/span>'])\n\n<span style=\"color: #008080;\">#view updated DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(df)\n\n    points  assists  rebounds  points_bin\n0        4        2         7           A\n1        4        5         7           A\n2        7        4         4           A\n3        8        7         6           B\n4       12        7         3           B\n5       13        8         8           C\n6       15        5         9           C\n7       18        4         9           D\n8       22        5        12           D\n9       23       11        11           E\n10      23       13         8           E\n11      25        8         9           E\n<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Notice that each row has been assigned a bin based on the value in the <strong>points<\/strong> column, and the bins have been labeled using letters.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Additional Resources<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">The following tutorials explain how to perform other common tasks in pandas:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/id\/nilai_hitungan-panda\/\" target=\"_blank\" rel=\"noopener\">Pandas: How to Use the value_counts() Function<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/jumlah-tabel-pivot-panda\/\" target=\"_blank\" rel=\"noopener\">Pandas: How to Create a Pivot Table with Count of Values<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/pandas-menghitung-nilai-tertentu-di-kolom\/\" target=\"_blank\" rel=\"noopener\">Pandas: How to Count Occurrences of a Specific Value in a Column<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Anda dapat menggunakan sintaks dasar berikut untuk melakukan pengelompokan data pada pandas DataFrame: import pandas as pd #perform 
binning with 3 bins df[&#8216; new_bin &#8216;] = pd. qcut (df[&#8216; variable_name &#8216;], q= 3 ) Contoh berikut menunjukkan cara menggunakan sintaksis ini dalam praktiknya dengan pandas DataFrame berikut: import pandas as pd #createDataFrame df = pd. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Cara Melakukan Clustering Data dengan Python (dengan Contoh) - Statologi<\/title>\n<meta name=\"description\" content=\"Tutorial ini menjelaskan cara melakukan pengelompokan data dengan Python, dengan beberapa contoh.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/\" \/>\n<meta property=\"og:locale\" content=\"id_ID\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cara Melakukan Clustering Data dengan Python (dengan Contoh) - Statologi\" \/>\n<meta property=\"og:description\" content=\"Tutorial ini menjelaskan cara melakukan pengelompokan data dengan Python, dengan beberapa contoh.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-21T22:38:17+00:00\" \/>\n<meta name=\"author\" content=\"Benjamin anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Ditulis oleh\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimasi waktu membaca\" 
\/>\n\t<meta name=\"twitter:data2\" content=\"2 menit\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/\",\"url\":\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/\",\"name\":\"Cara Melakukan Clustering Data dengan Python (dengan Contoh) - Statologi\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/id\/#website\"},\"datePublished\":\"2023-07-21T22:38:17+00:00\",\"dateModified\":\"2023-07-21T22:38:17+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\"},\"description\":\"Tutorial ini menjelaskan cara melakukan pengelompokan data dengan Python, dengan beberapa contoh.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/#breadcrumb\"},\"inLanguage\":\"id\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/statorials.org\/id\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cara melakukan pengelompokan data dengan python: dengan contoh\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/id\/#website\",\"url\":\"https:\/\/statorials.org\/id\/\",\"name\":\"Statorials\",\"description\":\"Panduan anda untuk kompetensi statistik!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/id\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"id\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\",\"name\":\"Benjamin anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"id\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin anderson\"},\"description\":\"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya\",\"sameAs\":[\"http:\/\/statorials.org\/id\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Cara Melakukan Clustering Data dengan Python (dengan Contoh) - Statologi","description":"Tutorial ini menjelaskan cara melakukan pengelompokan data dengan Python, dengan beberapa contoh.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/","og_locale":"id_ID","og_type":"article","og_title":"Cara Melakukan Clustering Data dengan Python (dengan Contoh) - Statologi","og_description":"Tutorial ini menjelaskan cara melakukan pengelompokan data dengan Python, dengan beberapa contoh.","og_url":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/","og_site_name":"Statorials","article_published_time":"2023-07-21T22:38:17+00:00","author":"Benjamin anderson","twitter_card":"summary_large_image","twitter_misc":{"Ditulis oleh":"Benjamin anderson","Estimasi waktu membaca":"2 
menit"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/","url":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/","name":"Cara Melakukan Clustering Data dengan Python (dengan Contoh) - Statologi","isPartOf":{"@id":"https:\/\/statorials.org\/id\/#website"},"datePublished":"2023-07-21T22:38:17+00:00","dateModified":"2023-07-21T22:38:17+00:00","author":{"@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81"},"description":"Tutorial ini menjelaskan cara melakukan pengelompokan data dengan Python, dengan beberapa contoh.","breadcrumb":{"@id":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/#breadcrumb"},"inLanguage":"id","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/id\/pengelompokan-data-dengan-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/statorials.org\/id\/"},{"@type":"ListItem","position":2,"name":"Cara melakukan pengelompokan data dengan python: dengan contoh"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/id\/#website","url":"https:\/\/statorials.org\/id\/","name":"Statorials","description":"Panduan anda untuk kompetensi statistik!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/id\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"id"},{"@type":"Person","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81","name":"Benjamin 
anderson","image":{"@type":"ImageObject","inLanguage":"id","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Benjamin anderson"},"description":"Halo, saya Benjamin, pensiunan profesor statistika yang menjadi guru Statorial yang berdedikasi. Dengan pengalaman dan keahlian yang luas di bidang statistika, saya ingin berbagi ilmu untuk memberdayakan mahasiswa melalui Statorials. Baca selengkapnya","sameAs":["http:\/\/statorials.org\/id"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2517"}],"collection":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/comments?post=2517"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2517\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/media?parent=2517"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/categories?post=2517"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/tags?post=2517"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}