{"id":2235,"date":"2023-07-23T03:24:57","date_gmt":"2023-07-23T03:24:57","guid":{"rendered":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/"},"modified":"2023-07-23T03:24:57","modified_gmt":"2023-07-23T03:24:57","slug":"gorace-kodowanie-w-pythonie","status":"publish","type":"post","link":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/","title":{"rendered":"How to Perform One-Hot Encoding in Python"},"content":{"rendered":"<hr>\n<p><span style=\"color: #000000;\"><strong>One-hot encoding<\/strong> converts categorical variables into a format that <a href=\"https:\/\/statorials.org\" target=\"_blank\" rel=\"noopener\">machine learning algorithms<\/a> can readily use.<\/span><\/p>\n<p> <span style=\"color: #000000;\">The basic idea of one-hot encoding is to create new variables that take the values 0 and 1 to represent the original categorical values.<\/span><\/p>\n<p> <span style=\"color: #000000;\">For example, the following image shows how one-hot encoding converts a categorical variable containing team names into new variables that contain only the values 0 and 1:<\/span> <\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" wp-image-20468 aligncenter\" src=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png\" alt=\"\" width=\"592\" height=\"275\" srcset=\"\" sizes=\"auto, \"><\/p>\n<p> <span style=\"color: #000000;\">The following step-by-step example shows how to perform one-hot encoding for this exact dataset in Python.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Step 1: Create the Data<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">First, let's create the following pandas DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"><strong><span style=\"color: #107d3f;\">import<\/span> pandas <span style=\"color: #107d3f;\">as<\/span> pd\n\n<span style=\"color: #008080;\">#create DataFrame\n<\/span>df = pd.<span style=\"color: #3366ff;\">DataFrame<\/span>({'<span style=\"color: #ff0000;\">team<\/span>': ['A', 'A', 'B', 'B', 'B', 'B', 'C', 'C'],\n                   '<span style=\"color: #ff0000;\">points<\/span>': [25, 12, 15, 14, 19, 23, 25, 29]})\n\n<span style=\"color: #008080;\">#view DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(df)\n\n  team  points\n0    A      25\n1    A      12\n2    B      15\n3    B      14\n4    B      19\n5    B      23\n6    C      25\n7    C      29<\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Step 2: Perform One-Hot Encoding<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Next, let's import the <strong>OneHotEncoder()<\/strong> class from the <strong>sklearn<\/strong> library and use it to perform one-hot encoding on the 'team' variable in the pandas DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"><strong><span style=\"color: #008000;\">from<\/span> sklearn.<span style=\"color: #3366ff;\">preprocessing<\/span> <span style=\"color: #008000;\">import<\/span> OneHotEncoder\n\n<span style=\"color: #008080;\">#create instance of one-hot encoder\n<\/span>encoder = OneHotEncoder(handle_unknown='<span style=\"color: #ff0000;\">ignore<\/span>')\n\n<span style=\"color: #008080;\">#perform one-hot encoding on 'team' column\n<\/span>encoder_df = pd.<span style=\"color: #3366ff;\">DataFrame<\/span>(<span style=\"color: #3366ff;\">encoder.fit_transform<\/span>(df[['<span style=\"color: #ff0000;\">team<\/span>']]).<span style=\"color: #3366ff;\">toarray<\/span>())\n\n<span style=\"color: #008080;\">#merge one-hot encoded columns back with original DataFrame\n<\/span>final_df = df. 
<span style=\"color: #3366ff;\">join<\/span>(encoder_df)\n\n<span style=\"color: #008080;\">#view final df\n<\/span><span style=\"color: #008000;\">print<\/span>(final_df)\n\n  team  points    0    1    2\n0    A      25  1.0  0.0  0.0\n1    A      12  1.0  0.0  0.0\n2    B      15  0.0  1.0  0.0\n3    B      14  0.0  1.0  0.0\n4    B      19  0.0  1.0  0.0\n5    B      23  0.0  1.0  0.0\n6    C      25  0.0  0.0  1.0\n7    C      29  0.0  0.0  1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Notice that three new columns were added to the DataFrame, since the original 'team' column contained three unique values.<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>Note<\/strong>: The complete documentation for <strong>OneHotEncoder()<\/strong> can be found <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.preprocessing.OneHotEncoder.html\" target=\"_blank\" rel=\"noopener\">here<\/a>.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Step 3: Drop the Original Categorical Variable<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Lastly, we can drop the original 'team' variable from the DataFrame, since we no longer need it:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"><strong><span style=\"color: #008080;\">#drop 'team' column\n<\/span>final_df. 
<span style=\"color: #3366ff;\">drop<\/span>('<span style=\"color: #ff0000;\">team<\/span>', axis=<span style=\"color: #008000;\">1<\/span>, inplace=<span style=\"color: #008000;\">True<\/span>)\n\n<span style=\"color: #008080;\">#view final df\n<\/span><span style=\"color: #008000;\">print<\/span>(final_df)\n\n   points    0    1    2\n0      25  1.0  0.0  0.0\n1      12  1.0  0.0  0.0\n2      15  0.0  1.0  0.0\n3      14  0.0  1.0  0.0\n4      19  0.0  1.0  0.0\n5      23  0.0  1.0  0.0\n6      25  0.0  0.0  1.0\n7      29  0.0  0.0  1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\"><strong>Related:<\/strong><\/span> <a href=\"https:\/\/statorials.org\/pl\/upusc-pandy-kolumnowe\/\" target=\"_blank\" rel=\"noopener\">How to Drop Columns in Pandas (4 Methods)<\/a><\/p>\n<p> <span style=\"color: #000000;\">We could also rename the columns of the final DataFrame to make them easier to read:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"><strong><span style=\"color: #008080;\">#rename columns\n<\/span>final_df. 
<span style=\"color: #3366ff;\">columns<\/span> = ['points', 'teamA', 'teamB', 'teamC']\n\n<span style=\"color: #008080;\">#view final df<\/span>\n<span style=\"color: #008000;\">print<\/span>(final_df)\n\n   points  teamA  teamB  teamC\n0      25    1.0    0.0    0.0\n1      12    1.0    0.0    0.0\n2      15    0.0    1.0    0.0\n3      14    0.0    1.0    0.0\n4      19    0.0    1.0    0.0\n5      23    0.0    1.0    0.0\n6      25    0.0    0.0    1.0\n7      29    0.0    0.0    1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">The one-hot encoding is complete, and we can now feed this pandas DataFrame into any machine learning algorithm.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Additional Resources<\/strong><\/span><\/h3>\n<p> <a href=\"https:\/\/statorials.org\/pl\/sredni-rozmiar-w-pythonie\/\" target=\"_blank\" rel=\"noopener\">How to Calculate a Trimmed Mean in Python<\/a><br \/> <a href=\"https:\/\/statorials.org\/pl\/python-regresji-liniowej\/\" target=\"_blank\" rel=\"noopener\">How to Perform Linear Regression in Python<\/a><br \/> <a href=\"https:\/\/statorials.org\/pl\/python-regresji-logistycznej\/\" target=\"_blank\" rel=\"noopener\">How to Perform Logistic Regression in Python<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>One-hot encoding converts categorical variables into a format that machine learning algorithms can readily use. The basic idea of one-hot encoding is to create new variables that take the values 0 and 1 to represent the original categorical values. 
For example, the following image shows how one-hot encoding converts a categorical variable containing [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[],"class_list":["post-2235","post","type-post","status-publish","format-standard","hentry","category-przewodnik"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Jak wykona\u0107 kodowanie One-Hot w Pythonie - Statologia<\/title>\n<meta name=\"description\" content=\"W tym samouczku wyja\u015bniono, jak wykona\u0107 kodowanie typu one-hot w j\u0119zyku Python, na przyk\u0142adzie krok po kroku.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/\" \/>\n<meta property=\"og:locale\" content=\"pl_PL\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Jak wykona\u0107 kodowanie One-Hot w Pythonie - Statologia\" \/>\n<meta property=\"og:description\" content=\"W tym samouczku wyja\u015bniono, jak wykona\u0107 kodowanie typu one-hot w j\u0119zyku Python, na przyk\u0142adzie krok po kroku.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-23T03:24:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png\" \/>\n<meta name=\"author\" content=\"Benjamin Anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" 
content=\"Napisane przez\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin Anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Szacowany czas czytania\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minuty\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/\",\"url\":\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/\",\"name\":\"Jak wykona\u0107 kodowanie One-Hot w Pythonie - Statologia\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/pl\/#website\"},\"datePublished\":\"2023-07-23T03:24:57+00:00\",\"dateModified\":\"2023-07-23T03:24:57+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965\"},\"description\":\"W tym samouczku wyja\u015bniono, jak wykona\u0107 kodowanie typu one-hot w j\u0119zyku Python, na przyk\u0142adzie krok po kroku.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/#breadcrumb\"},\"inLanguage\":\"pl-PL\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Dom\",\"item\":\"https:\/\/statorials.org\/pl\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Jak wykona\u0107 kodowanie one-hot w pythonie\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/pl\/#website\",\"url\":\"https:\/\/statorials.org\/pl\/\",\"name\":\"Statorials\",\"description\":\"Tw\u00f3j przewodnik po kompetencjach statystycznych!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/pl\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"pl-PL\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965\",\"name\":\"Benjamin Anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pl-PL\",\"@id\":\"https:\/\/statorials.org\/pl\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin Anderson\"},\"description\":\"Cze\u015b\u0107, jestem Benjamin i jestem emerytowanym profesorem statystyki, kt\u00f3ry zosta\u0142 oddanym nauczycielem Statorials. Dzi\u0119ki bogatemu do\u015bwiadczeniu i wiedzy specjalistycznej w dziedzinie statystyki ch\u0119tnie dziel\u0119 si\u0119 swoj\u0105 wiedz\u0105, aby wzmocni\u0107 pozycj\u0119 uczni\u00f3w za po\u015brednictwem Statorials. Wiedzie\u0107 wi\u0119cej\",\"sameAs\":[\"https:\/\/statorials.org\/pl\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Jak wykona\u0107 kodowanie One-Hot w Pythonie - Statologia","description":"W tym samouczku wyja\u015bniono, jak wykona\u0107 kodowanie typu one-hot w j\u0119zyku Python, na przyk\u0142adzie krok po kroku.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/","og_locale":"pl_PL","og_type":"article","og_title":"Jak wykona\u0107 kodowanie One-Hot w Pythonie - Statologia","og_description":"W tym samouczku wyja\u015bniono, jak wykona\u0107 kodowanie typu one-hot w j\u0119zyku Python, na przyk\u0142adzie krok po kroku.","og_url":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/","og_site_name":"Statorials","article_published_time":"2023-07-23T03:24:57+00:00","og_image":[{"url":"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png"}],"author":"Benjamin Anderson","twitter_card":"summary_large_image","twitter_misc":{"Napisane przez":"Benjamin Anderson","Szacowany czas czytania":"2 minuty"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/","url":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/","name":"Jak wykona\u0107 kodowanie One-Hot w Pythonie - Statologia","isPartOf":{"@id":"https:\/\/statorials.org\/pl\/#website"},"datePublished":"2023-07-23T03:24:57+00:00","dateModified":"2023-07-23T03:24:57+00:00","author":{"@id":"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965"},"description":"W tym samouczku wyja\u015bniono, jak wykona\u0107 kodowanie typu one-hot w j\u0119zyku Python, na przyk\u0142adzie krok po 
kroku.","breadcrumb":{"@id":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/#breadcrumb"},"inLanguage":"pl-PL","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/pl\/gorace-kodowanie-w-pythonie\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Dom","item":"https:\/\/statorials.org\/pl\/"},{"@type":"ListItem","position":2,"name":"Jak wykona\u0107 kodowanie one-hot w pythonie"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/pl\/#website","url":"https:\/\/statorials.org\/pl\/","name":"Statorials","description":"Tw\u00f3j przewodnik po kompetencjach statystycznych!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/pl\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"pl-PL"},{"@type":"Person","@id":"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965","name":"Benjamin Anderson","image":{"@type":"ImageObject","inLanguage":"pl-PL","@id":"https:\/\/statorials.org\/pl\/#\/schema\/person\/image\/","url":"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg","contentUrl":"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg","caption":"Benjamin Anderson"},"description":"Cze\u015b\u0107, jestem Benjamin i jestem emerytowanym profesorem statystyki, kt\u00f3ry zosta\u0142 oddanym nauczycielem Statorials. Dzi\u0119ki bogatemu do\u015bwiadczeniu i wiedzy specjalistycznej w dziedzinie statystyki ch\u0119tnie dziel\u0119 si\u0119 swoj\u0105 wiedz\u0105, aby wzmocni\u0107 pozycj\u0119 uczni\u00f3w za po\u015brednictwem Statorials. 
Wiedzie\u0107 wi\u0119cej","sameAs":["https:\/\/statorials.org\/pl"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/posts\/2235","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/comments?post=2235"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/posts\/2235\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/media?parent=2235"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/categories?post=2235"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/tags?post=2235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}