{"id":2236,"date":"2023-07-23T03:24:57","date_gmt":"2023-07-23T03:24:57","guid":{"rendered":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/"},"modified":"2023-07-23T03:24:57","modified_gmt":"2023-07-23T03:24:57","slug":"codificacao-quente-em-python","status":"publish","type":"post","link":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/","title":{"rendered":"How to Perform One-Hot Encoding in Python"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\"><strong>One-hot encoding<\/strong> is used to convert categorical variables into a format that can be readily used by <a href=\"https:\/\/statorials.org\/pt\/estatologia-explica-conceitos-de-forma-simples-e-direta-facilitamos-o-aprendizado-de-estatistica\/\" target=\"_blank\" rel=\"noopener\">machine learning algorithms<\/a>.<\/span><\/p>\n<p> <span style=\"color: #000000;\">The basic idea of one-hot encoding is to create one new variable per category, each taking the value 1 where the original variable equals that category and 0 otherwise.<\/span><\/p>\n<p> <span style=\"color: #000000;\">For example, the following image shows how we would use one-hot encoding to convert a categorical variable containing team names into new variables containing only the values 0 and 1:<\/span> <\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" wp-image-20468 aligncenter\" src=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png\" alt=\"\" width=\"592\" height=\"275\" srcset=\"\" sizes=\"auto, \"><\/p>\n<p> <span style=\"color: #000000;\">The following step-by-step example shows how to perform one-hot encoding for this exact dataset in Python.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Step 1: Create the Data<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">First, let's create the following DataFrame with 
pandas:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #107d3f;\">import<\/span> pandas <span style=\"color: #107d3f;\">as<\/span> pd\n\n<span style=\"color: #008080;\">#create DataFrame\n<\/span>df = pd.<span style=\"color: #3366ff;\">DataFrame<\/span>({'<span style=\"color: #ff0000;\">team<\/span>': ['A', 'A', 'B', 'B', 'B', 'B', 'C', 'C'],\n                   '<span style=\"color: #ff0000;\">points<\/span>': [25, 12, 15, 14, 19, 23, 25, 29]})\n\n<span style=\"color: #008080;\">#view DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(df)\n\n  team  points\n0    A      25\n1    A      12\n2    B      15\n3    B      14\n4    B      19\n5    B      23\n6    C      25\n7    C      29<\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Step 2: Perform One-Hot Encoding<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Next, let's import the <strong>OneHotEncoder()<\/strong> class from the <strong>sklearn<\/strong> library and use it to perform one-hot encoding on the &#8216;team&#8217; variable in the pandas DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">from<\/span> sklearn.<span style=\"color: #3366ff;\">preprocessing<\/span> <span style=\"color: #008000;\">import<\/span> OneHotEncoder\n\n<span style=\"color: #008080;\">#create instance of one-hot encoder\n<\/span>encoder = OneHotEncoder(handle_unknown='<span style=\"color: #ff0000;\">ignore<\/span>')\n\n<span style=\"color: #008080;\">#perform one-hot encoding on 'team' column\n<\/span>encoder_df = pd.<span style=\"color: #3366ff;\">DataFrame<\/span>(<span style=\"color: #3366ff;\">encoder.fit_transform<\/span>(df[['<span style=\"color: #ff0000;\">team<\/span>']]).<span style=\"color: #3366ff;\">toarray<\/span>())\n\n<span style=\"color: #008080;\">#merge one-hot encoded columns back with original DataFrame\n<\/span>final_df = df.<span style=\"color: #3366ff;\">join<\/span>(encoder_df)\n\n<span style=\"color: #008080;\">#view final df\n<\/span><span style=\"color: #008000;\">print<\/span>(final_df)\n\n  team  points    0    1    2\n0    A      25  1.0  0.0  0.0\n1    A      12  1.0  0.0  0.0\n2    B      15  0.0  1.0  0.0\n3    B      14  0.0  1.0  0.0\n4    B      19  0.0  1.0  0.0\n5    B      23  0.0  1.0  0.0\n6    C      25  0.0  0.0  1.0\n7    C      29  0.0  0.0  1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Notice that three new columns were added to the DataFrame because the original &#8216;team&#8217; column contained three unique values.<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>Note<\/strong>: You can find the complete documentation for the <strong>OneHotEncoder()<\/strong> class <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.preprocessing.OneHotEncoder.html\" target=\"_blank\" rel=\"noopener\">here<\/a>.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Step 3: Drop the Original Categorical Variable<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Lastly, we can drop the original \u2018team\u2019 variable from the DataFrame since we no longer need it:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#drop 'team' column\n<\/span>final_df.<span style=\"color: #3366ff;\">drop<\/span>('<span style=\"color: #ff0000;\">team<\/span>', axis=<span style=\"color: #008000;\">1<\/span>, inplace=<span style=\"color: #008000;\">True<\/span>)\n\n<span style=\"color: #008080;\">#view final df\n<\/span><span style=\"color: #008000;\">print<\/span>(final_df)\n\n   points    0    1    2\n0      25  1.0  0.0  0.0\n1      12  1.0  0.0  0.0\n2      15  0.0  1.0  0.0\n3      14  0.0  1.0  0.0\n4      19  0.0  1.0  0.0\n5      23  0.0  1.0  0.0\n6      25  0.0  0.0  1.0\n7      29  0.0  0.0  1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\"><strong>Related:<\/strong><\/span> <a href=\"https:\/\/statorials.org\/pt\/soltar-pandas-de-coluna\/\" target=\"_blank\" rel=\"noopener\">How to Drop Columns in Pandas (4 Methods)<\/a><\/p>\n<p> <span style=\"color: #000000;\">We could also rename the columns of the final DataFrame to make them easier to read:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#rename columns\n<\/span>final_df.<span style=\"color: #3366ff;\">columns<\/span> = ['points', 'teamA', 'teamB', 'teamC']\n\n<span style=\"color: #008080;\">#view final df<\/span>\n<span style=\"color: #008000;\">print<\/span>(final_df)\n\n   points  teamA  teamB  teamC\n0      25    1.0    0.0    0.0\n1      12    1.0    0.0    0.0\n2      15    0.0    1.0    0.0\n3      14    0.0    1.0    0.0\n4      19    0.0    1.0    0.0\n5      23    0.0    1.0    0.0\n6      25    0.0    0.0    1.0\n7      29    0.0    0.0    1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">One-hot encoding is now complete, and we can feed this pandas DataFrame into any machine learning algorithm we like.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Additional Resources<\/strong><\/span><\/h3>\n<p> <a href=\"https:\/\/statorials.org\/pt\/tamanho-medio-em-python\/\" target=\"_blank\" rel=\"noopener\">How to Calculate a Trimmed Mean in Python<\/a><br \/> <a href=\"https:\/\/statorials.org\/pt\/regressao-linear-python\/\" target=\"_blank\" rel=\"noopener\">How to Perform Linear Regression in Python<\/a><br \/> <a href=\"https:\/\/statorials.org\/pt\/regressao-logistica-python\/\" target=\"_blank\" rel=\"noopener\">How to Perform Logistic Regression in Python<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>One-hot encoding is used to convert categorical variables into a format that can be readily used by machine learning algorithms. The basic idea of one-hot encoding is to create new variables that take on the values 0 and 1 to represent the original categorical values. 
For example, the following image shows how [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"class_list":["post-2236","post","type-post","status-publish","format-standard","hentry","category-guia"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Como realizar a codifica\u00e7\u00e3o One-Hot em Python - Estatologia<\/title>\n<meta name=\"description\" content=\"Este tutorial explica como fazer codifica\u00e7\u00e3o one-hot em Python, com um exemplo passo a passo.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/\" \/>\n<meta property=\"og:locale\" content=\"pt_PT\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Como realizar a codifica\u00e7\u00e3o One-Hot em Python - Estatologia\" \/>\n<meta property=\"og:description\" content=\"Este tutorial explica como fazer codifica\u00e7\u00e3o one-hot em Python, com um exemplo passo a passo.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-23T03:24:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png\" \/>\n<meta name=\"author\" content=\"Dr. 
benjamim anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tempo estimado de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/\",\"url\":\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/\",\"name\":\"Como realizar a codifica\u00e7\u00e3o One-Hot em Python - Estatologia\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/pt\/#website\"},\"datePublished\":\"2023-07-23T03:24:57+00:00\",\"dateModified\":\"2023-07-23T03:24:57+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666\"},\"description\":\"Este tutorial explica como fazer codifica\u00e7\u00e3o one-hot em Python, com um exemplo passo a passo.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/#breadcrumb\"},\"inLanguage\":\"pt-PT\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Lar\",\"item\":\"https:\/\/statorials.org\/pt\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Como realizar a codifica\u00e7\u00e3o one-hot em python\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/pt\/#website\",\"url\":\"https:\/\/statorials.org\/pt\/\",\"name\":\"Statorials\",\"description\":\"O seu guia para a literacia estat\u00edstica!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/pt\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"pt-PT\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666\",\"name\":\"Dr. benjamim anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\/\/statorials.org\/pt\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Dr. benjamim anderson\"},\"description\":\"Ol\u00e1, sou Benjamin, um professor aposentado de estat\u00edstica que se tornou professor dedicado na Statorials. Com vasta experi\u00eancia e conhecimento na \u00e1rea de estat\u00edstica, estou empenhado em compartilhar meu conhecimento para capacitar os alunos por meio de Statorials. Saber mais\",\"sameAs\":[\"https:\/\/statorials.org\/pt\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Como realizar a codifica\u00e7\u00e3o One-Hot em Python - Estatologia","description":"Este tutorial explica como fazer codifica\u00e7\u00e3o one-hot em Python, com um exemplo passo a passo.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/","og_locale":"pt_PT","og_type":"article","og_title":"Como realizar a codifica\u00e7\u00e3o One-Hot em Python - Estatologia","og_description":"Este tutorial explica como fazer codifica\u00e7\u00e3o one-hot em Python, com um exemplo passo a passo.","og_url":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/","og_site_name":"Statorials","article_published_time":"2023-07-23T03:24:57+00:00","og_image":[{"url":"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png"}],"author":"Dr. 
benjamim anderson","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"Dr. benjamim anderson","Tempo estimado de leitura":"2 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/","url":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/","name":"Como realizar a codifica\u00e7\u00e3o One-Hot em Python - Estatologia","isPartOf":{"@id":"https:\/\/statorials.org\/pt\/#website"},"datePublished":"2023-07-23T03:24:57+00:00","dateModified":"2023-07-23T03:24:57+00:00","author":{"@id":"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666"},"description":"Este tutorial explica como fazer codifica\u00e7\u00e3o one-hot em Python, com um exemplo passo a passo.","breadcrumb":{"@id":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/#breadcrumb"},"inLanguage":"pt-PT","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/pt\/codificacao-quente-em-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Lar","item":"https:\/\/statorials.org\/pt\/"},{"@type":"ListItem","position":2,"name":"Como realizar a codifica\u00e7\u00e3o one-hot em python"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/pt\/#website","url":"https:\/\/statorials.org\/pt\/","name":"Statorials","description":"O seu guia para a literacia estat\u00edstica!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/pt\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"pt-PT"},{"@type":"Person","@id":"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666","name":"Dr. 
benjamim anderson","image":{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/statorials.org\/pt\/#\/schema\/person\/image\/","url":"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Dr. benjamim anderson"},"description":"Ol\u00e1, sou Benjamin, um professor aposentado de estat\u00edstica que se tornou professor dedicado na Statorials. Com vasta experi\u00eancia e conhecimento na \u00e1rea de estat\u00edstica, estou empenhado em compartilhar meu conhecimento para capacitar os alunos por meio de Statorials. Saber mais","sameAs":["https:\/\/statorials.org\/pt"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/posts\/2236","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/comments?post=2236"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/posts\/2236\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/media?parent=2236"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/categories?post=2236"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/tags?post=2236"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}