{"id":2237,"date":"2023-07-23T03:24:57","date_gmt":"2023-07-23T03:24:57","guid":{"rendered":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/"},"modified":"2023-07-23T03:24:57","modified_gmt":"2023-07-23T03:24:57","slug":"hot-codering-in-python","status":"publish","type":"post","link":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/","title":{"rendered":"One-hot-codering uitvoeren in python"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\"><strong>One-hot-codering<\/strong> wordt gebruikt om categorische variabelen om te zetten in een indeling die gemakkelijk kan worden gebruikt door machine learning-algoritmen .<\/span><\/p>\n<p> <span style=\"color: #000000;\">Het basisidee van one-hot coding is het cre\u00ebren van nieuwe variabelen die de waarden 0 en 1 aannemen om de oorspronkelijke categorische waarden weer te geven.<\/span><\/p>\n<p> <span style=\"color: #000000;\">De volgende afbeelding laat bijvoorbeeld zien hoe we one-hot-encoderen om een categorische variabele met teamnamen te converteren naar nieuwe variabelen die alleen 0- en 1-waarden bevatten:<\/span> <\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" wp-image-20468 aligncenter\" src=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png\" alt=\"\" width=\"592\" height=\"275\" srcset=\"\" sizes=\"auto, \"><\/p>\n<p> <span style=\"color: #000000;\">In het volgende stapsgewijze voorbeeld ziet u hoe u one-hot-codering kunt uitvoeren voor deze exacte gegevensset in Python.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Stap 1: Cre\u00eber de gegevens<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Laten we eerst de volgende panda&#8217;s DataFrame maken:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #107d3f;\">import<\/span> pandas <span style=\"color: #107d3f;\">as<\/span> pd\n\n<span style=\"color: #008080;\">#createDataFrame\n<\/span>df = pd. <span style=\"color: #3366ff;\">DataFrame<\/span> ({' <span style=\"color: #ff0000;\">team<\/span> ': ['A', 'A', 'B', 'B', 'B', 'B', 'C', 'C'],\n                   ' <span style=\"color: #ff0000;\">points<\/span> ': [25, 12, 15, 14, 19, 23, 25, 29]})\n\n<span style=\"color: #008080;\">#view DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span> (df)\n\n  team points\n0 to 25\n1 to 12\n2 B 15\n3 B 14\n4 B 19\n5 B 23\n6 C 25\n7 C 29<\/strong><\/pre>\n<h3> <span style=\"color: #000000;\"><strong>Stap 2: Voer one-hot-codering uit<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Laten we vervolgens de functie <strong>OneHotEncoder()<\/strong> importeren uit de <strong>sklearn-<\/strong> bibliotheek en deze gebruiken om hot-encoding uit te voeren op de &#8218;team&#8216;-variabele in het pandas DataFrame:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">from<\/span> sklearn. <span style=\"color: #3366ff;\">preprocessing<\/span> <span style=\"color: #008000;\">import<\/span> OneHotEncoder\n\n<span style=\"color: #008080;\">#creating instance of one-hot-encoder\n<\/span>encoder = OneHotEncoder(handle_unknown=' <span style=\"color: #ff0000;\">ignore<\/span> ')\n\n<span style=\"color: #008080;\">#perform one-hot encoding on 'team' column \n<\/span>encoder_df = pd. <span style=\"color: #3366ff;\">DataFrame<\/span> ( <span style=\"color: #3366ff;\">encoder.fit_transform<\/span> (df[[' <span style=\"color: #ff0000;\">team<\/span> ']]). <span style=\"color: #3366ff;\">toarray<\/span> ())\n\n<span style=\"color: #008080;\">#merge one-hot encoded columns back with original DataFrame\n<\/span>final_df = df. <span style=\"color: #3366ff;\">join<\/span> (encoder_df)\n\n<span style=\"color: #008080;\">#view final df\n<\/span><span style=\"color: #008000;\">print<\/span> (final_df)\n\n  team points 0 1 2\n0 to 25 1.0 0.0 0.0\n1 to 12 1.0 0.0 0.0\n2 B 15 0.0 1.0 0.0\n3 B 14 0.0 1.0 0.0\n4 B 19 0.0 1.0 0.0\n5 B 23 0.0 1.0 0.0\n6 C 25 0.0 0.0 1.0\n7 C 29 0.0 0.0 1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Houd er rekening mee dat er drie nieuwe kolommen aan het DataFrame zijn toegevoegd, omdat de oorspronkelijke kolom &#8218;team&#8216; drie unieke waarden bevatte.<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>Opmerking<\/strong> : u kunt de volledige documentatie voor de <strong>OneHotEncoder()-<\/strong> functie <a href=\"https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.preprocessing.OneHotEncoder.html\" target=\"_blank\" rel=\"noopener\">hier<\/a> vinden.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Stap 3: Verwijder de oorspronkelijke categorische variabele<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Ten slotte kunnen we de originele &#8218;team&#8216;-variabele uit het DataFrame verwijderen, omdat we deze niet langer nodig hebben:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#drop 'team' column\n<\/span>final_df. <span style=\"color: #3366ff;\">drop<\/span> (' <span style=\"color: #ff0000;\">team<\/span> ', axis= <span style=\"color: #008000;\">1<\/span> , inplace= <span style=\"color: #008000;\">True<\/span> )\n\n<span style=\"color: #008080;\">#view final df\n<\/span><span style=\"color: #008000;\">print<\/span> (final_df)\n\n   points 0 1 2\n0 25 1.0 0.0 0.0\n1 12 1.0 0.0 0.0\n2 15 0.0 1.0 0.0\n3 14 0.0 1.0 0.0\n4 19 0.0 1.0 0.0\n5 23 0.0 1.0 0.0\n6 25 0.0 0.0 1.0\n7 29 0.0 0.0 1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\"><strong>Gerelateerd:<\/strong><\/span> <a href=\"https:\/\/statorials.org\/nl\/laat-kolompandas-vallen\/\" target=\"_blank\" rel=\"noopener\">Kolommen verwijderen in Panda&#8217;s (4 methoden)<\/a><\/p>\n<p> <span style=\"color: #000000;\">We kunnen ook de kolommen van het uiteindelijke DataFrame hernoemen om ze gemakkelijker leesbaar te maken:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#rename columns\n<\/span>final_df. <span style=\"color: #3366ff;\">columns<\/span> = ['points', 'teamA', 'teamB', 'teamC']\n\n<span style=\"color: #008080;\">#view final df<\/span>\n<span style=\"color: #008000;\">print<\/span> (final_df)\n\n   points teamA teamB teamC\n0 25 1.0 0.0 0.0\n1 12 1.0 0.0 0.0\n2 15 0.0 1.0 0.0\n3 14 0.0 1.0 0.0\n4 19 0.0 1.0 0.0\n5 23 0.0 1.0 0.0\n6 25 0.0 0.0 1.0\n7 29 0.0 0.0 1.0\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">De one-hot-codering is voltooid en we kunnen dit panda&#8217;s DataFrame nu in elk gewenst machine learning-algoritme invoegen.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Aanvullende bronnen<\/strong><\/span><\/h3>\n<p> <a href=\"https:\/\/statorials.org\/nl\/middelgroot-in-python\/\" target=\"_blank\" rel=\"noopener\">Hoe een getrimd gemiddelde in Python te berekenen<\/a><br \/> <a href=\"https:\/\/statorials.org\/nl\/lineaire-regressiepython\/\" target=\"_blank\" rel=\"noopener\">Hoe lineaire regressie uit te voeren in Python<\/a><br \/> <a href=\"https:\/\/statorials.org\/nl\/logistische-regressiepython\/\" target=\"_blank\" rel=\"noopener\">Hoe logistieke regressie uit te voeren in Python<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>One-hot-codering wordt gebruikt om categorische variabelen om te zetten in een indeling die gemakkelijk kan worden gebruikt door machine learning-algoritmen . Het basisidee van one-hot coding is het cre\u00ebren van nieuwe variabelen die de waarden 0 en 1 aannemen om de oorspronkelijke categorische waarden weer te geven. De volgende afbeelding laat bijvoorbeeld zien hoe we [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"class_list":["post-2237","post","type-post","status-publish","format-standard","hentry","category-gids"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>One-Hot-codering uitvoeren in Python - Statorials<\/title>\n<meta name=\"description\" content=\"In deze tutorial wordt uitgelegd hoe u one-hot-codering uitvoert in Python, met een stapsgewijs voorbeeld.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"One-Hot-codering uitvoeren in Python - Statorials\" \/>\n<meta property=\"og:description\" content=\"In deze tutorial wordt uitgelegd hoe u one-hot-codering uitvoert in Python, met een stapsgewijs voorbeeld.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-23T03:24:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png\" \/>\n<meta name=\"author\" content=\"Dr.benjamin anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dr.benjamin anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"2\u00a0Minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/\",\"url\":\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/\",\"name\":\"One-Hot-codering uitvoeren in Python - Statorials\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/nl\/#website\"},\"datePublished\":\"2023-07-23T03:24:57+00:00\",\"dateModified\":\"2023-07-23T03:24:57+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219\"},\"description\":\"In deze tutorial wordt uitgelegd hoe u one-hot-codering uitvoert in Python, met een stapsgewijs voorbeeld.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/nl\/hot-codering-in-python\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Thuis\",\"item\":\"https:\/\/statorials.org\/nl\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"One-hot-codering uitvoeren in python\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/nl\/#website\",\"url\":\"https:\/\/statorials.org\/nl\/\",\"name\":\"Statorials\",\"description\":\"Uw gids voor statistische competentie\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/nl\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"de\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219\",\"name\":\"Dr.benjamin anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\/\/statorials.org\/nl\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Dr.benjamin anderson\"},\"description\":\"Ik ben Benjamin, een gepensioneerde hoogleraar statistiek die nu een toegewijde Statorials-lesgever is. Ik heb uitgebreide ervaring en expertise op het gebied van statistiek en ik ben vastbesloten om mijn kennis te delen met studenten via Statorials. Lees verder\",\"sameAs\":[\"http:\/\/statorials.org\/nl\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"One-Hot-codering uitvoeren in Python - Statorials","description":"In deze tutorial wordt uitgelegd hoe u one-hot-codering uitvoert in Python, met een stapsgewijs voorbeeld.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/","og_locale":"de_DE","og_type":"article","og_title":"One-Hot-codering uitvoeren in Python - Statorials","og_description":"In deze tutorial wordt uitgelegd hoe u one-hot-codering uitvoert in Python, met een stapsgewijs voorbeeld.","og_url":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/","og_site_name":"Statorials","article_published_time":"2023-07-23T03:24:57+00:00","og_image":[{"url":"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/unhot1.png"}],"author":"Dr.benjamin anderson","twitter_card":"summary_large_image","twitter_misc":{"Verfasst von":"Dr.benjamin anderson","Gesch\u00e4tzte Lesezeit":"2\u00a0Minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/","url":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/","name":"One-Hot-codering uitvoeren in Python - Statorials","isPartOf":{"@id":"https:\/\/statorials.org\/nl\/#website"},"datePublished":"2023-07-23T03:24:57+00:00","dateModified":"2023-07-23T03:24:57+00:00","author":{"@id":"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219"},"description":"In deze tutorial wordt uitgelegd hoe u one-hot-codering uitvoert in Python, met een stapsgewijs voorbeeld.","breadcrumb":{"@id":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/nl\/hot-codering-in-python\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/nl\/hot-codering-in-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Thuis","item":"https:\/\/statorials.org\/nl\/"},{"@type":"ListItem","position":2,"name":"One-hot-codering uitvoeren in python"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/nl\/#website","url":"https:\/\/statorials.org\/nl\/","name":"Statorials","description":"Uw gids voor statistische competentie","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/nl\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"de"},{"@type":"Person","@id":"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219","name":"Dr.benjamin anderson","image":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/statorials.org\/nl\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Dr.benjamin anderson"},"description":"Ik ben Benjamin, een gepensioneerde hoogleraar statistiek die nu een toegewijde Statorials-lesgever is. Ik heb uitgebreide ervaring en expertise op het gebied van statistiek en ik ben vastbesloten om mijn kennis te delen met studenten via Statorials. Lees verder","sameAs":["http:\/\/statorials.org\/nl"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/posts\/2237","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/comments?post=2237"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/posts\/2237\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/media?parent=2237"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/categories?post=2237"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/tags?post=2237"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}