{"id":2240,"date":"2023-07-23T03:04:58","date_gmt":"2023-07-23T03:04:58","guid":{"rendered":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/"},"modified":"2023-07-23T03:04:58","modified_gmt":"2023-07-23T03:04:58","slug":"gegevens-transformeren-in-python","status":"publish","type":"post","link":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/","title":{"rendered":"Gegevens transformeren in python (logboek, vierkantswortel, kubuswortel)"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\">Veel statistische tests gaan ervan uit dat datasets normaal verdeeld zijn. In de praktijk is dit echter vaak niet het geval.<\/span><\/p>\n<p> <span style=\"color: #000000;\">E\u00e9n manier om dit probleem op te lossen is door de verdeling van waarden in een dataset te transformeren met behulp van een van de volgende drie transformaties:<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>1. Logtransformatie:<\/strong> transformeer de responsvariabele van y naar <strong>log(y)<\/strong> .<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>2. Vierkantsworteltransformatie:<\/strong> Transformeer de responsvariabele van y naar <strong><span style=\"text-decoration: overline;\">\u221ay<\/span><\/strong> .<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>3. Derdemachtsworteltransformatie:<\/strong> transformeer de responsvariabele van y naar <strong>y <sup>1\/3<\/sup><\/strong> .<\/span><\/p>\n<p> <span style=\"color: #000000;\">Door deze transformaties uit te voeren, wordt de dataset doorgaans normaler verdeeld.<\/span><\/p>\n<p> <span style=\"color: #000000;\">De volgende voorbeelden laten zien hoe u deze transformaties in Python kunt uitvoeren.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Logboektransformatie in Python<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">De volgende code laat zien hoe u een <strong>logaritmische transformatie<\/strong> op een variabele uitvoert en zij-aan-zij plots maakt om de oorspronkelijke distributie en de log-getransformeerde distributie van de gegevens weer te geven:<\/span> <\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> numpy <span style=\"color: #008000;\">as<\/span> np\n<span style=\"color: #008000;\">import<\/span> matplotlib. <span style=\"color: #3366ff;\">pyplot<\/span> <span style=\"color: #008000;\">as<\/span> plt\n\n<span style=\"color: #008080;\">#make this example reproducible\n<\/span>n.p. <span style=\"color: #3366ff;\">random<\/span> . <span style=\"color: #3366ff;\">seeds<\/span> (0)\n\n<span style=\"color: #008080;\">#create beta distributed random variable with 200 values\n<\/span>data = np. <span style=\"color: #3366ff;\">random<\/span> . <span style=\"color: #3366ff;\">beta<\/span> (a= <span style=\"color: #008000;\">4<\/span> , b= <span style=\"color: #008000;\">15<\/span> , size= <span style=\"color: #008000;\">300<\/span> )\n\n<span style=\"color: #008080;\">#create log-transformed data\n<\/span>data_log = np. <span style=\"color: #3366ff;\">log<\/span> (data)\n\n<span style=\"color: #008080;\">#define grid of plots\n<\/span>fig, axs = plt. <span style=\"color: #3366ff;\">subplots<\/span> (nrows= <span style=\"color: #008000;\">1<\/span> , ncols= <span style=\"color: #008000;\">2<\/span> )\n\n<span style=\"color: #008080;\">#create histograms\n<\/span>axs[0]. <span style=\"color: #3366ff;\">hist<\/span> (data, edgecolor=' <span style=\"color: #ff0000;\">black<\/span> ')\naxs[1]. <span style=\"color: #3366ff;\">hist<\/span> (data_log, edgecolor=' <span style=\"color: #ff0000;\">black<\/span> ')\n\n<span style=\"color: #008080;\">#add title to each histogram\n<\/span>axs[0]. <span style=\"color: #3366ff;\">set_title<\/span> (' <span style=\"color: #ff0000;\">Original Data<\/span> ')\naxs[1].set_title(' <span style=\"color: #ff0000;\">Log-Transformed Data<\/span> ')\n<\/strong><\/pre>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" wp-image-20488 aligncenter\" src=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/trans11.png\" alt=\"\" width=\"550\" height=\"384\" srcset=\"\" sizes=\"auto, \"><\/p>\n<p> <span style=\"color: #000000;\">Merk op hoe de log-getransformeerde distributie normaler verdeeld is dan de oorspronkelijke distributie.<\/span><\/p>\n<p> <span style=\"color: #000000;\">Het is nog steeds geen perfecte &#8222;klokvorm&#8220;, maar het ligt dichter bij een normale verdeling dan de oorspronkelijke verdeling.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Vierkantsworteltransformatie in Python<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">De volgende code laat zien hoe u een <strong>vierkantsworteltransformatie<\/strong> op een variabele uitvoert en zij-aan-zij plots maakt om de oorspronkelijke distributie en de getransformeerde vierkantswortelverdeling van de gegevens weer te geven:<\/span> <\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> numpy <span style=\"color: #008000;\">as<\/span> np\n<span style=\"color: #008000;\">import<\/span> matplotlib. <span style=\"color: #3366ff;\">pyplot<\/span> <span style=\"color: #008000;\">as<\/span> plt\n\n<span style=\"color: #008080;\">#make this example reproducible\n<\/span>n.p. <span style=\"color: #3366ff;\">random<\/span> . <span style=\"color: #3366ff;\">seeds<\/span> (0)\n\n<span style=\"color: #008080;\">#create beta distributed random variable with 200 values\n<\/span>data = np. <span style=\"color: #3366ff;\">random<\/span> . <span style=\"color: #3366ff;\">beta<\/span> (a= <span style=\"color: #008000;\">1<\/span> , b= <span style=\"color: #008000;\">5<\/span> , size= <span style=\"color: #008000;\">300<\/span> )\n\n<span style=\"color: #008080;\">#create log-transformed data\n<\/span>data_log = np. <span style=\"color: #3366ff;\">sqrt<\/span> (data)\n\n<span style=\"color: #008080;\">#define grid of plots\n<\/span>fig, axs = plt. <span style=\"color: #3366ff;\">subplots<\/span> (nrows= <span style=\"color: #008000;\">1<\/span> , ncols= <span style=\"color: #008000;\">2<\/span> )\n\n<span style=\"color: #008080;\">#create histograms\n<\/span>axs[0]. <span style=\"color: #3366ff;\">hist<\/span> (data, edgecolor=' <span style=\"color: #ff0000;\">black<\/span> ')\naxs[1]. <span style=\"color: #3366ff;\">hist<\/span> (data_log, edgecolor=' <span style=\"color: #ff0000;\">black<\/span> ')\n\n<span style=\"color: #008080;\">#add title to each histogram\n<\/span>axs[0]. <span style=\"color: #3366ff;\">set_title<\/span> (' <span style=\"color: #ff0000;\">Original Data<\/span> ')\naxs[1].set_title(' <span style=\"color: #ff0000;\">Square Root Transformed Data<\/span> ')<\/strong> <\/pre>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" wp-image-20490 aligncenter\" src=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/trans12.png\" alt=\"\" width=\"555\" height=\"379\" srcset=\"\" sizes=\"auto, \"><\/p>\n<p> <span style=\"color: #000000;\">Merk op hoe de met de vierkantswortel getransformeerde gegevens veel normaler verdeeld zijn dan de oorspronkelijke gegevens.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Kubusworteltransformatie in Python<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">De volgende code laat zien hoe u een <strong>derdemachtsworteltransformatie<\/strong> op een variabele uitvoert en zij-aan-zij-plots maakt om de oorspronkelijke verdeling en de getransformeerde derdemachtswortelverdeling van de gegevens weer te geven:<\/span> <\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> numpy <span style=\"color: #008000;\">as<\/span> np\n<span style=\"color: #008000;\">import<\/span> matplotlib. <span style=\"color: #3366ff;\">pyplot<\/span> <span style=\"color: #008000;\">as<\/span> plt\n\n<span style=\"color: #008080;\">#make this example reproducible\n<\/span>n.p. <span style=\"color: #3366ff;\">random<\/span> . <span style=\"color: #3366ff;\">seeds<\/span> (0)\n\n<span style=\"color: #008080;\">#create beta distributed random variable with 200 values\n<\/span>data = np. <span style=\"color: #3366ff;\">random<\/span> . <span style=\"color: #3366ff;\">beta<\/span> (a= <span style=\"color: #008000;\">1<\/span> , b= <span style=\"color: #008000;\">5<\/span> , size= <span style=\"color: #008000;\">300<\/span> )\n\n<span style=\"color: #008080;\">#create log-transformed data\n<\/span>data_log = np. <span style=\"color: #3366ff;\">cbrt<\/span> (data)\n\n<span style=\"color: #008080;\">#define grid of plots\n<\/span>fig, axs = plt. <span style=\"color: #3366ff;\">subplots<\/span> (nrows= <span style=\"color: #008000;\">1<\/span> , ncols= <span style=\"color: #008000;\">2<\/span> )\n\n<span style=\"color: #008080;\">#create histograms\n<\/span>axs[0]. <span style=\"color: #3366ff;\">hist<\/span> (data, edgecolor=' <span style=\"color: #ff0000;\">black<\/span> ')\naxs[1]. <span style=\"color: #3366ff;\">hist<\/span> (data_log, edgecolor=' <span style=\"color: #ff0000;\">black<\/span> ')\n\n<span style=\"color: #008080;\">#add title to each histogram\n<\/span>axs[0]. <span style=\"color: #3366ff;\">set_title<\/span> (' <span style=\"color: #ff0000;\">Original Data<\/span> ')\naxs[1].set_title(' <span style=\"color: #ff0000;\">Cube Root Transformed Data<\/span> ')<\/strong> <\/pre>\n<p><img decoding=\"async\" loading=\"lazy\" class=\" wp-image-20491 aligncenter\" src=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/trans13.png\" alt=\"\" width=\"540\" height=\"370\" srcset=\"\" sizes=\"auto, \"><\/p>\n<p> <span style=\"color: #000000;\">Merk op dat de getransformeerde gegevens uit de derdemachtswortel veel normaler verdeeld zijn dan de oorspronkelijke gegevens.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Aanvullende bronnen<\/strong><\/span><\/h3>\n<p> <a href=\"https:\/\/statorials.org\/nl\/z-score-python\/\" target=\"_blank\" rel=\"noopener\">Hoe Z-scores in Python te berekenen<\/a><br \/> <a href=\"https:\/\/statorials.org\/nl\/normaliseer-gegevens-in-python\/\" target=\"_blank\" rel=\"noopener\">Hoe gegevens in Python te normaliseren<\/a><br \/> <a href=\"https:\/\/statorials.org\/nl\/normaliteitshypothese\/\" target=\"_blank\" rel=\"noopener\">Wat is de normaliteitsaanname in de statistiek?<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Veel statistische tests gaan ervan uit dat datasets normaal verdeeld zijn. In de praktijk is dit echter vaak niet het geval. E\u00e9n manier om dit probleem op te lossen is door de verdeling van waarden in een dataset te transformeren met behulp van een van de volgende drie transformaties: 1. Logtransformatie: transformeer de responsvariabele van [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"class_list":["post-2240","post","type-post","status-publish","format-standard","hentry","category-gids"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Gegevens transformeren in Python (logboek, vierkantswortel, kubuswortel) - Statorials<\/title>\n<meta name=\"description\" content=\"In deze tutorial wordt uitgelegd hoe u algemene gegevenstransformaties in Python uitvoert, met verschillende voorbeelden.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Gegevens transformeren in Python (logboek, vierkantswortel, kubuswortel) - Statorials\" \/>\n<meta property=\"og:description\" content=\"In deze tutorial wordt uitgelegd hoe u algemene gegevenstransformaties in Python uitvoert, met verschillende voorbeelden.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-23T03:04:58+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/trans11.png\" \/>\n<meta name=\"author\" content=\"Dr.benjamin anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dr.benjamin anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"3\u00a0Minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/\",\"url\":\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/\",\"name\":\"Gegevens transformeren in Python (logboek, vierkantswortel, kubuswortel) - Statorials\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/nl\/#website\"},\"datePublished\":\"2023-07-23T03:04:58+00:00\",\"dateModified\":\"2023-07-23T03:04:58+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219\"},\"description\":\"In deze tutorial wordt uitgelegd hoe u algemene gegevenstransformaties in Python uitvoert, met verschillende voorbeelden.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Thuis\",\"item\":\"https:\/\/statorials.org\/nl\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Gegevens transformeren in python (logboek, vierkantswortel, kubuswortel)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/nl\/#website\",\"url\":\"https:\/\/statorials.org\/nl\/\",\"name\":\"Statorials\",\"description\":\"Uw gids voor statistische competentie\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/nl\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"de\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219\",\"name\":\"Dr.benjamin anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\/\/statorials.org\/nl\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Dr.benjamin anderson\"},\"description\":\"Ik ben Benjamin, een gepensioneerde hoogleraar statistiek die nu een toegewijde Statorials-lesgever is. Ik heb uitgebreide ervaring en expertise op het gebied van statistiek en ik ben vastbesloten om mijn kennis te delen met studenten via Statorials. Lees verder\",\"sameAs\":[\"http:\/\/statorials.org\/nl\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Gegevens transformeren in Python (logboek, vierkantswortel, kubuswortel) - Statorials","description":"In deze tutorial wordt uitgelegd hoe u algemene gegevenstransformaties in Python uitvoert, met verschillende voorbeelden.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/","og_locale":"de_DE","og_type":"article","og_title":"Gegevens transformeren in Python (logboek, vierkantswortel, kubuswortel) - Statorials","og_description":"In deze tutorial wordt uitgelegd hoe u algemene gegevenstransformaties in Python uitvoert, met verschillende voorbeelden.","og_url":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/","og_site_name":"Statorials","article_published_time":"2023-07-23T03:04:58+00:00","og_image":[{"url":"https:\/\/statorials.org\/wp-content\/uploads\/2023\/08\/trans11.png"}],"author":"Dr.benjamin anderson","twitter_card":"summary_large_image","twitter_misc":{"Verfasst von":"Dr.benjamin anderson","Gesch\u00e4tzte Lesezeit":"3\u00a0Minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/","url":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/","name":"Gegevens transformeren in Python (logboek, vierkantswortel, kubuswortel) - Statorials","isPartOf":{"@id":"https:\/\/statorials.org\/nl\/#website"},"datePublished":"2023-07-23T03:04:58+00:00","dateModified":"2023-07-23T03:04:58+00:00","author":{"@id":"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219"},"description":"In deze tutorial wordt uitgelegd hoe u algemene gegevenstransformaties in Python uitvoert, met verschillende voorbeelden.","breadcrumb":{"@id":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/nl\/gegevens-transformeren-in-python\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Thuis","item":"https:\/\/statorials.org\/nl\/"},{"@type":"ListItem","position":2,"name":"Gegevens transformeren in python (logboek, vierkantswortel, kubuswortel)"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/nl\/#website","url":"https:\/\/statorials.org\/nl\/","name":"Statorials","description":"Uw gids voor statistische competentie","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/nl\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"de"},{"@type":"Person","@id":"https:\/\/statorials.org\/nl\/#\/schema\/person\/d4b8842173cca1bb62cdec41860e4219","name":"Dr.benjamin anderson","image":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/statorials.org\/nl\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/nl\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Dr.benjamin anderson"},"description":"Ik ben Benjamin, een gepensioneerde hoogleraar statistiek die nu een toegewijde Statorials-lesgever is. Ik heb uitgebreide ervaring en expertise op het gebied van statistiek en ik ben vastbesloten om mijn kennis te delen met studenten via Statorials. Lees verder","sameAs":["http:\/\/statorials.org\/nl"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/posts\/2240","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/comments?post=2240"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/posts\/2240\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/media?parent=2240"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/categories?post=2240"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/nl\/wp-json\/wp\/v2\/tags?post=2240"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}