{"id":1115,"date":"2023-07-27T14:52:53","date_gmt":"2023-07-27T14:52:53","guid":{"rendered":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/"},"modified":"2023-07-27T14:52:53","modified_gmt":"2023-07-27T14:52:53","slug":"normalizuj-dane-w-r","status":"publish","type":"post","link":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/","title":{"rendered":"How to Standardize Data in R (With Examples)"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\"><strong>Standardizing<\/strong> a dataset means scaling all of its values so that the mean is 0 and the standard deviation is 1.<\/span><\/p>\n<p> <span style=\"color: #000000;\">The most common way to do this is z-score standardization, which scales each value using the following formula:<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>(x<sub>i<\/sub> \u2013 <span style=\"text-decoration: overline;\">x<\/span>) \/ s<\/strong><\/span><\/p>\n<p> <span style=\"color: #000000;\">where:<\/span><\/p>\n<ul>\n<li> <span style=\"color: #000000;\"><strong>x<sub>i<\/sub><\/strong>: the i<sup>th<\/sup> value in the dataset<\/span><\/li>\n<li> <span style=\"color: #000000;\"><strong><span style=\"text-decoration: overline;\">x<\/span><\/strong>: the sample mean<\/span><\/li>\n<li> <span style=\"color: #000000;\"><strong>s<\/strong>: the sample standard deviation<\/span><\/li>\n<\/ul>\n<p> <span style=\"color: #000000;\">The following examples show how to use the <a href=\"https:\/\/statorials.org\/pl\/funkcja-skali-w-r\/\" target=\"_blank\" rel=\"noopener\">scale() function<\/a> together with the <strong>dplyr<\/strong> package in R to scale one or more variables in a data frame using z-score standardization.<\/span><\/p>\n<h2> <span style=\"color: 
#000000;\"><strong>Standardize a Single Variable<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following code shows how to scale a single variable in a data frame that contains three variables:<br \/><\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #993300;\">library<\/span> (dplyr)\n\n<span style=\"color: #008080;\">#make this example reproducible<\/span> \nset.seed(1)\n\n<span style=\"color: #008080;\">#create original data frame\n<\/span>df &lt;- data.frame(var1= runif(10, 0, 50), \n                 var2= runif(10, 2, 23),\n                 var3= runif(10, 5, 38))\n<span style=\"color: #008080;\">\n#view original data frame<\/span>\ndf\n\n        var1      var2      var3\n1  13.275433  6.325466 35.845273\n2  18.606195  5.707692 12.000703\n3  28.642668 16.427480 26.505234\n4  45.410389 10.066178  9.143318\n5  10.084097 18.166670 13.818282\n6  44.919484 12.451684 17.741765\n7  47.233763 17.069989  5.441881\n8  33.039890 22.830028 17.618803\n9  31.455702  9.980739 33.699798\n10  3.089314 18.326350 16.231517\n\n<span style=\"color: #008080;\">#scale <em>var1<\/em> to have mean = 0 and standard deviation = 1<\/span>\ndf2 &lt;- df %&gt;% <span style=\"color: #3366ff;\">mutate_at<\/span> (c('<span style=\"color: #008000;\">var1<\/span>'), ~( <span style=\"color: #3366ff;\">scale<\/span> (.) 
%&gt;% <span style=\"color: #3366ff;\">as.vector<\/span> ))\ndf2\n\n          var1      var2      var3\n1  -0.90606801  6.325466 35.845273\n2  -0.56830963  5.707692 12.000703\n3   0.06760377 16.427480 26.505234\n4   1.13001072 10.066178  9.143318\n5  -1.10827188 18.166670 13.818282\n6   1.09890684 12.451684 17.741765\n7   1.24554014 17.069989  5.441881\n8   0.34621281 22.830028 17.618803\n9   0.24583830  9.980739 33.699798\n10 -1.55146305 18.326350 16.231517\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Note that only the first variable was scaled, while the other two variables were left unchanged. We can quickly confirm that the new scaled variable has a mean of 0 and a standard deviation of 1:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#calculate mean of scaled variable<\/span>\nmean(df2$var1)\n\n[1] -4.18502e-18 <span style=\"color: #008080;\">#basically zero<\/span>\n\n<span style=\"color: #008080;\">#calculate standard deviation of scaled variable<\/span> \nsd(df2$var1)\n\n[1] 1<\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Standardize Multiple Variables<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following code shows how to scale multiple variables in a data frame at once:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #993300;\">library<\/span> (dplyr)\n\n<span style=\"color: #008080;\">#make this example reproducible<\/span> \nset.seed(1)\n\n<span style=\"color: #008080;\">#create original data frame\n<\/span>df &lt;- data.frame(var1= runif(10, 0, 50), \n                 var2= runif(10, 2, 23),\n                 var3= runif(10, 5, 38))\n\n<span style=\"color: #008080;\">#scale <em>var1<\/em> and <em>var2<\/em> to have mean = 0 and standard deviation = 1<\/span>\ndf3 &lt;- df %&gt;% <span style=\"color: 
#3366ff;\">mutate_at<\/span> (c('<span style=\"color: #008000;\">var1<\/span>', '<span style=\"color: #008000;\">var2<\/span>'), ~( <span style=\"color: #3366ff;\">scale<\/span> (.) %&gt;% <span style=\"color: #3366ff;\">as.vector<\/span> ))\ndf3\n\n          var1       var2      var3\n1  -0.90606801 -1.3045574 35.845273\n2  -0.56830963 -1.4133223 12.000703\n3   0.06760377  0.4739961 26.505234\n4   1.13001072 -0.6459703  9.143318\n5  -1.10827188  0.7801967 13.818282\n6   1.09890684 -0.2259798 17.741765\n7   1.24554014  0.5871157  5.441881\n8   0.34621281  1.6012242 17.618803\n9   0.24583830 -0.6610127 33.699798\n10 -1.55146305  0.8083098 16.231517<\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Standardize All Variables<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following code shows how to scale <em>all<\/em> variables in a data frame using the <strong>mutate_all<\/strong> function:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #993300;\">library<\/span> (dplyr)\n\n<span style=\"color: #008080;\">#make this example reproducible<\/span> \nset.seed(1)\n\n<span style=\"color: #008080;\">#create original data frame\n<\/span>df &lt;- data.frame(var1= runif(10, 0, 50), \n                 var2= runif(10, 2, 23),\n                 var3= runif(10, 5, 38))\n\n<span style=\"color: #008080;\">#scale all variables to have mean = 0 and standard deviation = 1<\/span>\ndf4 &lt;- df %&gt;% <span style=\"color: #3366ff;\">mutate_all<\/span> (~( <span style=\"color: #3366ff;\">scale<\/span> (.) 
%&gt;% <span style=\"color: #3366ff;\">as.vector<\/span> ))\ndf4\n\n          var1       var2       var3\n1  -0.90606801 -1.3045574  1.6819976\n2  -0.56830963 -1.4133223 -0.6715858\n3   0.06760377  0.4739961  0.7600871\n4   1.13001072 -0.6459703 -0.9536246\n5  -1.10827188  0.7801967 -0.4921813\n6   1.09890684 -0.2259798 -0.1049130\n7   1.24554014  0.5871157 -1.3189757\n8   0.34621281  1.6012242 -0.1170501\n9   0.24583830 -0.6610127  1.4702281\n10 -1.55146305  0.8083098 -0.2539824<\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Additional Resources<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following tutorials explain how to perform other common tasks in R:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/pl\/jak-normalizowac-dane-w-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Normalize Data in R<\/a><br \/> <a href=\"https:\/\/statorials.org\/pl\/odchylenie-standardowe-w-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Calculate Standard Deviation in R<\/a><br \/> <a href=\"https:\/\/statorials.org\/pl\/przypisz-brakujace-wartosci-w-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Impute Missing Values in R<\/a><br \/> <a href=\"https:\/\/statorials.org\/pl\/przekszta\u0142cic-dane-w-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Transform Data in R (Log, Square Root, Cube Root)<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Standardizing a dataset means scaling all of its values so that the mean is 0 and the standard deviation is 1. 
The most common way to do this is z-score standardization, which scales each value using the following formula: (x i \u2013 x ) \/ s where: x i : the i-th value in the dataset x [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[],"class_list":["post-1115","post","type-post","status-publish","format-standard","hentry","category-przewodnik"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Jak standaryzowa\u0107 dane w R (z przyk\u0142adami) \u2013 Statorials<\/title>\n<meta name=\"description\" content=\"W tym samouczku wyja\u015bniono, jak standaryzowa\u0107 dane w j\u0119zyku R, podaj\u0105c kilka przyk\u0142ad\u00f3w.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/\" \/>\n<meta property=\"og:locale\" content=\"pl_PL\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Jak standaryzowa\u0107 dane w R (z przyk\u0142adami) \u2013 Statorials\" \/>\n<meta property=\"og:description\" content=\"W tym samouczku wyja\u015bniono, jak standaryzowa\u0107 dane w j\u0119zyku R, podaj\u0105c kilka przyk\u0142ad\u00f3w.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-27T14:52:53+00:00\" \/>\n<meta name=\"author\" content=\"Benjamin Anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Napisane przez\" \/>\n\t<meta 
name=\"twitter:data1\" content=\"Benjamin Anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Szacowany czas czytania\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minuty\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/\",\"url\":\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/\",\"name\":\"Jak standaryzowa\u0107 dane w R (z przyk\u0142adami) \u2013 Statorials\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/pl\/#website\"},\"datePublished\":\"2023-07-27T14:52:53+00:00\",\"dateModified\":\"2023-07-27T14:52:53+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965\"},\"description\":\"W tym samouczku wyja\u015bniono, jak standaryzowa\u0107 dane w j\u0119zyku R, podaj\u0105c kilka przyk\u0142ad\u00f3w.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/#breadcrumb\"},\"inLanguage\":\"pl-PL\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Dom\",\"item\":\"https:\/\/statorials.org\/pl\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Jak standaryzowa\u0107 dane w r: z przyk\u0142adami\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/pl\/#website\",\"url\":\"https:\/\/statorials.org\/pl\/\",\"name\":\"Statorials\",\"description\":\"Tw\u00f3j przewodnik po kompetencjach statystycznych!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/pl\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"pl-PL\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965\",\"name\":\"Benjamin Anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pl-PL\",\"@id\":\"https:\/\/statorials.org\/pl\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin Anderson\"},\"description\":\"Cze\u015b\u0107, jestem Benjamin i jestem emerytowanym profesorem statystyki, kt\u00f3ry zosta\u0142 oddanym nauczycielem Statorials. Dzi\u0119ki bogatemu do\u015bwiadczeniu i wiedzy specjalistycznej w dziedzinie statystyki ch\u0119tnie dziel\u0119 si\u0119 swoj\u0105 wiedz\u0105, aby wzmocni\u0107 pozycj\u0119 uczni\u00f3w za po\u015brednictwem Statorials. Wiedzie\u0107 wi\u0119cej\",\"sameAs\":[\"https:\/\/statorials.org\/pl\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Jak standaryzowa\u0107 dane w R (z przyk\u0142adami) \u2013 Statorials","description":"W tym samouczku wyja\u015bniono, jak standaryzowa\u0107 dane w j\u0119zyku R, podaj\u0105c kilka przyk\u0142ad\u00f3w.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/","og_locale":"pl_PL","og_type":"article","og_title":"Jak standaryzowa\u0107 dane w R (z przyk\u0142adami) \u2013 Statorials","og_description":"W tym samouczku wyja\u015bniono, jak standaryzowa\u0107 dane w j\u0119zyku R, podaj\u0105c kilka przyk\u0142ad\u00f3w.","og_url":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/","og_site_name":"Statorials","article_published_time":"2023-07-27T14:52:53+00:00","author":"Benjamin Anderson","twitter_card":"summary_large_image","twitter_misc":{"Napisane przez":"Benjamin Anderson","Szacowany czas czytania":"2 minuty"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/","url":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/","name":"Jak standaryzowa\u0107 dane w R (z przyk\u0142adami) \u2013 Statorials","isPartOf":{"@id":"https:\/\/statorials.org\/pl\/#website"},"datePublished":"2023-07-27T14:52:53+00:00","dateModified":"2023-07-27T14:52:53+00:00","author":{"@id":"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965"},"description":"W tym samouczku wyja\u015bniono, jak standaryzowa\u0107 dane w j\u0119zyku R, podaj\u0105c kilka 
przyk\u0142ad\u00f3w.","breadcrumb":{"@id":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/#breadcrumb"},"inLanguage":"pl-PL","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/pl\/normalizuj-dane-w-r\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Dom","item":"https:\/\/statorials.org\/pl\/"},{"@type":"ListItem","position":2,"name":"Jak standaryzowa\u0107 dane w r: z przyk\u0142adami"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/pl\/#website","url":"https:\/\/statorials.org\/pl\/","name":"Statorials","description":"Tw\u00f3j przewodnik po kompetencjach statystycznych!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/pl\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"pl-PL"},{"@type":"Person","@id":"https:\/\/statorials.org\/pl\/#\/schema\/person\/6484727a4612df3e69f016c3129c6965","name":"Benjamin Anderson","image":{"@type":"ImageObject","inLanguage":"pl-PL","@id":"https:\/\/statorials.org\/pl\/#\/schema\/person\/image\/","url":"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg","contentUrl":"https:\/\/statorials.org\/pl\/wp-content\/uploads\/2023\/11\/Benjamin-Anderson-96x96.jpg","caption":"Benjamin Anderson"},"description":"Cze\u015b\u0107, jestem Benjamin i jestem emerytowanym profesorem statystyki, kt\u00f3ry zosta\u0142 oddanym nauczycielem Statorials. Dzi\u0119ki bogatemu do\u015bwiadczeniu i wiedzy specjalistycznej w dziedzinie statystyki ch\u0119tnie dziel\u0119 si\u0119 swoj\u0105 wiedz\u0105, aby wzmocni\u0107 pozycj\u0119 uczni\u00f3w za po\u015brednictwem Statorials. 
Wiedzie\u0107 wi\u0119cej","sameAs":["https:\/\/statorials.org\/pl"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/posts\/1115","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/comments?post=1115"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/posts\/1115\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/media?parent=1115"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/categories?post=1115"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/pl\/wp-json\/wp\/v2\/tags?post=1115"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}