{"id":1116,"date":"2023-07-27T14:52:53","date_gmt":"2023-07-27T14:52:53","guid":{"rendered":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/"},"modified":"2023-07-27T14:52:53","modified_gmt":"2023-07-27T14:52:53","slug":"normalizar-dados-em-r","status":"publish","type":"post","link":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/","title":{"rendered":"How to Standardize Data in R (With Examples)"},"content":{"rendered":"<hr>\n<p><span style=\"color: #000000;\">To <strong>standardize<\/strong> a dataset means to scale all of the values in the dataset so that the mean is 0 and the standard deviation is 1.<\/span><\/p>\n<p> <span style=\"color: #000000;\">The most common way to do this is z-score standardization, which scales each value using the following formula:<\/span><\/p>\n<p> <span style=\"color: #000000;\"><strong>(x <sub>i<\/sub> \u2013 <span style=\"text-decoration: overline;\">x<\/span> ) \/ s<\/strong><\/span><\/p>\n<p> <span style=\"color: #000000;\">where:<\/span><\/p>\n<ul>\n<li> <span style=\"color: #000000;\"><strong>x <sub>i<\/sub><\/strong> : the i<sup>th<\/sup> value in the dataset<\/span><\/li>\n<li> <span style=\"color: #000000;\"><strong><span style=\"text-decoration: overline;\">x<\/span><\/strong> : the sample mean<\/span><\/li>\n<li> <span style=\"color: #000000;\"><strong>s<\/strong> : the sample standard deviation<\/span><\/li>\n<\/ul>\n<p> <span style=\"color: #000000;\">The following examples show how to use the <a href=\"https:\/\/statorials.org\/pt\/funcao-de-escala-em-r\/\" target=\"_blank\" rel=\"noopener\">scale()<\/a> function together with the <strong>dplyr<\/strong> package in R to scale one or more variables in a data frame using z-score standardization.<\/span><\/p>\n<h2> <span style=\"color: #000000;\"><strong>Standardize a Single Variable<\/strong><\/span><\/h2>\n<p> <span 
style=\"color: #000000;\">The following code shows how to scale a single variable in a data frame with three variables:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #993300;\">library<\/span>(dplyr)\n\n<span style=\"color: #008080;\">#make this example reproducible<\/span>\nset.seed(1)\n\n<span style=\"color: #008080;\">#create original data frame\n<\/span>df &lt;- data.frame(var1 = runif(10, 0, 50),\n                 var2 = runif(10, 2, 23),\n                 var3 = runif(10, 5, 38))\n<span style=\"color: #008080;\">\n#view original data frame<\/span>\ndf\n\n        var1      var2      var3\n1  13.275433  6.325466 35.845273\n2  18.606195  5.707692 12.000703\n3  28.642668 16.427480 26.505234\n4  45.410389 10.066178  9.143318\n5  10.084097 18.166670 13.818282\n6  44.919484 12.451684 17.741765\n7  47.233763 17.069989  5.441881\n8  33.039890 22.830028 17.618803\n9  31.455702  9.980739 33.699798\n10  3.089314 18.326350 16.231517\n\n<span style=\"color: #008080;\">#scale <em>var1<\/em> to have mean = 0 and standard deviation = 1\n#(as.vector drops the matrix attributes that scale() returns;\n#mutate_at() still works but is superseded by mutate(across(...)) in dplyr 1.0.0+)<\/span>\ndf2 &lt;- df %&gt;% <span style=\"color: #3366ff;\">mutate_at<\/span>(c('<span style=\"color: #008000;\">var1<\/span>'), ~(<span style=\"color: #3366ff;\">scale<\/span>(.) %&gt;% <span style=\"color: #3366ff;\">as.vector<\/span>))\ndf2\n\n          var1      var2      var3\n1  -0.90606801  6.325466 35.845273\n2  -0.56830963  5.707692 12.000703\n3   0.06760377 16.427480 26.505234\n4   1.13001072 10.066178  9.143318\n5  -1.10827188 18.166670 13.818282\n6   1.09890684 12.451684 17.741765\n7   1.24554014 17.069989  5.441881\n8   0.34621281 22.830028 17.618803\n9   0.24583830  9.980739 33.699798\n10 -1.55146305 18.326350 16.231517\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">Notice that only the first variable was scaled, while the other two variables remained the same. 
We can quickly confirm that the new scaled variable has a mean of 0 and a standard deviation of 1:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#calculate mean of scaled variable<\/span>\nmean(df2$var1)\n\n[1] -4.18502e-18 <span style=\"color: #008080;\">#effectively zero (floating-point rounding error)<\/span>\n\n<span style=\"color: #008080;\">#calculate standard deviation of scaled variable<\/span>\nsd(df2$var1)\n\n[1] 1<\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Standardize Multiple Variables<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following code shows how to scale several variables in a data frame at once:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #993300;\">library<\/span>(dplyr)\n\n<span style=\"color: #008080;\">#make this example reproducible<\/span>\nset.seed(1)\n\n<span style=\"color: #008080;\">#create original data frame\n<\/span>df &lt;- data.frame(var1 = runif(10, 0, 50),\n                 var2 = runif(10, 2, 23),\n                 var3 = runif(10, 5, 38))\n\n<span style=\"color: #008080;\">#scale <em>var1<\/em> and <em>var2<\/em> to have mean = 0 and standard deviation = 1<\/span>\ndf3 &lt;- df %&gt;% <span style=\"color: #3366ff;\">mutate_at<\/span>(c('<span style=\"color: #008000;\">var1<\/span>', '<span style=\"color: #008000;\">var2<\/span>'), ~(<span style=\"color: #3366ff;\">scale<\/span>(.) 
%&gt;% <span style=\"color: #3366ff;\">as.vector<\/span>))\ndf3\n\n          var1       var2      var3\n1  -0.90606801 -1.3045574 35.845273\n2  -0.56830963 -1.4133223 12.000703\n3   0.06760377  0.4739961 26.505234\n4   1.13001072 -0.6459703  9.143318\n5  -1.10827188  0.7801967 13.818282\n6   1.09890684 -0.2259798 17.741765\n7   1.24554014  0.5871157  5.441881\n8   0.34621281  1.6012242 17.618803\n9   0.24583830 -0.6610127 33.699798\n10 -1.55146305  0.8083098 16.231517<\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Standardize All Variables<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following code shows how to scale <em>all<\/em> of the variables in a data frame using the <strong>mutate_all<\/strong> function:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #993300;\">library<\/span>(dplyr)\n\n<span style=\"color: #008080;\">#make this example reproducible<\/span>\nset.seed(1)\n\n<span style=\"color: #008080;\">#create original data frame\n<\/span>df &lt;- data.frame(var1 = runif(10, 0, 50),\n                 var2 = runif(10, 2, 23),\n                 var3 = runif(10, 5, 38))\n\n<span style=\"color: #008080;\">#scale all variables to have mean = 0 and standard deviation = 1\n#(mutate_all() still works but is superseded by mutate(across(everything(), ...)) in dplyr 1.0.0+)<\/span>\ndf4 &lt;- df %&gt;% <span style=\"color: #3366ff;\">mutate_all<\/span>(~(<span style=\"color: #3366ff;\">scale<\/span>(.) 
%&gt;% <span style=\"color: #3366ff;\">as.vector<\/span>))\ndf4\n\n          var1       var2       var3\n1  -0.90606801 -1.3045574  1.6819976\n2  -0.56830963 -1.4133223 -0.6715858\n3   0.06760377  0.4739961  0.7600871\n4   1.13001072 -0.6459703 -0.9536246\n5  -1.10827188  0.7801967 -0.4921813\n6   1.09890684 -0.2259798 -0.1049130\n7   1.24554014  0.5871157 -1.3189757\n8   0.34621281  1.6012242 -0.1170501\n9   0.24583830 -0.6610127  1.4702281\n10 -1.55146305  0.8083098 -0.2539824<\/strong><\/pre>\n<h2> <span style=\"color: #000000;\"><strong>Additional Resources<\/strong><\/span><\/h2>\n<p> <span style=\"color: #000000;\">The following tutorials explain how to perform other common tasks in R:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/pt\/como-normalizar-dados-em-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Normalize Data in R<\/a><br \/> <a href=\"https:\/\/statorials.org\/pt\/desvio-padrao-em-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Calculate Standard Deviation in R<\/a><br \/> <a href=\"https:\/\/statorials.org\/pt\/imputar-valores-ausentes-em-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Impute Missing Values in R<\/a><br \/> <a href=\"https:\/\/statorials.org\/pt\/transformar-dados-em-r\/\" target=\"_blank\" rel=\"noopener noreferrer\">How to Transform Data in R (Log, Square Root, Cube Root)<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>To standardize a dataset means to scale all of the values in the dataset so that the mean is 0 and the standard deviation is 1. 
The most common way to do this is z-score standardization, which scales each value using the following formula: (x i \u2013 x ) \/ [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"class_list":["post-1116","post","type-post","status-publish","format-standard","hentry","category-guia"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Como padronizar dados em R (com exemplos) \u2013 Estatoriais<\/title>\n<meta name=\"description\" content=\"Este tutorial explica como padronizar dados em R, com v\u00e1rios exemplos.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/\" \/>\n<meta property=\"og:locale\" content=\"pt_PT\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Como padronizar dados em R (com exemplos) \u2013 Estatoriais\" \/>\n<meta property=\"og:description\" content=\"Este tutorial explica como padronizar dados em R, com v\u00e1rios exemplos.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-27T14:52:53+00:00\" \/>\n<meta name=\"author\" content=\"Dr. benjamim anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dr. 
benjamim anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tempo estimado de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/\",\"url\":\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/\",\"name\":\"Como padronizar dados em R (com exemplos) \u2013 Estatoriais\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/pt\/#website\"},\"datePublished\":\"2023-07-27T14:52:53+00:00\",\"dateModified\":\"2023-07-27T14:52:53+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666\"},\"description\":\"Este tutorial explica como padronizar dados em R, com v\u00e1rios exemplos.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/#breadcrumb\"},\"inLanguage\":\"pt-PT\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Lar\",\"item\":\"https:\/\/statorials.org\/pt\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Como padronizar dados em r: com exemplos\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/pt\/#website\",\"url\":\"https:\/\/statorials.org\/pt\/\",\"name\":\"Statorials\",\"description\":\"O seu guia para a literacia estat\u00edstica!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/pt\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"pt-PT\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666\",\"name\":\"Dr. benjamim anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\/\/statorials.org\/pt\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Dr. benjamim anderson\"},\"description\":\"Ol\u00e1, sou Benjamin, um professor aposentado de estat\u00edstica que se tornou professor dedicado na Statorials. Com vasta experi\u00eancia e conhecimento na \u00e1rea de estat\u00edstica, estou empenhado em compartilhar meu conhecimento para capacitar os alunos por meio de Statorials. Saber mais\",\"sameAs\":[\"https:\/\/statorials.org\/pt\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Como padronizar dados em R (com exemplos) \u2013 Estatoriais","description":"Este tutorial explica como padronizar dados em R, com v\u00e1rios exemplos.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/","og_locale":"pt_PT","og_type":"article","og_title":"Como padronizar dados em R (com exemplos) \u2013 Estatoriais","og_description":"Este tutorial explica como padronizar dados em R, com v\u00e1rios exemplos.","og_url":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/","og_site_name":"Statorials","article_published_time":"2023-07-27T14:52:53+00:00","author":"Dr. benjamim anderson","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"Dr. 
benjamim anderson","Tempo estimado de leitura":"2 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/","url":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/","name":"Como padronizar dados em R (com exemplos) \u2013 Estatoriais","isPartOf":{"@id":"https:\/\/statorials.org\/pt\/#website"},"datePublished":"2023-07-27T14:52:53+00:00","dateModified":"2023-07-27T14:52:53+00:00","author":{"@id":"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666"},"description":"Este tutorial explica como padronizar dados em R, com v\u00e1rios exemplos.","breadcrumb":{"@id":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/#breadcrumb"},"inLanguage":"pt-PT","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/pt\/normalizar-dados-em-r\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Lar","item":"https:\/\/statorials.org\/pt\/"},{"@type":"ListItem","position":2,"name":"Como padronizar dados em r: com exemplos"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/pt\/#website","url":"https:\/\/statorials.org\/pt\/","name":"Statorials","description":"O seu guia para a literacia estat\u00edstica!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/pt\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"pt-PT"},{"@type":"Person","@id":"https:\/\/statorials.org\/pt\/#\/schema\/person\/e08f98e8db95e0aa9c310e1b27c9c666","name":"Dr. 
benjamim anderson","image":{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/statorials.org\/pt\/#\/schema\/person\/image\/","url":"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"https:\/\/statorials.org\/pt\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Dr. benjamim anderson"},"description":"Ol\u00e1, sou Benjamin, um professor aposentado de estat\u00edstica que se tornou professor dedicado na Statorials. Com vasta experi\u00eancia e conhecimento na \u00e1rea de estat\u00edstica, estou empenhado em compartilhar meu conhecimento para capacitar os alunos por meio de Statorials. Saber mais","sameAs":["https:\/\/statorials.org\/pt"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/posts\/1116","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/comments?post=1116"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/posts\/1116\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/media?parent=1116"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/categories?post=1116"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/pt\/wp-json\/wp\/v2\/tags?post=1116"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}