{"id":2525,"date":"2023-07-21T21:45:15","date_gmt":"2023-07-21T21:45:15","guid":{"rendered":"https:\/\/statorials.org\/id\/panda-membaca-html\/"},"modified":"2023-07-21T21:45:15","modified_gmt":"2023-07-21T21:45:15","slug":"panda-membaca-html","status":"publish","type":"post","link":"https:\/\/statorials.org\/id\/panda-membaca-html\/","title":{"rendered":"How to Read HTML Tables with Pandas (With Example)"},"content":{"rendered":"<p><\/p>\n<hr>\n<p><span style=\"color: #000000;\">You can use the pandas <a href=\"https:\/\/pandas.pydata.org\/docs\/reference\/api\/pandas.read_html.html\" target=\"_blank\" rel=\"noopener\">read_html()<\/a> function to read HTML tables into pandas DataFrames. It returns a list with one DataFrame per table it reads.<\/span><\/p>\n<p> <span style=\"color: #000000;\">This function uses the following basic syntax:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong>df = pd.<span style=\"color: #3366ff;\">read_html<\/span>('<span style=\"color: #ff0000;\">https:\/\/en.wikipedia.org\/wiki\/National_Basketball_Association<\/span>')\n<\/strong><\/pre>\n<p> <span style=\"color: #000000;\">The following example shows how to use this function to read a table of NBA team names from <a href=\"https:\/\/en.wikipedia.org\/wiki\/National_Basketball_Association\" target=\"_blank\" rel=\"noopener\">this Wikipedia page<\/a>.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Example: Read HTML Tables with Pandas<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">Before using the <strong>read_html()<\/strong> function, you may need to install lxml:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong>pip <span style=\"color: #008000;\">install<\/span> lxml<\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\"><strong>Note<\/strong>: If you are using a Jupyter notebook, you need to restart the kernel after performing this installation.<\/span><\/p>\n<p> <span 
style=\"color: #000000;\">Next, we can use the <strong>read_html()<\/strong> function to read every HTML table on <a href=\"https:\/\/en.wikipedia.org\/wiki\/National_Basketball_Association\" target=\"_blank\" rel=\"noopener\">this Wikipedia page<\/a>:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008000;\">import<\/span> pandas <span style=\"color: #008000;\">as<\/span> pd\n<span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008000;\">import<\/span> numpy <span style=\"color: #008000;\">as<\/span> np\n<span style=\"color: #008000;\">import<\/span> matplotlib.<span style=\"color: #3366ff;\">pyplot<\/span> <span style=\"color: #008000;\">as<\/span> plt\n<span style=\"color: #008000;\">from<\/span> unicodedata <span style=\"color: #008000;\">import<\/span> normalize\n\n<span style=\"color: #008080;\">#read all HTML tables from specific URL\n<\/span>tabs = pd.<span style=\"color: #3366ff;\">read_html<\/span>('<span style=\"color: #ff0000;\">https:\/\/en.wikipedia.org\/wiki\/National_Basketball_Association<\/span>')\n\n<span style=\"color: #008080;\">#display total number of tables read\n<\/span><span style=\"color: #008000;\">len<\/span>(tabs)\n\n44<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">We can see that a total of 44 HTML tables were found on this page.<\/span><\/p>\n<p> <span style=\"color: #000000;\">I know that the table I'm interested in contains the word &#8220;Division&#8221;, so I can use the <strong>match<\/strong> argument to retrieve only the HTML tables that contain this word:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\"><span style=\"color: #000000;\"><span style=\"color: #008080;\">#read HTML tables from specific URL with the word \"Division\" in them\n<\/span>tabs = pd. 
<span style=\"color: #3366ff;\">read_html<\/span>('<span style=\"color: #ff0000;\">https:\/\/en.wikipedia.org\/wiki\/National_Basketball_Association<\/span>',\n                    match='<span style=\"color: #ff0000;\">Division<\/span>')\n\n<span style=\"color: #008080;\">#display total number of tables read\n<\/span><span style=\"color: #008000;\">len<\/span>(tabs)\n\n1<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">I can then <a href=\"https:\/\/statorials.org\/id\/nama-kolom-daftar-panda\/\" target=\"_blank\" rel=\"noopener\">list the column names<\/a> of the table:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <strong><span style=\"color: #008080;\">#define table\n<span style=\"color: #000000;\">df = tabs[0]<\/span>\n\n#list all column names of table\n<span style=\"color: #000000;\"><span style=\"color: #008000;\">list<\/span>(df)\n\n[('Division', 'Eastern Conference'),\n ('Team', 'Eastern Conference'),\n ('Location', 'Eastern Conference'),\n ('Arena', 'Eastern Conference'),\n ('Capacity', 'Eastern Conference'),\n ('Coordinates', 'Eastern Conference'),\n ('Founded', 'Eastern Conference'),\n ('Joined', 'Eastern Conference'),\n ('Unnamed: 8_level_0', 'Eastern Conference')]<\/span><\/span><\/strong><\/pre>\n<p> <span style=\"color: #000000;\">I'm only interested in the first two columns, so I can <a href=\"https:\/\/statorials.org\/id\/panda-loc-vs-iloc\/\" target=\"_blank\" rel=\"noopener\">filter<\/a> the DataFrame to contain only these columns:<\/span><\/p>\n<pre style=\"background-color: #ececec; font-size: 15px;\"> <span style=\"color: #000000;\"><strong><span style=\"color: #008080;\">#filter DataFrame to only contain first two columns\n<\/span>df_final = df.<span style=\"color: #3366ff;\">iloc<\/span>[:, 0:2]\n\n<span style=\"color: #008080;\">#rename columns\n<\/span>df_final. 
<span style=\"color: #3366ff;\">columns<\/span> = ['<span style=\"color: #ff0000;\">Division<\/span>', '<span style=\"color: #ff0000;\">Team<\/span>']\n\n<span style=\"color: #008080;\">#view first few rows of final DataFrame\n<\/span><span style=\"color: #008000;\">print<\/span>(<span style=\"color: #3366ff;\">df_final.head<\/span>())\n\n   Division                Team\n0  Atlantic      Boston Celtics\n1  Atlantic       Brooklyn Nets\n2  Atlantic     New York Knicks\n3  Atlantic  Philadelphia 76ers\n4  Atlantic     Toronto Raptors\n<\/strong><\/span><\/pre>\n<p> <span style=\"color: #000000;\">The final table contains only the \u201cDivision\u201d and \u201cTeam\u201d columns.<\/span><\/p>\n<h3> <span style=\"color: #000000;\"><strong>Additional Resources<\/strong><\/span><\/h3>\n<p> <span style=\"color: #000000;\">The following tutorials explain how to read other types of files in pandas:<\/span><\/p>\n<p> <a href=\"https:\/\/statorials.org\/id\/panda-membaca-file-teks\/\" target=\"_blank\" rel=\"noopener\">How to Read Text Files with Pandas<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/panda-membaca-excel\/\" target=\"_blank\" rel=\"noopener\">How to Read Excel Files with Pandas<\/a><br \/> <a href=\"https:\/\/statorials.org\/id\/panda-membaca-csv\/\" target=\"_blank\" rel=\"noopener\">How to Read CSV Files with Pandas<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>You can use the pandas read_html() function to read HTML tables into pandas DataFrames. This function uses the following basic syntax: df = pd.read_html('https:\/\/en.wikipedia.org\/wiki\/National_Basketball_Association') The following example shows how to use this function to read a table of NBA team names from this Wikipedia page. 
Example: Read HTML Tables with Pandas Before using the read_html() function, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>How to Read HTML Tables with Pandas (With Example) - Statorials<\/title>\n<meta name=\"description\" content=\"This tutorial explains how to read HTML tables with pandas, including an example.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/statorials.org\/id\/panda-membaca-html\/\" \/>\n<meta property=\"og:locale\" content=\"id_ID\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to Read HTML Tables with Pandas (With Example) - Statorials\" \/>\n<meta property=\"og:description\" content=\"This tutorial explains how to read HTML tables with pandas, including an example.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/statorials.org\/id\/panda-membaca-html\/\" \/>\n<meta property=\"og:site_name\" content=\"Statorials\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-21T21:45:15+00:00\" \/>\n<meta name=\"author\" content=\"Benjamin Anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin Anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" 
class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/statorials.org\/id\/panda-membaca-html\/\",\"url\":\"https:\/\/statorials.org\/id\/panda-membaca-html\/\",\"name\":\"How to Read HTML Tables with Pandas (With Example) - Statorials\",\"isPartOf\":{\"@id\":\"https:\/\/statorials.org\/id\/#website\"},\"datePublished\":\"2023-07-21T21:45:15+00:00\",\"dateModified\":\"2023-07-21T21:45:15+00:00\",\"author\":{\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\"},\"description\":\"This tutorial explains how to read HTML tables with pandas, including an example.\",\"breadcrumb\":{\"@id\":\"https:\/\/statorials.org\/id\/panda-membaca-html\/#breadcrumb\"},\"inLanguage\":\"id\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/statorials.org\/id\/panda-membaca-html\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/statorials.org\/id\/panda-membaca-html\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/statorials.org\/id\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to Read HTML Tables with Pandas (With Example)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/statorials.org\/id\/#website\",\"url\":\"https:\/\/statorials.org\/id\/\",\"name\":\"Statorials\",\"description\":\"Your guide to statistical competence!\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/statorials.org\/id\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"id\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81\",\"name\":\"Benjamin 
Anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"id\",\"@id\":\"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"contentUrl\":\"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg\",\"caption\":\"Benjamin Anderson\"},\"description\":\"Hello, I'm Benjamin, a retired statistics professor turned dedicated Statorials instructor. With extensive experience and expertise in statistics, I want to share my knowledge to empower students through Statorials. Read more\",\"sameAs\":[\"http:\/\/statorials.org\/id\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How to Read HTML Tables with Pandas (With Example) - Statorials","description":"This tutorial explains how to read HTML tables with pandas, including an example.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/statorials.org\/id\/panda-membaca-html\/","og_locale":"id_ID","og_type":"article","og_title":"How to Read HTML Tables with Pandas (With Example) - Statorials","og_description":"This tutorial explains how to read HTML tables with pandas, including an example.","og_url":"https:\/\/statorials.org\/id\/panda-membaca-html\/","og_site_name":"Statorials","article_published_time":"2023-07-21T21:45:15+00:00","author":"Benjamin Anderson","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Benjamin Anderson","Estimated reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/statorials.org\/id\/panda-membaca-html\/","url":"https:\/\/statorials.org\/id\/panda-membaca-html\/","name":"How to Read HTML Tables with Pandas (With Example) - 
Statorials","isPartOf":{"@id":"https:\/\/statorials.org\/id\/#website"},"datePublished":"2023-07-21T21:45:15+00:00","dateModified":"2023-07-21T21:45:15+00:00","author":{"@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81"},"description":"This tutorial explains how to read HTML tables with pandas, including an example.","breadcrumb":{"@id":"https:\/\/statorials.org\/id\/panda-membaca-html\/#breadcrumb"},"inLanguage":"id","potentialAction":[{"@type":"ReadAction","target":["https:\/\/statorials.org\/id\/panda-membaca-html\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/statorials.org\/id\/panda-membaca-html\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/statorials.org\/id\/"},{"@type":"ListItem","position":2,"name":"How to Read HTML Tables with Pandas (With Example)"}]},{"@type":"WebSite","@id":"https:\/\/statorials.org\/id\/#website","url":"https:\/\/statorials.org\/id\/","name":"Statorials","description":"Your guide to statistical competence!","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/statorials.org\/id\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"id"},{"@type":"Person","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/3d17a1160dd2d052b7c78e502cb9ec81","name":"Benjamin Anderson","image":{"@type":"ImageObject","inLanguage":"id","@id":"https:\/\/statorials.org\/id\/#\/schema\/person\/image\/","url":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","contentUrl":"http:\/\/statorials.org\/id\/wp-content\/uploads\/2023\/10\/Dr.-Benjamin-Anderson-96x96.jpg","caption":"Benjamin Anderson"},"description":"Hello, I'm Benjamin, a retired statistics professor turned dedicated Statorials instructor. 
With extensive experience and expertise in statistics, I want to share my knowledge to empower students through Statorials. Read more","sameAs":["http:\/\/statorials.org\/id"]}]}},"yoast_meta":{"yoast_wpseo_title":"","yoast_wpseo_metadesc":"","yoast_wpseo_canonical":""},"_links":{"self":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2525"}],"collection":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/comments?post=2525"}],"version-history":[{"count":0,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/posts\/2525\/revisions"}],"wp:attachment":[{"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/media?parent=2525"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/categories?post=2525"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statorials.org\/id\/wp-json\/wp\/v2\/tags?post=2525"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}