{"id":11338,"date":"2024-02-14T09:17:09","date_gmt":"2024-02-14T00:17:09","guid":{"rendered":"https:\/\/www.ecomottblog.com\/?p=11338"},"modified":"2024-02-14T10:19:23","modified_gmt":"2024-02-14T01:19:23","slug":"glue-etl-jobs-%e3%81%a7csv%e3%82%92%e5%89%8d%e5%87%a6%e7%90%86%e3%81%97%e3%81%a6parquet%e3%81%ab%e5%a4%89%e6%8f%9b%e3%81%99%e3%82%8b","status":"publish","type":"post","link":"https:\/\/www.ecomottblog.com\/?p=11338","title":{"rendered":"Glue ETL jobs \u3067CSV\u3092\u524d\u51e6\u7406\u3057\u3066Parquet\u306b\u5909\u63db\u3059\u308b"},"content":{"rendered":"<p>\u3053\u3093\u306b\u3061\u306f\u3001\u57fa\u76e4\u6574\u5099\u30b0\u30eb\u30fc\u30d7\u306e\u4e0a\u897f\u3067\u3059\u3002<\/p>\n<p>AWS\u306eGlue Crawlers\u3067\u30af\u30ed\u30fc\u30e9\u30fc\u3092\u5b9f\u884c\u3059\u308b\u969b\u306bCSV\u30d5\u30a1\u30a4\u30eb\u306e\u307e\u307e\u3060\u3068\u3001date\u578b\u306b\u3057\u305f\u3044\u30c7\u30fc\u30bf\u304cstring\u30c7\u30fc\u30bf\u306e\u307e\u307e\u3067\u3042\u3063\u305f\u308a\u3001\u30c0\u30d6\u30eb\u30af\u30a9\u30fc\u30c6\u30fc\u30b7\u30e7\u30f3\u3067\u56f2\u3063\u305f\u6587\u5b57\u5217\u304c\u30b9\u30da\u30fc\u30b9\u3067\u5207\u308c\u3066\u96a3\u306e\u30ab\u30e9\u30e0\u306e\u30c7\u30fc\u30bf\u3092\u4e0a\u66f8\u304d\u3057\u3066\u3057\u307e\u3046\u306a\u3069\u306e\u4e0d\u90fd\u5408\u304c\u751f\u3058\u308b\u3053\u3068\u304c\u3042\u308b\u305f\u3081\u3001CSV\u30d5\u30a1\u30a4\u30eb\u3092Parquet\u30d5\u30a1\u30a4\u30eb\u3068\u3057\u3066\u51fa\u529b\u3057\u305f\u3044\u3053\u3068\u304c\u3042\u308a\u307e\u3057\u305f\u3002<\/p>\n<p>Parquet\u30d5\u30a1\u30a4\u30eb\u306fCSV\u30d5\u30a1\u30a4\u30eb\u306b\u6bd4\u3079\u3066\u5bb9\u91cf\u304c\u5c0f\u3055\u304f\u30ab\u30e9\u30e0\u6bce\u306b\u578b\u3092\u5b9a\u7fa9\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u308b\u306e\u3067\u3001Glue\u30c7\u30fc\u30bf\u30ab\u30bf\u30ed\u30b0\u3067\u30c7\u30fc\u30bf\u30d9\u30fc\u30b9\u3068\u3057\u3066\u6271\u3044\u3084\u3059\u3044\u7279\u5fb4\u304c\u3042\u308a\u307e\u3059\u3002<\/p>\n<p>\u30af\u30ed\u30fc\u30e9\u30fc\u5b9f\u884c\u5f8c\u306bGlue\u306e\u30c7\u30fc\u30bf\u30ab\u30bf\u30ed\u30b0\u304b\u3089\u30b9\u30ad\u30fc\u30de\u306e\u7de8\u96c6\u3067date\u578b\u306b\u30ad\u30e3\u30b9\u30c8\u3059\u308b\u3068\u3044\u3046\u65b9\u6cd5\u3082\u3042\u308a\u307e\u3059\u304c\u3001\u5f8c\u8ff0\u3059\u308b\u4eca\u56de\u306e\u8981\u4ef6\u306b\u9069\u5408\u3057\u306a\u3044\u305f\u3081Glue ETL jobs\u3092\u4f7f\u7528\u3059\u308b\u65b9\u6cd5\u306b\u3057\u307e\u3057\u305f\u3002<\/p>\n<p><!--more--><\/p>\n<h2>\u554f\u984c\u306e\u78ba\u8a8d<\/h2>\n<p>S3\u306b\u4fdd\u5b58\u3057\u3066\u3044\u308bCSV\u306e\u30d5\u30a1\u30a4\u30eb\u3060\u304c\u3053\u3093\u306a&#8221;MM\/dd\/yyyy&#8221;\u306e\u5f62\u5f0f\u306e\u305f\u3081\u3001\u3053\u306e\u307e\u307e\u30af\u30ed\u30fc\u30e9\u30fc\u3092\u5b9f\u884c\u3059\u308b\u3068String\u578b\u306e\u307e\u307e\u3067\u3042\u308a\u3001\u30c7\u30fc\u30bf\u30ab\u30bf\u30ed\u30b0\u304b\u3089\u30b9\u30ad\u30fc\u30de\u3092\u7de8\u96c6\u3057\u3066date\u578b\u306b\u3057\u3066\u3082\u30ad\u30e3\u30b9\u30c8\u3067\u304d\u305a\u306b\u30c7\u30fc\u30bf\u304cnull\u306b\u306a\u3063\u3066\u3057\u307e\u3046\u3002<\/p>\n<table width=\"133\">\n<tbody>\n<tr>\n<td width=\"133\">servicePeriodStartDate<\/td>\n<\/tr>\n<tr>\n<td>11\/21\/2023<\/td>\n<\/tr>\n<tr>\n<td>11\/21\/2023<\/td>\n<\/tr>\n<tr>\n<td>11\/21\/2023<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Glue ETL jobs \u306e\u4f5c\u6210<\/h2>\n<p>\u4e0a\u8a18\u306e\u554f\u984c\u306e\u89e3\u6c7a\u306e\u305f\u3081\u3001Glue ETL jobs\u3092\u4f7f\u7528\u3057\u3066\u30c7\u30fc\u30bf\u306b\u5bfe\u3057\u3066\u524d\u51e6\u7406\u3092\u884c\u3044date\u578b\u306b\u30ad\u30e3\u30b9\u30c8\u3057\u307e\u3059\u3002<\/p>\n<p>ETL jobs\u306evisual ETL\u3092\u4f7f\u7528\u3059\u308b\u3068\u3001\u8996\u899a\u7684\u306bETL\u3092\u7d44\u307f\u7acb\u3066\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002\u4eca\u56de\u306fSources\u306bCSV\u30c7\u30fc\u30bf\u304c\u4fdd\u5b58\u3055\u308c\u3066\u3044\u308bS3\u30d0\u30b1\u30c3\u30c8\u3092\u9078\u629e\u3057\u3001Transforms\u306b\u306fChange Schema\u3092\u9078\u629e\u3057\u3066\u3001Targets\u306b\u306fS3\u3092\u9078\u629e\u3057\u307e\u3057\u305f\u3002\u3053\u308c\u306b\u3088\u308aS3\u304b\u3089CSV\u3092\u53d6\u308a\u51fa\u3057\u3066\u3066\u30b9\u30ad\u30fc\u30de\u306e\u7de8\u96c6\u3092\u3057\u3066\u3001CSV\u304b\u3089\u5909\u63db\u3057\u305fParquet\u30d5\u30a1\u30a4\u30eb\u3092S3\u306b\u4fdd\u5b58\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002<\/p>\n<p>\u203bGlue\u304b\u3089S3\u3092\u53c2\u7167\u3055\u305b\u308b\u305f\u3081\u306b\u4ee5\u4e0b\u306e\u69d8\u306a\u30dd\u30ea\u30b7\u30fc\u304c\u4ed8\u4e0e\u3055\u308c\u305fIAM\u30ed\u30fc\u30eb\u3092\u7528\u610f\u3059\u308b\u5fc5\u8981\u304c\u3042\u308a\u307e\u3059\u3002<\/p>\n<pre class=\"lang:default decode:true\">{\n    \"Version\": \"2012-10-17\",\n    \"Statement\": [\n        {\n            \"Sid\": \"VisualEditor0\",\n            \"Effect\": \"Allow\",\n            \"Action\": [\n                \"s3:PutObject\",\n                \"s3:GetObject\"\n            ],\n            \"Resource\": \"\u5bfe\u8c61\u306eS3\u30d0\u30b1\u30c3\u30c8\"\n        }\n    ]\n}<\/pre>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-medium wp-image-11345\" src=\"https:\/\/www.ecomottblog.com\/wp-content\/uploads\/2023\/12\/\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8-2023-12-12-154528-300x246.png\" alt=\"\" width=\"300\" height=\"246\" srcset=\"https:\/\/www.ecomottblog.com\/wp-content\/uploads\/2023\/12\/\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8-2023-12-12-154528-300x246.png 300w, https:\/\/www.ecomottblog.com\/wp-content\/uploads\/2023\/12\/\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8-2023-12-12-154528-768x630.png 768w, https:\/\/www.ecomottblog.com\/wp-content\/uploads\/2023\/12\/\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8-2023-12-12-154528-261x214.png 261w, https:\/\/www.ecomottblog.com\/wp-content\/uploads\/2023\/12\/\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8-2023-12-12-154528-282x231.png 282w, https:\/\/www.ecomottblog.com\/wp-content\/uploads\/2023\/12\/\u30b9\u30af\u30ea\u30fc\u30f3\u30b7\u30e7\u30c3\u30c8-2023-12-12-154528.png 966w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>ETL jobs\u306e\u8a73\u7d30\u8a2d\u5b9a\u306f\u7701\u7565\u3057\u307e\u3059\u304cvisual\u3092\u7d44\u307f\u7acb\u3066\u308b\u3068\u4ee5\u4e0b\u306e\u69d8\u306ascript\u304c\u751f\u6210\u3055\u308c\u307e\u3059\u3002<\/p>\n<pre class=\"lang:default decode:true\">import sys\nfrom awsglue.transforms import *\nfrom awsglue.utils import getResolvedOptions\nfrom pyspark.context import SparkContext\nfrom awsglue.context import GlueContext\nfrom awsglue.job import Job\n\nargs = getResolvedOptions(sys.argv, [\"JOB_NAME\"])\nsc = SparkContext()\nglueContext = GlueContext(sc)\nspark = glueContext.spark_session\njob = Job(glueContext)\njob.init(args[\"JOB_NAME\"], args)\n\n# Script generated for node Amazon S3\nAmazonS3_node123456789 = glueContext.create_dynamic_frame.from_options(\n    format_options={\"quoteChar\": '\"', \"withHeader\": True, \"separator\": \",\"},\n    connection_type=\"s3\",\n    format=\"csv\",\n    connection_options={\n        \"paths\": [\"s3:\/\/hogehoge\/hogehoge\/\"],\n        \"recurse\": True,\n    },\n    transformation_ctx=\"AmazonS3_node123456789\",\n)\n\n# Script generated for node Change Schema\nChangeSchema_node987654321 = ApplyMapping.apply(\n    frame=AmazonS3_node123456789,\n    mappings=[\n        (\"servicePeriodStartDate\", \"string\", \"date\", \"string\"),\n        #~\u7565~\n    ],\n    transformation_ctx=\"ChangeSchema_node987654321\",\n)\n\n# Script generated for node Amazon S3\nAmazonS3_node234567 = glueContext.write_dynamic_frame.from_options(\n    frame=ChangeSchema_node987654321,\n    connection_type=\"s3\",\n    format=\"glueparquet\",\n    connection_options={\n        \"path\": \"s3:\/\/hogehoge\/hogehoge\/\",\n        \"partitionKeys\": [],\n    },\n    format_options={\"compression\": \"snappy\"},\n    transformation_ctx=\"AmazonS3_node234567\",\n)\n\njob.commit()<\/pre>\n<p>\u3053\u306e\u307e\u307e\u5b9f\u884c\u3057\u3066\u3082&#8221;MM\/dd\/yyyy&#8221;\u306e\u65e5\u4ed8\u30c7\u30fc\u30bf\u306fdate\u578b\u306b\u30ad\u30e3\u30b9\u30c8\u3055\u308c\u306a\u304f\u3001\u305f\u3060\u306eParquet\u30d5\u30a1\u30a4\u30eb\u304c\u6307\u5b9a\u3057\u305fS3\u30d0\u30b1\u30c3\u30c8\u306b\u4fdd\u5b58\u3055\u308c\u308b\u3060\u3051\u306a\u306e\u3067\u3001\u751f\u6210\u3055\u308c\u305fscript\u306b\u5bfe\u3057\u3066\u7de8\u96c6\u304c\u5fc5\u8981\u3067\u3059\u3002<\/p>\n<h2>script\u306e\u7de8\u96c6<\/h2>\n<p>\u4eca\u56de\u306e\u8981\u4ef6\u3068\u3057\u3066servicePeriodStartDate\u30ab\u30e9\u30e0\u306e\u30c7\u30fc\u30bf\u3092string\u578b\u304b\u3089date\u578b\u306b\u5909\u3048\u305f\u3044\u304c\u5bfe\u5fdc\u3059\u308b\u65e5\u4ed8\u306e\u5f62\u5f0f\u3067\u306f\u306a\u3044\u305f\u3081\u3001\u30b9\u30ad\u30fc\u30de\u306e\u7de8\u96c6\u524d\u306b&#8221;MM\/dd\/yyyy&#8221;\u304b\u3089&#8221;yyyy\/MM\/dd&#8221;\u306b\u5909\u3048\u3066date\u578b\u3078\u30ad\u30e3\u30b9\u30c8\u3057\u307e\u3059\u3002<\/p>\n<p>\u7de8\u96c6\u7b87\u6240\u306f\u4ee5\u4e0b\u306e\u901a\u308a\u3067\u3059\u3002<\/p>\n<pre class=\"lang:default decode:true\"># \u5fc5\u8981\u306a\u30e2\u30b8\u30e5\u30fc\u30eb\u3068\u95a2\u6570\u3092\u30a4\u30f3\u30dd\u30fc\u30c8\nfrom awsglue.dynamicframe import DynamicFrame\nfrom pyspark.sql.functions import to_date, col\n\n# Script generated for node Amazon S3\nAmazonS3_node123456789 = glueContext.create_dynamic_frame.from_options(\n# \uff5e\u7565\uff5e\n\n# \"\u5217\u306e\u65e5\u4ed8\u5f62\u5f0f\u3092\"MM\/dd\/yyyy\"\u304b\u3089\"yyyy\/MM\/dd\"\u306b\u5909\u63db\ndf = AmazonS3_node1234567.toDF()\ndf = df.withColumn(\"serviceperiodstartdate\", to_date(col(\"serviceperiodstartdate\"), \"MM\/dd\/yyyy\"))\n\n# \u5909\u63db\u3057\u305fDataFrame\u3092DynamicFrame\u306b\u623b\u3057\u307e\u3059\nAmazonS3_node1234567 = DynamicFrame.fromDF(df, glueContext, \"AmazonS3_node1234567\")\n\n# Script generated for node Change Schema\n# \uff5e\u7565\uff5e<\/pre>\n<h3>\u89e3\u8aac<\/h3>\n<p>Glue\u306eDataFrame\u3068DynamicFrame\u306b\u3064\u3044\u3066<\/p>\n<p>DataFrame<\/p>\n<blockquote><p>DataFrame \u306f\u30c6\u30fc\u30d6\u30eb\u3068\u4f3c\u3066\u304a\u308a\u3001\u6a5f\u80fd\u30b9\u30bf\u30a4\u30eb (\u30de\u30c3\u30d7\/\u30ea\u30c7\u30e5\u30fc\u30b9\/\u30d5\u30a3\u30eb\u30bf\u30fc\/\u305d\u306e\u4ed6) \u64cd\u4f5c\u3068 SQL \u64cd\u4f5c (\u9078\u629e\u3001\u30d7\u30ed\u30b8\u30a7\u30af\u30c8\u3001\u96c6\u8a08) \u3092\u30b5\u30dd\u30fc\u30c8\u3057\u3066\u3044\u307e\u3059\u3002<\/p><\/blockquote>\n<p>AWS\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8DynamicFrame\u30af\u30e9\u30b9\u306e\u8aac\u660e\u3088\u308a\u5f15\u7528\u3002<\/p>\n<p>&nbsp;<\/p>\n<p>DynamicFrame<\/p>\n<blockquote><p>DynamicFrame \u306f\u3001DataFrame \u3068\u4f3c\u3066\u3044\u307e\u3059\u304c\u3001\u5404\u30ec\u30b3\u30fc\u30c9\u304c\u81ea\u5df1\u8a18\u8ff0\u3067\u304d\u308b\u305f\u3081\u3001\u6700\u521d\u306f\u30b9\u30ad\u30fc\u30de\u306f\u5fc5\u8981\u3042\u308a\u307e\u305b\u3093\u3002\u4ee3\u308f\u308a\u306b\u3001AWS Glue \u306f\u5fc5\u8981\u306b\u5fdc\u3058\u3066\u30aa\u30f3\u30b6\u30d5\u30e9\u30a4\u3067\u30b9\u30ad\u30fc\u30de\u3092\u8a08\u7b97\u3057\u3001\u9078\u629e (\u307e\u305f\u306f\u5171\u7528) \u30bf\u30a4\u30d7\u3092\u4f7f\u7528\u3057\u3066\u30b9\u30ad\u30fc\u30de\u306e\u4e0d\u4e00\u81f4\u3092\u660e\u793a\u7684\u306b\u30a8\u30f3\u30b3\u30fc\u30c9\u3057\u307e\u3059\u3002\u3053\u308c\u3089\u306e\u4e0d\u6574\u5408\u3092\u89e3\u6c7a\u3057\u3066\u3001\u56fa\u5b9a\u30b9\u30ad\u30fc\u30de\u3092\u5fc5\u8981\u3068\u3059\u308b\u30c7\u30fc\u30bf\u30b9\u30c8\u30a2\u3068\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u3092\u4e92\u63db\u6027\u306e\u3042\u308b\u3082\u306e\u306b\u3067\u304d\u307e\u3059\u3002<\/p><\/blockquote>\n<p>AWS\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8DynamicFrame\u30af\u30e9\u30b9\u306e\u8aac\u660e\u3088\u308a\u5f15\u7528\u3002<\/p>\n<p>\u8981\u7d04\u3059\u308b\u3068\u30c7\u30fc\u30bf\u306e\u62bd\u51fa\u3001\u5909\u63db\u3001\u30ed\u30fc\u30c9\uff08ETL\uff09\u64cd\u4f5c\u306b\u304a\u3044\u3066\u3001\u81ea\u5df1\u8a18\u8ff0\u578b\uff08\u30c7\u30fc\u30bf\u69cb\u9020\u3084\u30aa\u30d6\u30b8\u30a7\u30af\u30c8\u304c\u81ea\u8eab\u306e\u69cb\u9020\u3084\u7279\u6027\u3092\u8a18\u8ff0\u3067\u304d\u308b\uff09\u306e\u7279\u6027\u3092\u6301\u3064\u3053\u3068\u3067\u67d4\u8edf\u6027\u304c\u3042\u308a\u3001\u30b9\u30ad\u30fc\u30de\u306e\u5236\u7d04\u3092\u7de9\u548c\u3059\u308b\u305f\u3081\u306e\u4fbf\u5229\u306a\u30c4\u30fc\u30eb\u3067\u3059\u3002<\/p>\n<p>DynamicFrame\u306fETL\u51e6\u7406\u306b\u5f37\u304f\u3001DataFrame\u306f\u30c6\u30fc\u30d6\u30eb\u51e6\u7406\u306b\u5f37\u3044\u3002\u3053\u308c\u3089\u306e\u7279\u6027\u3092\u751f\u304b\u3057\u3066\u3001\u30c7\u30fc\u30bf\u2f0a\u51fa\u2f12\u3068\u305d\u308c\u306b\u4f34\u3046ETL\u51e6\u7406\u306fDynamicFrame\u3067\u2f8f\u3044\u3001\u30c6\u30fc\u30d6\u30eb\u64cd\u4f5c\u306fDataFrame\u3067\u2f8f\u3046\u3002DynamicFrame\u3092DataFrame\u306b\u5909\u63db\u3059\u308b\u306e\u304ctoDF\u95a2\u6570\u3067\u3001DataFrame\u3092DynamicFrame\u306b\u5909\u63db\u3059\u308b\u306e\u304cfromDF\u3067\u3059\u3002<\/p>\n<p>\u4eca\u56de\u306e\u4f8b\u3060\u3068DynamicFrame\u3067S3\u304b\u3089\u30c7\u30fc\u30bf\u3092\u53d6\u5f97\uff08ETL\u51e6\u7406\uff09\u3057\u3001toDF\u95a2\u6570\u3067DataFrame\u306b\u5909\u63db\u3057\u305f\u5f8cSQL\u64cd\u4f5c\u306eto_date\u3067\u65e5\u4ed8\u306e\u578b\u3092\u5909\u63db\u3057\u3066\u3001DynamicFrame\u306b\u623b\u3057\u3066\u3044\u307e\u3059\u3002<\/p>\n<p>\u3053\u308c\u3089\u306e\u51e6\u7406\u3092\u7d4c\u3066CSV\u304b\u3089Parquet\u30d5\u30a1\u30a4\u30eb\u306b\u5909\u63db\u3059\u308b\u3053\u3068\u3067\u3001Glue\u306e\u30af\u30ed\u30fc\u30e9\u3067\u30af\u30ed\u30fc\u30ea\u30f3\u30b0\u3057\u305f\u969b\u306b\u6b63\u3057\u3044\u8868\u8a18\u306edate\u578b\u306b\u5909\u63db\u3055\u308c\u305f\u30c7\u30fc\u30bf\u30ab\u30bf\u30ed\u30b0\u304c\u751f\u6210\u3067\u304d\u307e\u3059\u3002<\/p>\n<h2>\u307e\u3068\u3081<\/h2>\n<p>date\u578b\u306b\u30ad\u30e3\u30b9\u30c8\u3059\u308b\u969b\u306b\u65e5\u4ed8\u306e\u8868\u8a18\u304c\u72ec\u81ea\u306e\u5f62\u5f0f\u3067\u306a\u3051\u308c\u3070\u901a\u5e38\u306fGlue\u306eCSV\u30d5\u30a1\u30a4\u30eb\u306e\u30af\u30ed\u30fc\u30ea\u30f3\u30b0\u3060\u3051\u3067\u6e08\u307f\u307e\u3059\u304c\u3001\u4eca\u56de\u306e\u69d8\u306a\u30b1\u30fc\u30b9\u3067\u3059\u3068Glue\u306eETL jobs\u304c\u6709\u52b9\u3060\u3068\u611f\u3058\u307e\u3057\u305f\u3002\u4e88\u5099\u77e5\u8b58\u3068\u3057\u3066Python\u306ePandas\u3084Apache Spark\u3067\u30c7\u30fc\u30bf\u51e6\u7406\u3092\u3057\u305f\u7d4c\u9a13\u304c\u3042\u308b\u65b9\u3067\u306a\u3044\u3068\u5b66\u7fd2\u30b3\u30b9\u30c8\u304c\u304b\u304b\u308b\u304b\u3082\u3068\u3044\u3063\u305f\u61f8\u5ff5\u3082\u611f\u3058\u307e\u3057\u305f\u3002\u4eca\u56de\u3001\u5b9f\u969b\u306bGlue ETL jobs\u3092\u4f7f\u7528\u3059\u308b\u306b\u3042\u305f\u308a\u3001\u53c2\u8003\u6587\u732e\u306b\u3082\u6319\u3052\u3066\u3044\u308b\u201dAWS Glue ETL\u30d1\u30d5\u30a9\u30fc\u30de\u30f3\u30b9\u30fb\u30c1\u30e5\u30fc\u30cb\u30f3\u30b0\u2460\u57fa\u790e\u77e5\u8b58\u7de8\u201d\u306e\u8cc7\u6599\u304c\u3068\u3066\u3082\u53c2\u8003\u306b\u306a\u308a\u307e\u3057\u305f\u3002\u30d6\u30e9\u30c3\u30af\u30d9\u30eb\u30c8\u306e\u30bb\u30df\u30ca\u30fc\u304cYouTube\u306b\u3042\u3063\u305f\u306e\u3067\u53c2\u8003\u306b\u5171\u6709\u3057\u3066\u304a\u304d\u307e\u3059\u3002<\/p>\n<p><iframe loading=\"lazy\" title=\"[AWS Black Belt Online Seminar]\u732b\u3067\u3082\u308f\u304b\u308b\u3001AWS Glue ETL\u30d1\u30d5\u30a9\u30fc\u30de\u30f3\u30b9\u30fb\u30c1\u30e5\u30fc\u30cb\u30f3\u30b0 \u524d\u7de8\ud83d\ude3a\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/k-qTKz0xG-0?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<h2>\u53c2\u8003\u6587\u732e<\/h2>\n<p><a href=\"https:\/\/dev.classmethod.jp\/articles\/glue-job-via-datacatalog-cast-column\/\">DevelopersIO Glue\u30c7\u30fc\u30bf\u30ab\u30bf\u30ed\u30b0\u7d4c\u7531\u3067CSV\u3092\u8aad\u307f\u8fbc\u3093\u3067\u65e5\u4ed8\u578b\u306b\u30ad\u30e3\u30b9\u30c8\u3059\u308b\u65b9\u6cd5<\/a><\/p>\n<p><a href=\"https:\/\/qiita.com\/motokazu_ishikawa\/items\/ad870f8d72a7364143b4\">Qiita&nbsp;\u9006\u5f15\u304dPySpark (2.\u65e5\u4ed8\u6642\u523b\u7de8)<\/a><\/p>\n<p><a href=\"https:\/\/qiita.com\/hiroakiyoshii\/items\/eea87148ad5dbed32400\">Qiita&nbsp;AWS Glue \u306e DynamicFrame \u3068\u306f\uff1f<\/a><\/p>\n<p><a href=\"https:\/\/docs.aws.amazon.com\/ja_jp\/glue\/latest\/dg\/glue-etl-scala-apis-glue-dynamicframe-class.html\">AWS Glue Scala DynamicFrame \u30af\u30e9\u30b9<\/a><\/p>\n<p><a href=\"https:\/\/d1.awsstatic.com\/webinars\/jp\/pdf\/services\/202108_Blackbelt_glue_etl_performance1.pdf\">AWS Glue ETL\u30d1\u30d5\u30a9\u30fc\u30de\u30f3\u30b9\u30fb\u30c1\u30e5\u30fc\u30cb\u30f3\u30b0\u2460\u57fa\u790e\u77e5\u8b58\u7de8<\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u3053\u3093\u306b\u3061\u306f\u3001\u57fa\u76e4\u6574\u5099\u30b0\u30eb\u30fc\u30d7\u306e\u4e0a\u897f\u3067\u3059\u3002 AWS\u306eGlue Crawlers\u3067\u30af\u30ed\u30fc\u30e9\u30fc\u3092\u5b9f\u884c\u3059\u308b\u969b\u306bCSV\u30d5\u30a1\u30a4\u30eb\u306e\u307e\u307e\u3060\u3068\u3001date\u578b\u306b\u3057\u305f\u3044\u30c7\u30fc\u30bf\u304cstring\u30c7\u30fc\u30bf\u306e\u307e\u307e\u3067\u3042\u3063\u305f\u308a\u3001\u30c0\u30d6\u30eb\u30af\u30a9\u30fc\u30c6\u30fc\u30b7\u30e7\u30f3\u3067\u56f2\u3063\u305f [&hellip;]<\/p>\n","protected":false},"author":37,"featured_media":11617,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[27],"tags":[],"_links":{"self":[{"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/posts\/11338"}],"collection":[{"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/users\/37"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=11338"}],"version-history":[{"count":29,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/posts\/11338\/revisions"}],"predecessor-version":[{"id":11623,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/posts\/11338\/revisions\/11623"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=\/wp\/v2\/media\/11617"}],"wp:attachment":[{"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=11338"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=11338"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ecomottblog.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=11338"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}