Why does my Glue Crawler exclude pattern not apply?

I know this has been asked before, but I have spent hours trying to get this to work.

I have a directory structure like:

- datalake
--- datasets
----- foo
------- 00001.json
------- 00002.json
------- latest.json
----- bar
------- 00001.json
------- latest.json

The include path for my table looks like

s3://<bucket_name>/datalake/datasets/

I want to exclude everything that is not latest.json.

I have tried everything under the sun:

**0*
**/0**
*/0*
*0*
**0**

and many others.

Without fail, my crawler catalogs every single .json file.

I am checking my crawler's results with Athena.

Have I actually gotten the exclude pattern wrong? Or am I misunderstanding the whole thing somehow, so that my pattern doesn't matter at all?

Answers

Can you try using  as the exclude pattern?
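
Before re-running the crawler, it can help to check candidate exclude patterns locally. The sketch below is only an approximation, assuming the glob rules described in the AWS Glue documentation: `*` matches within a single folder level, `**` may cross folder boundaries, `?` matches one character, and patterns are evaluated relative to the include path. It is not the crawler's actual matcher; bracket and brace expressions are not handled, and the names `glob_to_regex`, `excluded`, and `surviving` are hypothetical helpers.

```python
import re

def glob_to_regex(pattern):
    """Translate a simplified glob into a regex (an approximation, not Glue's matcher)."""
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**", i):
            parts.append(".*")        # '**' may cross folder boundaries
            i += 2
        elif pattern[i] == "*":
            parts.append("[^/]*")     # '*' stays inside one folder level
            i += 1
        elif pattern[i] == "?":
            parts.append("[^/]")      # '?' is exactly one non-slash character
            i += 1
        else:
            parts.append(re.escape(pattern[i]))
            i += 1
    return re.compile("".join(parts))

def excluded(relative_key, patterns):
    """True if any pattern matches the whole key (relative to the include path)."""
    return any(glob_to_regex(p).fullmatch(relative_key) for p in patterns)

# Keys from the question, relative to .../datalake/datasets/
KEYS = [
    "foo/00001.json", "foo/00002.json", "foo/latest.json",
    "bar/00001.json", "bar/latest.json",
]

def surviving(patterns):
    """Keys that would still be crawled under this model."""
    return [k for k in KEYS if not excluded(k, patterns)]

if __name__ == "__main__":
    for pat in ["**0*", "**/0**", "*/0*", "*0*", "**0**"]:
        print(pat, "->", surviving([pat]))
```

Under this model, `*0*` excludes nothing because a single `*` cannot cross the `foo/` or `bar/` folder boundary, while `**/0*` leaves only the two latest.json keys. The sketch models the pattern rules only and cannot confirm what the crawler itself does with a given pattern.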




