AWS Glue Data Catalog Client for Apache Hive Metastore 项目推荐

铁佛  金牌会员 | 2025-1-10 20:23:31 | 来自手机 | 显示全部楼层 | 阅读模式
打印 上一主题 下一主题

主题 884|帖子 884|积分 2652

AWS Glue Data Catalog Client for Apache Hive Metastore 项目推荐

    aws-glue-data-catalog-client-for-apache-hive-metastore The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions  
项目地址: https://gitcode.com/gh_mirrors/aw/aws-glue-data-catalog-client-for-apache-hive-metastore   
1. 项目基础先容与主要编程语言

本项目是由AWS labs团队开源的AWS Glue Data Catalog Client for Apache Hive Metastore,它是一个用Java语言编写的开源项目。该项目的目的是为了提供一个Apache Hive Metastore兼容的客户端,使得用户可以在Amazon EMR集群上使用AWS Glue Data Catalog作为外部Hive Metastore。
2. 项目的核心功能

AWS Glue Data Catalog是一个完全托管的、与Apache Hive Metastore兼容的元数据存储库。用户可以利用数据目次作为中心存储库,存储数据的布局性和操作性元数据。该项目的核心功能包括:


  • 实现了Apache Hive Metastore客户端,可以在Amazon EMR集群上使用AWS Glue Data Catalog作为外部元数据存储。
  • 作为构建与AWS Glue Data Catalog兼容的Hive Metastore客户端的参考实现。
  • 支持将客户端移植到其他Hive Metastore兼容的平台,如其他Hadoop和Apache Spark发行版。
  • 支持Spark 3和Hive 3。
3. 项目迩来更新的功能

根据项目的更新日记,迩来更新的功能主要包括:


  • 优化了客户端的构建和配置流程,提供了更具体的指南。
  • 支持客户端侧缓存,包括表元数据和数据库元数据的缓存,以提高查询效率。
  • 对缓存策略进行了增强,用户可以自界说缓存大小和过期时间。
  • 修复了一些已知的bug,并提高了客户端的稳定性和性能。
通过这些更新,项目为用户提供了更加便捷的构建和配置体验,同时也提升了元数据处理的效率。
    aws-glue-data-catalog-client-for-apache-hive-metastore The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions  
项目地址: https://gitcode.com/gh_mirrors/aw/aws-glue-data-catalog-client-for-apache-hive-metastore   

免责声明:如果侵犯了您的权益,请联系站长,我们会及时删除侵权内容,谢谢合作!更多信息从访问主页:qidao123.com:ToB企服之家,中国第一个企服评测及商务社交产业平台。

本帖子中包含更多资源

您需要 登录 才可以下载或查看,没有账号?立即注册

x
回复

使用道具 举报

0 个回复

正序浏览

快速回复

您需要登录后才可以回帖 登录 or 立即注册

本版积分规则

铁佛

金牌会员
这个人很懒什么都没写!
快速回复 返回顶部 返回列表