amundsen-io

License

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Under Apache License 2.0
By amundsen-io

metadata amundsen data-catalog data-discovery linuxfoundation

























Amundsen is a data discovery and metadata engine for improving the productivity of data analysts, data scientists and engineers when interacting with data. It does that today by indexing data resources (tables, dashboards, streams, etc.) and powering a page-rank style search based on usage patterns (e.g. highly queried tables show up earlier than less queried tables). Think of it as Google search for data. The project is named after Norwegian explorer Roald Amundsen, the first person to discover the South Pole.



Amundsen is hosted by the LF AI & Data Foundation. It includes three microservices, one data ingestion library and one common library.



Homepage

Documentation

Requirements

User Interface

Please note that the mock images only served as demonstration purpose.












Get Involved in the Community

Want help or want to help?
Use the button in our header to join our slack channel. Contributions are also more than welcome! As explained in CONTRIBUTING.md there are many ways to contribute, it does not all have to be code with new features and bug fixes, also documentation, like FAQ entries, bug reports, blog posts sharing experiences etc. all help move Amundsen forward. If you find a security vulnerability, please follow this guide.


Getting Started

Please visit the Amundsen installation documentation for a quick start to bootstrap a default version of Amundsen with dummy data.


Architecture Overview

Please visit Architecture for Amundsen architecture overview.


Supported Entities

Supported Integrations
Table Connectors

Amundsen can also connect to any database that provides dbapi or sql_alchemy interface (which most DBs provide).


Table Column Statistics

Dashboard Connectors

ETL Orchestration

Installation

Please visit Installation guideline on how to install Amundsen.


Roadmap

Please visit Roadmap if you are interested in Amundsen upcoming roadmap items.


Blog Posts and Interviews

Talks

Related Articles

Community meetings

Community meetings are held on the first Thursday of every month at 9 AM Pacific, Noon Eastern, 6 PM Central European Time. Link to join


Upcoming meetings & notes

You can the exact date for the next meeting and the agenda a few weeks before the meeting in this doc.


Notes from all past meetings are available here.


Who uses Amundsen?

Here is the list of organizations that are using Amundsen today. If your organization uses Amundsen, please file a PR and update this list.


Currently officially using Amundsen:



  1. Asana

  2. Bagelcode

  3. Bang & Olufsen

  4. Brex

  5. Cameo

  6. Chan Zuckerberg Initiative

  7. Cimpress Technology

  8. Coles Group

  9. Convoy

  10. Databricks

  11. Data Sprints

  12. Dcard

  13. Devoted Health

  14. DHI Group

  15. Edmunds

  16. Everfi

  17. Gusto

  18. Hurb

  19. ING

  20. Instacart

  21. iRobot

  22. Lett

  23. LMC

  24. Loft

  25. Lyft

  26. Merlin

  27. PicPay

  28. Plarium Krasnodar

  29. PUBG

  30. Rapido

  31. REA Group

  32. Remitly

  33. Snap

  34. Square

  35. Tile

  36. WeTransfer

  37. Workday


Contributors ✨

Thanks goes to these incredible people: