Airbyte
Airbyte is a data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. It has the largest catalog of ELT connectors to data warehouses and databases.
Installation and Setup
These instructions show how to load any source from Airbyte into a local JSON file that can be read in as a document.
Prerequisites:
Have Docker Desktop installed.
Steps:
- Clone Airbyte from GitHub: `git clone https://github.com/airbytehq/airbyte.git`
- Switch into the Airbyte directory: `cd airbyte`
- Start Airbyte: `docker compose up`
- In your browser, visit http://localhost:8000. You will be asked for a username and password. By default, the username is `airbyte` and the password is `password`.
- Set up any source you wish.
- Set the destination to Local JSON with a specified destination path, for example `/json_data`. Set up a manual sync.
- Run the connection.
- To see what files are created, navigate to `file:///tmp/airbyte_local/`.
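The last step can also be checked from Python. The following is a minimal sketch, assuming the `/json_data` destination path from the steps above ends up at `/tmp/airbyte_local/json_data` on the host and that the Local JSON destination writes its output as `.jsonl` files. The helper name `list_synced_files` is hypothetical.

```python
from pathlib import Path


def list_synced_files(output_dir="/tmp/airbyte_local/json_data"):
    """Return the .jsonl files found in output_dir, sorted by name.

    The default path is an assumption: the /json_data destination path
    from the steps above, placed under /tmp/airbyte_local/.
    """
    root = Path(output_dir)
    if not root.is_dir():
        # Nothing has been synced yet, or the path differs on this machine.
        return []
    return sorted(root.glob("*.jsonl"))


if __name__ == "__main__":
    for path in list_synced_files():
        print(path)
```

If the list comes back empty, the sync may not have run yet, or the destination path may differ from the one assumed here.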
Document Loader
See a usage example.
from langchain_community.document_loaders import AirbyteJSONLoader