Skip to main content

Loader components in Langflow

info

As of Langflow 1.1, loader components are now found in the Components menu under Bundles.

Loaders fetch data into Langflow from various sources, such as databases, websites, and local files.

Use a loader component in a flow

This flow creates a question-and-answer chatbot for documents that are loaded into the flow. The Unstructured.io loader component loads files from your local machine, and then parses them into a list of structured Data objects. This loaded data informs the Open AI component's responses to your questions.

Sample Flow retrieving data with unstructured

Confluence

The Confluence component integrates with the Confluence wiki collaboration platform to load and process documents. It utilizes the ConfluenceLoader from LangChain to fetch content from a specified Confluence space.

Inputs

NameDisplay NameInfo
urlSite URLThe base URL of the Confluence Space (e.g., https://company.atlassian.net/wiki)
usernameUsernameAtlassian User E-mail (e.g., email@example.com)
api_keyAPI KeyAtlassian API Key (Create an API key at: Atlassian)
space_keySpace KeyThe key of the Confluence space to access
cloudUse Cloud?Whether to use Confluence Cloud (default: true)
content_formatContent FormatSpecify content format (default: STORAGE)
max_pagesMax PagesMaximum number of pages to retrieve (default: 1000)

Outputs

NameDisplay NameInfo
dataDataList of Data objects containing the loaded Confluence documents

GitLoader

The GitLoader component uses the GitLoader from LangChain to fetch and load documents from a specified Git repository.

Inputs

NameDisplay NameInfo
repo_pathRepository PathThe local path to the Git repository
clone_urlClone URLThe URL to clone the Git repository from (optional)
branchBranchThe branch to load files from (default: 'main')
file_filterFile FilterPatterns to filter files (e.g., '.py' to include only .py files, '!.py' to exclude .py files)
content_filterContent FilterA regex pattern to filter files based on their content

Outputs

NameDisplay NameInfo
dataDataList of Data objects containing the loaded Git repository documents

Unstructured

This component uses the Unstructured.io Serverless API to load and parse files into a list of structured Data objects.

Inputs

NameDisplay NameInfo
fileFileThe path to the file to be parsed (supported types are listed here)
api_keyAPI KeyUnstructured.io Serverless API Key
api_urlUnstructured.io API URLOptional URL for the Unstructured API
chunking_strategyChunking StrategyStrategy for chunking the document (options: "", "basic", "by_title", "by_page", "by_similarity")
unstructured_argsAdditional ArgumentsOptional dictionary of additional arguments for the Unstructured.io API

Outputs

NameDisplay NameInfo
dataDataList of Data objects containing the parsed content from the input file

Hi, how can I help you?