Skip to main content

Data components in Langflow

Data components load data from a source into your flow.

They may perform some processing or type checking, like converting raw HTML data into text, or ensuring your loaded file is of an acceptable type.

Use a data component in a flow

The URL data component loads content from a list of URLs.

In the component's URLs field, enter a comma-separated list of URLs you want to load. Alternatively, connect a component that outputs the Message type, like the Chat Input component, to supply your URLs with a component.

To output a Data type, in the Output Format dropdown, select Raw HTML. To output a Message type, in the Output Format dropdown, select Text. This option applies postprocessing with the data_to_text helper function.

In this example of a document ingestion pipeline, the URL component outputs raw HTML to a text splitter, which splits the raw content into chunks for a vector database to ingest.

URL component in a data ingestion pipeline

API Request

This component sends HTTP requests to the specified URLs.

Use this component to interact with external APIs or services and retrieve data. Ensure that the URLs are valid and that you configure the method, headers, body, and timeout correctly.

Inputs

NameDisplay NameInfo
URLsURLsThe URLs to target
curlcurlPaste a curl command to fill in the dictionary fields for headers and body
MethodHTTP MethodThe HTTP method to use, such as GET or POST
HeadersHeadersThe headers to include with the request
BodyRequest BodyThe data to send with the request (for methods like POST, PATCH, PUT)
TimeoutTimeoutThe maximum time to wait for a response

Directory

This component recursively loads files from a directory, with options for file types, depth, and concurrency.

Inputs

InputTypeDescription
pathMessageTextInputPath to the directory to load files from
typesMessageTextInputFile types to load (leave empty to load all types)
depthIntInputDepth to search for files
max_concurrencyIntInputMaximum concurrency for loading files
load_hiddenBoolInputIf true, hidden files will be loaded
recursiveBoolInputIf true, the search will be recursive
silent_errorsBoolInputIf true, errors will not raise an exception
use_multithreadingBoolInputIf true, multithreading will be used

Outputs

OutputTypeDescription
dataList[Data]Loaded file data from the directory

File

The FileComponent is a class that loads and parses text files of various supported formats, converting the content into a Data object. It supports multiple file types and provides an option for silent error handling.

Inputs

NameDisplay NameInfo
pathPathFile path to load.
silent_errorsSilent ErrorsIf true, errors will not raise an exception.

Outputs

NameDisplay NameInfo
dataDataParsed content of the file as a Data object.

URL

The URLComponent is a class that fetches content from one or more URLs, processes the content, and returns it as a list of Data objects. It ensures that the provided URLs are valid and uses WebBaseLoader to fetch the content.

Inputs

NameDisplay NameInfo
urlsURLsEnter one or more URLs

Outputs

NameDisplay NameInfo
dataDataList of Data objects containing fetched content and metadata

Gmail Loader

This component loads emails from Gmail using provided credentials and filters.

For more on creating a service account JSON, see Service Account JSON.

Inputs

InputTypeDescription
json_stringSecretStrInputJSON string containing OAuth 2.0 access token information for service account access
label_idsMessageTextInputComma-separated list of label IDs to filter emails
max_resultsMessageTextInputMaximum number of emails to load

Outputs

OutputTypeDescription
dataDataLoaded email data

Google Drive Loader

This component loads documents from Google Drive using provided credentials and a single document ID.

For more on creating a service account JSON, see Service Account JSON.

Inputs

InputTypeDescription
json_stringSecretStrInputJSON string containing OAuth 2.0 access token information for service account access
document_idMessageTextInputSingle Google Drive document ID

Outputs

OutputTypeDescription
docsDataLoaded document data

This component searches Google Drive files using provided credentials and query parameters.

For more on creating a service account JSON, see Service Account JSON.

Inputs

InputTypeDescription
token_stringSecretStrInputJSON string containing OAuth 2.0 access token information for service account access
query_itemDropdownInputThe field to query
valid_operatorDropdownInputOperator to use in the query
search_termMessageTextInputThe value to search for in the specified query item
query_stringMessageTextInputThe query string used for searching (can be edited manually)

Outputs

OutputTypeDescription
doc_urlsList[str]URLs of the found documents
doc_idsList[str]IDs of the found documents
doc_titlesList[str]Titles of the found documents
DataDataDocument titles and URLs in a structured format

Webhook

This component defines a webhook input for the flow. The flow can be triggered by an external HTTP POST request (webhook) sending a JSON payload.

If the input is not valid JSON, the component will wrap it in a "payload" field. The component's status will reflect any errors or the processed data.

Inputs

NameTypeDescription
dataStringJSON payload for testing the webhook component

Outputs

NameTypeDescription
output_dataDataProcessed data from the webhook input

Hi, how can I help you?