Contents Menu Expand Light mode Dark mode Auto light/dark mode
Light Logo Dark Logo
Light Logo Dark Logo
Join

Getting Started

  • What is Argilla?
  • ๐Ÿš€ Quickstart
    • Installation
    • Workflow Feedback Dataset
    • Workflow other datasets
  • ๐ŸŽผ Cheatsheet
  • ๐Ÿ”ง Installation
    • Python
    • Docker
    • Docker Quickstart
    • Docker-compose
    • Cloud Providers and Kubernetes
    • Hugging Face Spaces
    • Google Colab
  • โš™๏ธ Configuration
    • Elasticsearch
    • Server configuration
    • User Management
    • Workspace Management
    • Database Migrations
    • Image Support

Conceptual Guides

  • Argilla concepts
  • Data collection for LLMs
    • Collecting RLHF data
    • Collecting demonstration data
    • Collecting comparison data

Practical Guides

  • ๐Ÿ—บ๏ธ Practical guides overview
  • ๐Ÿง Choose a dataset type
  • ๐Ÿง‘โ€๐Ÿ’ป Create a dataset
  • ๐Ÿ—‚๏ธ Assign records to your team
  • ๐Ÿ’ซ Update a dataset
  • ๐Ÿ”Ž Filter and query datasets
  • โœ๏ธ Annotate a dataset
  • ๐ŸŒŠ Simplify annotation with machine feedback workflows
    • ๐Ÿง‘โ€๐Ÿซ Active Learning
    • ๐Ÿ‘ฎ Weak Supervision
    • ๐Ÿ”ฆ Semantic Search
    • โฒ๏ธ Job Scheduling and Callbacks
  • ๐Ÿ“Š Collect responses and metrics
  • ๐Ÿ“ฅ Export a dataset
  • ๐Ÿฆพ Fine-tune LLMs and other language models

Tutorials and Integrations

  • Tutorials
  • Integrations
    • Monitoring LLMs in LangChain apps, chains, and agents and tools
    • Large scale document processing for LLMs with Unstructured.io
    • Monitor NLP models with FastAPI and ArgillaLogHTTPMiddleware

Reference

  • Python
    • Client
    • Metrics
    • Labeling
    • Training
    • Monitoring
    • Listeners
    • Users
    • Workspaces
  • CLI
  • Argilla UI
    • Pages
    • Features
  • Notebooks
    • ๐Ÿ” Backup and version Argilla Datasets using DVC
    • ๐Ÿš€ Run Argilla with a Transformer in an active learning loop and a free GPU in your browser
    • ๐Ÿ’พ Monitor FastAPI model endpoints
    • ๐Ÿงธ Using LLMs for Text Classification and Summarization Suggestions with spacy-llm
    • ๐Ÿ—บ๏ธ Add bias-equality features to datasets with disaggregators
    • ๐Ÿ’ก Build and evaluate a zero-shot sentiment classifier with GPT-3
    • ๐Ÿ’จ Label data with semantic search and Sentence Transformers
    • ๐Ÿ“ธ Bulk Labeling Multimodal Data
    • ๐Ÿงฑ Augment weak supervision rules with Sentence Transformers
    • ๐Ÿ”ซ Zero-shot and few-shot classification with SetFit
    • ๐Ÿ—‚ Multi-label text classification with weak supervision
    • ๐Ÿ“ฐ Train a text classifier with weak supervision
    • ๐Ÿ—‚๏ธ Assign records to your annotation team
    • ๐Ÿฉน Delete labels from a Token or Text Classification dataset
    • ๐Ÿ”ซ Evaluate a zero-shot NER with Flair
    • ๐Ÿญ Train a NER model with skweak
    • ๐Ÿ’ซ Explore and analyze spaCy NER predictions
    • ๐Ÿง Find label errors with cleanlab
    • ๐Ÿฅ‡ Compare Text Classification Models
    • ๐Ÿ•ต๏ธโ€โ™€๏ธ Analize predictions with explainability methods
    • ๐Ÿงผ Clean labels using your modelโ€™s loss
    • # ๐Ÿค” Fine-tunning a NER model with BERT for Beginners
    • ## Introduction
    • ## Running Argilla
    • ## Setup
    • ## ๐Ÿš€ Exploring our dataset
    • ## โณ Preprocessing the data
    • ## ๐Ÿ” Fine-tunning the model
    • ๐Ÿ“โœ”๏ธ Summary
    • Text classification active learning with classy-classification
    • ๐Ÿค” Text Classification active learning with ModAL
    • ๐Ÿคฏ Few-shot classification with SetFit
    • ๐Ÿค— Train a sentiment classifier with SetFit
    • ๐Ÿ‘‚ Text Classification active learning with small-text
    • ๐Ÿท๏ธ Fine-tune a sentiment classifier with your own data
    • ๐Ÿ•ธ๏ธ Train a summarization model with Unstructured and Transformers
  • Telemetry
  • Terminology

Community

  • Slack
  • Github
  • Discussion forum
  • Developer documentation
  • Contributor Documentation
  • Migration from Rubrix
  v: latest
Versions
latest
v1.16.0
v1.15.0
v1.14.0
v1.13.0
v1.12.0
v1.11.0
v1.10.0
v1.9.0
v1.8.0
v1.7.0
v1.6.0
v1.5.0
v1.4.0
v1.3.0
v1.2.0
v1.1.0
update-changelog-for-singlelabel-multilabel
releases-v1.9.0
releases-1.8.0
releases-1.15.1
releases-1.13.3
releases-1.13.2
releases-1.13.1
releases-1.13.0
releases-1.12.1
releases-1.11.0
releases-1.10.0
refactor-using-more-general-409-api-error
refactor-split-remote-schemas
refactor-poc-change-component-questionform-to-composition-api
refactor-pagination-component
refactor-configure-elasticsearch-timeouts
pre-commit-ci-update-config
hotfix-updated_save
fix-remove-sanitize-html
fix-frontend-dependencies
fix-close-pr-2
fix-babel-wrong-import-dependency
feature-refresh-search-index-endpoint
feature-huggingface-agents-integration
feature-feedback-dataset-semantic-similarity
feature-environment-per-branch-2
feature-background-tasks-poc
feature-add-vectors-endpoints
feat-resize-annotation-mode
feat-required-widget-for-question
feat-poc-change-questionsform-to-composition-api
feat-local-remote-alignment
feat-improve-pagination
feat-feedback_task_helpbox
feat-cleanup-folder-components
feat-allow-pass-extra-config-to-search-backend-client
feat-3616-autotrain
docs-suggestions-peppinob-ol
docs-fine-tune-openai-rag
docs-correct-version
docs-add_rm_example_trl_01
docs-3787-docs-token-classification-tutorial-using-spacy-llm
docs-3609-docs-add-visualisation-to-docs-wrt-argilla-structure
develop
dependabot-pip-wrapt-gte-1.13-and-lt-1.16
dependabot-pip-uvicorn-standard-gte-0.15.0-and-lt-0.24.0
dependabot-pip-pandas-gte-1.0.0-and-lt-3.0.0
dependabot-pip-opensearch-py-gte-2.0-and-lt-2.4
dependabot-pip-huggingface-hub-gte-0.5.0-and-lt-0.18
ci-change-default-board-for-new-issues
chore-ruff-unused-imports
Downloads
On Read the Docs
Project Home
Builds
Back to top
Join

๐Ÿ”ฆ Semantic search#

These tutorials show you how to use semantic search with Argilla.

๐Ÿ“ธ Bulk Labelling Multimodal Data

MLOps Steps: Labelling
NLP Tasks: TextClassification (images)
Libraries: Argilla, sentence-transformers
Techniques: Semantic search

๐Ÿ’จ Speed-up data labelling with Sentence Transformer embeddings

MLOps Steps: Labelling
NLP Tasks: TextClassification
Libraries: Argilla, sentence-transformers
Techniques: Semantic search

Copyright © 2023, Argilla.io
Made with Sphinx and @pradyunsg's Furo