teaching materials for distant reading
-
Updated
Jun 12, 2024 - Python
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
teaching materials for distant reading
LLM-based ontological extraction tools, including SPIRES
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
This is a project using vue.js as frontend and flask as backend. Its aim is to take voice input from the web application and generate a shopping list including item name and quantity. It is a combined project (Data Science + Web development). For language processing NLP (spacy) is used.
OCR, extract and classify documents. In addition, annotate documents and build your own NLP and Computer Vision models using Python by downloading the data. Find examples in our Colab Notebooks, e. g. how to fine-tune Flair.
Finite state and Constraint Grammar based analysers and proofing tools + language resources for Lule Sámi
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Developed a Named Entity Recognition (NER) model with an integrated text summarizer to efficiently extract and summarize key information from unstructured text data.
Neural Network Compression Framework for enhanced OpenVINO™ inference
Open language modeling toolkit based on PyTorch
Different projects to discover Data Science
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Medical Concept Annotation Tool
FrameDeployer is an innovative tool designed to automate the creation of engaging video content using trending topics. With features like trend analysis, linguistic summarization, automated image search, sentiment analysis, text-to-speech, and subtitle generation, you can effortlessly create professional-quality videos. 🚀📹✨
This nTamil project aims to create a comprehensive and high-quality collection of Tamil text data for natural language processing (NLP) especially for LLMs and linguistic research.
Created by Alan Turing