Artificial Intelligence
Transloadit offers Artificial Intelligence as a service, so you don't have to run your own AI models or install complicated software to detect faces in images, for example. These AI capabilities give you advanced methods for processing, analyzing, and understanding digital image, audio, and video files. Use them right inside our encoding pipelines to further automate your media processing.
Robots
At Transloadit, we call our features Robots because you can link them together to create encoding pipelines unique to your use case.
- /document/ocr recognizes text in documents and returns it in a machine-readable format
- /image/describe recognizes objects in images and returns them as English words
- /image/facedetect detects faces in images and can return either their coordinates or the faces themselves as new images
- /image/ocr recognizes text in images and returns it in a machine-readable format
- /speech/transcribe transcribes speech in audio or video files
- /text/speak synthesizes speech from the text in documents
- /text/translate translates text in documents
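To illustrate how Robots can be linked into a pipeline, here is a minimal Python sketch that builds a set of chained Steps as JSON. It assumes the general Transloadit convention of a `steps` object in which each Step names a `robot` and points at its input with `use`, and that uploads enter through `/upload/handle` as `:original`. The `build_pipeline` helper and the `step_N` names are hypothetical, and Robot-specific parameters (such as a target language for translation) are left out, so treat this as an outline rather than ready-to-run Assembly Instructions.

```python
import json


def build_pipeline(robots):
    """Chain Robots so each Step consumes the output of the previous one.

    `robots` is an ordered list of Robot names such as "/speech/transcribe".
    The helper and the generated Step names are illustrative only.
    """
    # Assumption: uploaded files enter the pipeline via the ":original" Step.
    steps = {":original": {"robot": "/upload/handle"}}
    previous = ":original"
    for index, robot in enumerate(robots, start=1):
        name = f"step_{index}"
        # "use" links this Step to the Step that feeds it.
        steps[name] = {"robot": robot, "use": previous}
        previous = name
    return {"steps": steps}


# Example: transcribe speech, translate the text, then speak the translation.
instructions = build_pipeline(
    ["/speech/transcribe", "/text/translate", "/text/speak"]
)
print(json.dumps(instructions, indent=2))
```

Each Robot in the list becomes one Step, so reordering or swapping entries changes the pipeline without touching the linking logic. A real pipeline would also need the parameters each Robot requires.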
Live demos
See our features in action through live demos and code samples, right here on our website:
- Automatically rename images based on text found within them
- Convert text into speech
- Detect faces in images
- Extract all the faces into a single image
- Generate a slideshow from AI-filtered images
- Produce an SRT file from audio or video files
- Recognize and reject certain objects in images
- Recognize and reject nudity in images (NSFW content)
- Recognize text in images
- Transcribe speech in audio or video files
- Translate a text file
Related blog posts
- Launching face detection Robot for images (February 5, 2016)
- Tech preview: new AI Robots for enhanced media processing (February 17, 2020)
- Building a screen reader plugin with /text/speak Robot (June 3, 2021)
- Introducing the OCR Robot for easy text extraction (August 26, 2021)
- Building an alt-text to speech generator with Transloadit (May 9, 2022)
- Implementing OCR in Android and iOS apps with open-source SDKs (September 23, 2024)
- Implementing OCR in Android apps with Google ML Kit (November 4, 2024)
- Extracting text from images in Node.js using AWS Rekognition (November 29, 2024)