Artificial Intelligence
Transloadit offers Artificial Intelligence as a service, so you don't have to run your own AI models or install complicated software in order to detect faces in images, for example. Artificial Intelligence offers advanced methods for processing, analyzing, and understanding digital image, audio and video files. Leverage the AI capabilities available right inside our encoding pipelines to further automate your media processing.
Robots
At Transloadit, we call our features Robots because you can link them together to create encoding pipelines unique to your use case.
-
/image/describe
recognizes objects in images and returns them as English words -
/image/facedetect
detects faces in images and returns their coordinates, or cuts them from the original images and returns those as new images -
/image/ocr
recognizes text in images and returns it in a machine-readable format -
/speech/transcribe
transcribes speech in audio or video files -
/text/speak
synthesizes speech in documents -
/text/translate
translates text in documents
Live demos
See our features in action through live demos and code samples, right here on our website:
- Recognize and reject nudity in images
- Detect faces in images
- Extract all the faces into a single image
- Recognize and reject certain objects in images
- Convert text into speech
- Produce a SRT file from audio or video files
- Transcribe speech in audio or video files
- Translate a text file
- Automatically make a slideshow from recognized objects in an image
- Automatically rename images based on text found within them
- Recognize text in images
Related blog posts
- Adding support for image face detection February 5, 2016
- 🧠 Tech Preview of our new AI bots February 17, 2020
- Let’s Build: Screen Reader Plugin June 3, 2021
- Introducing the OCR Robot August 26, 2021
- Let's Build: An Image Alt-Text Generator May 9, 2022