Artificial Intelligence
Transloadit offers Artificial Intelligence as a service, so you don't have to run your own AI models or install complicated software to, for example, detect faces in images. It provides advanced methods for processing, analyzing, and understanding digital image, audio, and video files. Leverage the AI capabilities available right inside our encoding pipelines to further automate your media processing.
Robots
At Transloadit, we call our features Robots because you can make them work together to create encoding pipelines unique to your use case.
- /image/describe: recognizes objects in images and returns them as English words
- /image/facedetect: detects faces in images and returns their coordinates, or cuts them from the original images and returns those as new images
- /image/ocr: recognizes text in images and returns it in a machine-readable format
- /speech/transcribe: transcribes speech in audio or video files
- /text/speak: synthesizes speech in documents
- /text/translate: translates text in documents
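To illustrate how Robots can work together, here is a minimal sketch that builds a pipeline description chaining three of the Robots listed above: transcribe speech, translate the transcript, then speak the translation. It assumes the usual shape of Transloadit Assembly Instructions, where each named Step has a `robot` and a `use` key pointing at its input. The Step names (`transcribed`, `translated`, `spoken`) and the `build_steps` helper are hypothetical, and the sketch leaves out any further parameters a Robot may require (such as a provider or target language), so treat it as a starting point rather than a complete set of instructions.

```python
import json


def build_steps(robots):
    """Chain Steps so each one uses the output of the Step before it.

    `robots` is a list of (step_name, robot_path) pairs. The step names
    are hypothetical labels chosen for this example.
    """
    steps = {}
    previous = ":original"  # assumed name for the uploaded file
    for name, robot in robots:
        steps[name] = {"robot": robot, "use": previous}
        previous = name
    return {"steps": steps}


# Robot paths are taken from the list above; extra parameters a Robot
# may need are intentionally omitted from this sketch.
instructions = build_steps([
    ("transcribed", "/speech/transcribe"),
    ("translated", "/text/translate"),
    ("spoken", "/text/speak"),
])

print(json.dumps(instructions, indent=2))
```

Because each Step only names its input, you can reorder or swap Robots by changing the list passed to `build_steps`, without touching the rest of the structure.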
Live demos
See our features in action through live demos and code samples, right here on our website:
- Automatically make a slideshow from recognized objects in an image
- Automatically rename images based on text found within them
- Convert text into speech
- Detect faces in images
- Extract all the faces into a single image
- Produce an SRT file from audio or video files
- Recognize and reject certain objects in images
- Recognize and reject nudity in images
- Recognize text in images
- Transcribe speech in audio or video files
- Translate a text file
Related blog posts
- Let's Build: An Image Alt-Text Generator (May 9, 2022)
- Introducing the OCR Robot (August 26, 2021)
- Let’s Build: Screen Reader Plugin (June 3, 2021)
- 🧠 Tech Preview of our new AI bots (February 17, 2020)
- Adding support for image face detection (February 5, 2016)