
Convert documents into different formats
🤖/document/convert converts documents into different formats.
Note
This Robot can convert files to PDF, but cannot convert PDFs to different formats. If you want to convert PDFs to, say, JPEG or TIFF, use 🤖/image/resize. If you want to turn them into text files or run OCR on them to make them searchable, reach out, as we have a new Robot in the works for this.
Sometimes, a certain file type might not support what you are trying to accomplish. Perhaps your company is trying to automate document formatting, but it only works with docx, so all your docs need to be converted. Or maybe your stored jpg files are taking up too much space and you want a lighter format. Whatever the case, we have you covered.
Using this Robot, you can bypass the issues that certain file types may bring, by converting your file into the most suitable format. This also works in conjunction with our other Robots, allowing for even greater versatility when using our services.
Warning
A general rule of this Robot is that converting files into a different format category will result in an error. For example, SRT files can be converted into the VTT format, but not to an image.
The following file formats can be converted from:
ai, csv, doc, docx, eps, gif, html, jpg, latex, md, oda, odd, odt, ott, png, pot, pps, ppt, pptx, ppz, ps, rtf, rtx, svg, text, txt, xhtml, xla, xls, xlsx, xml
Usage example
Convert uploaded files to PDF documents:
{
  "steps": {
    "converted": {
      "robot": "/document/convert",
      "use": ":original",
      "format": "pdf"
    }
  }
}
Parameters
output_meta: Record<string, boolean> | boolean | Array<string>
Allows you to specify a set of metadata that is more expensive on CPU power to calculate, and thus is disabled by default to keep your Assemblies processing fast.
For images, you can add "has_transparency": true in this object to detect whether the image contains transparent parts, and "dominant_colors": true to extract an array of hexadecimal color codes from the image.
For videos, you can add "colorspace": true to extract the colorspace of the output video.
For audio, you can add "mean_volume": true to get a single value representing the mean volume of the audio file.
You can also set this to false to skip metadata extraction and speed up transcoding.
result: boolean (default: false)
Whether the results of this Step should be present in the Assembly Status JSON.
queue: batch
Setting the queue to "batch" manually downgrades the priority of jobs for this Step, to avoid consuming Priority job slots for jobs that don't need zero queue waiting times.
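As a rough sketch of how result and queue might be combined, the Step below converts uploads to PDF, asks for its results to be listed in the Assembly Status JSON, and opts into the lower-priority batch queue. The Step name "converted" is only an illustrative choice.

```json
{
  "steps": {
    "converted": {
      "robot": "/document/convert",
      "use": ":original",
      "format": "pdf",
      "result": true,
      "queue": "batch"
    }
  }
}
```

This kind of setup would suit background conversions where nobody is waiting on the output.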
force_accept: boolean (default: false)
Force a Robot to accept a file type it would have ignored.
By default, Robots ignore files they are not familiar with. 🤖/video/encode, for example, will happily ignore input images.
With the force_accept parameter set to true, you can force Robots to accept all files thrown at them. This will typically lead to errors and should only be used for debugging or combatting edge cases.
use: string | Array<string> | Array<object> | object
Specifies which Step(s) to use as input.
- You can pick any names for Steps except ":original" (reserved for user uploads handled by Transloadit).
- You can provide several Steps as input with arrays:
  { "use": [ ":original", "encoded", "resized" ] }
Tip
That's likely all you need to know about use, but you can view Advanced use cases.
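To illustrate how use chains Steps together, here is a hedged sketch in which a second Step takes the PDF produced by this Robot and hands it to 🤖/image/resize, which the note at the top of this page recommends for turning PDFs into images. The Step names "converted" and "rasterized" are hypothetical, and the "format": "jpg" parameter on 🤖/image/resize is an assumption that should be checked against that Robot's own documentation.

```json
{
  "steps": {
    "converted": {
      "robot": "/document/convert",
      "use": ":original",
      "format": "pdf"
    },
    "rasterized": {
      "robot": "/image/resize",
      "use": "converted",
      "format": "jpg"
    }
  }
}
```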
format (required): ai | csv | doc | docx | eps | gif | html | …
The desired format for document conversion.
markdown_format: commonmark | gfm (default: "gfm")
Markdown can be represented in several variants, so when using this Robot to transform Markdown into HTML, please specify which variant is being used.
markdown_theme: bare | github (default: "github")
This parameter restyles your Markdown files based on several canned presets.
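As a minimal sketch of the two Markdown parameters working together, the Step below assumes the uploaded files are CommonMark-flavored Markdown and renders them to HTML without the default GitHub styling. The Step name "rendered" is hypothetical.

```json
{
  "steps": {
    "rendered": {
      "robot": "/document/convert",
      "use": ":original",
      "format": "html",
      "markdown_format": "commonmark",
      "markdown_theme": "bare"
    }
  }
}
```

Leaving both parameters out would fall back to the documented defaults, "gfm" and "github".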
pdf_margin: string (default: "6.25mm,6.25mm,14.11mm,6.25mm")
PDF paper margins, separated by , and with units.
We support the following unit values: px, in, cm, mm.
Currently this parameter is only supported when converting from html.
pdf_print_background: boolean (default: true)
Print PDF background graphics.
Currently this parameter is only supported when converting from html.
pdf_format: A0 | A1 | A2 | A3 | A4 | A5 | A6 | … (default: "Letter")
PDF paper format.
Currently this parameter is only supported when converting from html.
pdf_display_header_footer: boolean (default: false)
Display PDF header and footer.
Currently this parameter is only supported when converting from html.
pdf_header_template: string
HTML template for the PDF print header.
Should be valid HTML markup with the following classes used to inject printing values into them:
- date: formatted print date
- title: document title
- url: document location
- pageNumber: current page number
- totalPages: total pages in the document
Currently this parameter is only supported when converting from html, and requires pdf_display_header_footer to be enabled.
To change the formatting of the HTML element, the font-size must be specified in a wrapper. For example, to center the page number at the top of a page you'd use the following HTML for the header template:
<div style="font-size: 15px; width: 100%; text-align: center;"><span class="pageNumber"></span></div>
pdf_footer_template: string
HTML template for the PDF print footer.
Should use the same format as the pdf_header_template.
Currently this parameter is only supported when converting from html, and requires pdf_display_header_footer to be enabled.
To change the formatting of the HTML element, the font-size must be specified in a wrapper. For example, to center the page number in the footer you'd use the following HTML for the footer template:
<div style="font-size: 15px; width: 100%; text-align: center;"><span class="pageNumber"></span></div>
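Putting the PDF parameters together, the following sketch assumes the uploaded files are HTML (the only input these options currently apply to). It prints on A4 with custom margins, enables the header and footer, and shows "current page / total pages" in the footer using the pageNumber and totalPages classes listed above. The Step name "printed" and the margin values are illustrative; the order in which pdf_margin assigns its four values to the page sides is not stated here, so verify it before relying on asymmetric margins.

```json
{
  "steps": {
    "printed": {
      "robot": "/document/convert",
      "use": ":original",
      "format": "pdf",
      "pdf_format": "A4",
      "pdf_margin": "12mm,12mm,20mm,12mm",
      "pdf_print_background": true,
      "pdf_display_header_footer": true,
      "pdf_header_template": "<div style=\"font-size: 10px; width: 100%; text-align: center;\"><span class=\"title\"></span></div>",
      "pdf_footer_template": "<div style=\"font-size: 10px; width: 100%; text-align: center;\"><span class=\"pageNumber\"></span> / <span class=\"totalPages\"></span></div>"
    }
  }
}
```

Because the templates live inside JSON strings, the double quotes in the HTML attributes have to be escaped as shown.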