Introduction

With the rapid surge in the use of digital documentation and electronic publications for personal, professional, as well as commercial purposes has given birth to the need for a comprehensive platform to collect and consolidate all documents together in a single window frame. In today's digital era, it is impossible for an organization to function without having a centralized repository of its digital assets, specially official business contracts and certifications. With internet webbing the world together, document management software has become an imperative tool to collaborate and compete with other corporations around the globe.


Concept

VIDIZMO, a robust content management platform, has the capabilities to cater to all your digital document management needs. VIDIZMO offers support for all sorts of digital assets such as video, audio, image, document etc, allowing users to not just view/playback them but derive various other valuable information and insights, which is known as metadata, from within your content. This, in turn, helps to create monetary as well as intellectual value for their content by optimizing its usability, accessibility, search and discovery.


Processing of a Document

As soon as a document is uploaded:

  • Document Processing workflow is initiated for its processing;
  • During processing, the standard windows attributes stored in the file are picked up and added in each of the content files (renditions) produced. These will then be displayed on the Media Info screen.
  • The document is converted to PDF file format as per its default profile defined in Encoding Profiles which includes details like the size of the thumbnail and the output format. To learn more about them, see Understanding Encoding Profiles.
  • The snapshot of the first page of the document will be used as the thumbnail for the document in the portal. This is made customizable for the user as per convenience.
  • Optical Character Recognition is one of the sub-processes that the document is run through which results in editable text extraction from a document in accordance with the page it has been found on. This text is then indexed and added into the timed data through which a document is made searchable.
  • If the uploaded document is protected by a password and user passes that password using API, then such protected documents will be processed as well.
  • After processing, the document will be uploaded to the respective storage provider. To learn more about these providers, see: Storage Providers in VIDIZMO.


Supported Formats

List of supported formats for Document Processing are as follows:

 

FormatExtensions
Adobe Portable Document Format (PDF).pdf
Microsoft XML Paper Specification (XPS).xps
Microsoft Open XML Paper Specification (OpenXPS).oxps
DocumentUltimate Web Viewer Format (XPZ).xpz
Microsoft Word Document.docx
Microsoft Word Macro-Enabled Document.docm
Microsoft Word Template.dotx
Microsoft Word Macro-Enabled Template.dotm
Microsoft Word 97-2003 Document.doc
Microsoft Word 97-2003 Template.dot
Rich Text Document.rtf
OpenDocument Text.odt
OpenDocument Text Template.ott
Microsoft Excel Worksheet.xlsx
Microsoft Excel Macro-Enabled Worksheet.xlsm
Microsoft Excel Template.xltx
Microsoft Excel Macro-Enabled Template.xltm
Microsoft Excel Macro-Enabled Add-In.xlam
Microsoft Excel Binary Worksheet.xlsb
Microsoft Excel 97-2003 Worksheet.xls
Microsoft Excel 97-2003 Template.xlt
Comma Separated Values File.csv
Tab Separated Values File.tsv
OpenDocument Spreadsheet.ods
OpenDocument Spreadsheet Template.ots
Microsoft PowerPoint Presentation.pptx
Microsoft PowerPoint Macro-Enabled Presentation.pptm
Microsoft PowerPoint Template.potx
Microsoft PowerPoint Macro-Enabled Template.potm
Microsoft PowerPoint Slide Show.ppsx
Microsoft PowerPoint Macro-Enabled Slide Show.ppsm
Microsoft PowerPoint 97-2003 Presentation.ppt
Microsoft PowerPoint 97-2003 Slide Show.pps
OpenDocument Presentation.odp
OpenDocument Presentation Template.otp
Microsoft Visio Drawing.vsdx
Microsoft Visio Macro-Enabled Drawing.vsdm
Microsoft Visio Template.vstx
Microsoft Visio Macro-Enabled Template.vstm
Microsoft Visio Stencil.vssx
Microsoft Visio Macro-Enabled Stencil.vssm
Microsoft Visio 2003-2010 XML Drawing.vdx
Microsoft Visio 2003-2010 XML Stencil.vsx
Microsoft Visio 2003-2010 XML Template.vtx
Microsoft Visio 2003-2010 Drawing.vsd
Microsoft Visio 2003-2010 Stencil.vss
Microsoft Visio 2003-2010 Template.vst
Microsoft Visio 2010 Web Drawing.vdw
Microsoft Project Document.mpp
Microsoft Project Template.mpt
Microsoft Outlook E-mail Message.msg
E-mail Message.eml
Apple Mail E-mail File.emlx
HyperText Markup Language (HTML).html, .htm
Mime HTML (MHTML).mht, .mhtml
Plain Text Document.txt, .xml
AutoCAD Drawing (R13 to 2018).dwg
AutoCAD Drawing Interchange (R12 to 2018).dxf
Tagged Image File Format (TIFF).tif, .tiff
Deja Vu (DjVu).djvu
Digital Imaging and Communications in Medicine (DICOM).dcm
Scalable Vector Graphics (SVG).svg
Windows Enhanced Metafile (EMF).emf
Adobe Photoshop Document (PSD).psd
Joint Photographic Experts Group (JPEG).jpg, .jpeg, .jpe, .jfif
JPEG 2000 (JP2).jp2, .jpf, .jpx, .j2k, .j2c, .jpc
JPEG XR (HD Photo).jxr, .wdp, .hdp
Portable Network Graphics (PNG).png
Graphics Interchange Format (GIF).gif
WebP Image.webp
Bitmap Picture (BMP).bmp
Windows Metafile (WMF).wmf
Device Independent Bitmap (DIB).dib
Computer Graphics Metafile.cgm
Corel Metafile Exchange Image.cmx
Design Web Format.dwf
DreamWeaver Template.dwt
Encapsulated PostScript Language.eps
Electronic Publication.epub
Icon File.ico
Industry Foundation Classes.ifc
Lossless JPEG.jpeg-ls
Microsoft Exchange File Format.mpx
Printer Command Language.pcl
Microsoft PowerPoint template.pot
PostScript.ps
Stereolithrography.stl
Tagged Image File Format.tif
Text Document.txt
Excel Spreadshet 2003.xls2003