Oracle PDF Import Extension (for OpenOffice 3.x)

×

Warning message

This extension was not updated recently. It might not work with latest versions of OpenOffice.

Primary tabs

Provider:
Oracle
Maintainer:
mh
Rating:
4.036365

Average: 4 (110 votes)

Application:
OpenOffice.org
Tags:
Sun, pdf, Pdfimport, import, oracle
Source code:
http://svn.services.openoffice.org/ooo/trunk/sdext/source/pdfimport/
Post date:
Monday, 9 June, 2008 - 10:28
Statistics
Week: Not tracked - Month: Not tracked - Year: Not tracked - Timeline
Download extension
Windows version - All releases
Compatible with OpenOffice 4: Unknown
User feedback:
Compatible with OpenOffice 4.x?

If you are using Apache OpenOffice 4.0 or later, you must use another version: PDF Import for Apache OpenOffice.


The PDF Import Extension allows you to import and modify PDF documents. Best results with 100% layout accuracy can be achieved with the "PDF/ODF hybrid file" format, which this extension also enables. A hybrid PDF/ODF file is a PDF file that contains an embedded ODF source file. Hybrid PDF/ODF files will be opened in OpenOffice.org as an ODF file without any layout changes. Users without this extension can open the PDF part of the hybrid file with their PDF viewer.


The PDF Import Extension also allows you to import and modify PDF documents for non hybrid PDF/ODF files. PDF documents are imported in Draw to preserve the layout and to allow basic editing. This is the perfect solution for changing dates, numbers or small portions of text with a minimum loss of formatting information for simple formatted documents.


Documents with more sophisticated layouts, such as those created with professional Desktop Publishing applications that use special fonts and complex vector graphics are not suitable for the PDF Import Extension. Similarly, longer documents (> 20 pages) may cause some performance and stability problems.

Features:

  • Text attributes like font family, font size, weight (bold, not bold), style (italic, not italic) are imported together with their respective text
  • Retain font appearance, when a PDF file uses a font not installed on your system, the font is replaced with the best alternative font
  • Converts images and vector graphics
  • Each line in a paragraph is converted into one text object
  • Import of password-protected PDF files
  • Import shapes with default style
  • Support for colors and bitmaps
  • Backgrounds remain behind other elements

Not supported:

  • Native PDF forms
  • Proper paragraphs
  • Processing layout of LaTeX PDF
  • Import of complex vector graphics elements
  • Conversion of tables
  • Import of EPS graphics
  • RTL (right-to-left) text/font support



Please notice: Use this extension cautiously as editing PDF files is not a trivial task. Encouraged by the huge number of positive comments and the improved quality of this extension, we decided to remove the Beta tag from this release. Of course we will continue to work on this extension to reflect user needs and any changes to the PDF standard.

Oracle PDF Import Extension (for OpenOffice 3.x)

Version Operating system Compatibility Release date
1.0.4 Windows 3.0, 3.1, 3.2, 3.3, 3.4 14/12/2010 - 06:17 More information Download
1.0.4-SolarisSparc Solaris Sparc 3.0, 3.1, 3.2, 3.3, 3.4 14/12/2010 - 06:15 More information Download
1.0.4-Solarisx86 Solaris x86 3.0, 3.1, 3.2, 3.3, 3.4 14/12/2010 - 06:10 More information Download
1.0.4-MacOSX Intel MacOS X 3.0, 3.1, 3.2, 3.3, 3.4 14/12/2010 - 06:06 More information Download
1.0.4-Linuxx86-64 Linux x86-64 3.0, 3.1, 3.2, 3.3, 3.4 14/12/2010 - 06:01 More information Download
1.0.4-LinuxIntel Linux 3.0, 3.1, 3.2, 3.3, 3.4 14/12/2010 - 05:35 More information Download
1.0-MacOSX-PPC MacOS X -- 12/10/2009 - 05:30 More information Download

Comments

PCBSD is really coming into its own and will be getting more popular.

Pity that OO extensions aren't all platform independent (Java?), but would love to see the PDF importer available for FreeBSD. If it's available for Linux, it should be easy to compile in BSD.

Tnx, Jeff

You can compile PDF import extension via ports. From 3.1.1 (not ready yet), it will be installed at
/usr/local/openoffice.org-3.1.1/extensions/
if you are hurry, use ports/editors/openoffice.org-3-RC/ and copy extensions/pdfimport.oxt to somewhere else.

I've been waiting a thousand years for something like this - fantastic! Once this is cleaned up a bit, I will be using it every day - I'll get around to explaining why and how when I've got more time.

Problems up to now:
1. Very slow to load with big files. Unelegantly but a lot quicker, I can get around this problem by splitting the big file into 1-page files with PDFTK Builder, for example, working on the pages that I want to modify, and then rebuilding the file, again with PDFTK Builder.
2. It takes the orientation (landscape or portrait) of the last page in the file as the orientation for the whole file on loading; when I change the orientation of one page, it changes all other pages in the file to conform to this change.
3. Justified text became unjustified with one file.
4. Tables with boxes: With one document, the boxes structure of the table was perfectly rendered, but the contents of most, but not all, of the boxes were shunted over to the left of the page as if the boxes were not there. But with several other files the reverse happened - the contents were perfectly rendered, but the lines of the boxes were all over the page in an apparently random way. (Yeah, I know, it's not truly random ... at least, I hope not!)

There are a few revisions to make still (from the other comments and my own experience), but this tool does it's job; open a PDF (a read only doc) and make changes to it. I'm having to add in links each time I make an edit as well as not being able to re-edit protected documents, so I just make those the last steps in any document changes. This beats having to fix a problem with a source (or missing source) document in order to fix a typo, add an image/text, remove a specific object of information, etc...

If all of the features included in the current GUI are tuned up, this is going to be a huge boost for OOO and OOODraw in particular. Great Job on getting this Beta out to the field, keep it up.

Michael Boardman

Hi,

I am using OpenOffice 3.1 on ubuntu hardy from here: deb http://ppa.launchpad.net/openoffice-pkgs/ubuntu hardy main
I get the error below trying to install the extension:

(com.sun.star.RuntimeException){ { Message = "unsatisfied query for interface of type com.sun.start.script.XLibraryContainer!", Context = (com.sun.star.uno.Xinterface) @0 } }

Thanks

Murray

With OpenOffice3 installed from the PPA on Ubuntu 8.10 I just could not get this extension to produce anything but gibberish, but the other day for some reason I installed the poppler-utils package from the repositories and the extension started working, miraculously!

It would be really nice to know (have known) if this extension depends on poppler utils (or some part of it).

I'm a translator and clients send me pdfs to translate and it's a nightmare. Despite the problems with this extension I'm pretty excited about what it might become, overall nice work.

I tried searching here for poppler, poppler-utils, etc., but no luck.

Wikipedia sent me to http://poppler.freedesktop.org/

ubuntu 8.10的支持性能良好,能很好的打开pdf文件,与acrobat design 8.0相比也不逊色。
看到很多以上不能支持的问题,是否是自己的安装和系统设置有问题。
windows的支持还未使用过。

On Mandriva 10, the extension install windows pops up a Dialog Window for reading the licesnse agreement and then getting to an Accept button to push but, there is no scroll bar to use on this window!!!! PageUp, PageDown, up or down arrows also do not scroll this window. There is no way to get to the Accept button at all!!! Pressing the Close button closes this window and the install is cancelled. Advanced->Windows behavior has no feature for turning scroll bars either. I need another way to install this Sun pdf import extension for Open office!

While installing, I could see that openoffice.org-core and openoffice.org-common were also in the Sun pdf pack, which created a conflict with my installation of Oo before. And as said, I tried to find Oo software to see how this pdf extension worked, and found that Oo was desinstalled.

I had to run again the installation of the suite, only to find the old messages: conflicts with openoffice.org-core and common. So I removed them and I read that Synaptic Pack Manager (I have a version in Spanish, dunno the actual name in English) uninstalled also this Sun pdf extension.

Finally, no pdf manager... :(

Cheeers

I must be doing something wrong, but I am running 3.01 and have installed the PDF Import Extension. When I go to open a PDF (which I can confirm is based on Acrobat 1.3), both Writer and Impress coming up with ASCII filter and then regardless of which options I choose, I get gibberish presented back.

Please advise.

I tried every possible approach to opening the PDF, but all I got was the ASCII filter box and gibberish.

Installing the extension also seems to have disabled the Extension Manager.

When open PDF, change the 'File type' dropdown list to 'PDF - Protable Document Format (*.pdf)', then select your pdf file, everything will be OK.

PS, my environment: Ubuntu 9.04 AMD64 / OOo 3.0.1

Try installing the extension while you are root/administrator.

PDF import fails to install with an error "loading component library failed" and identifying pdfimport.uno.dll as the failed library. I'm using OxygenOffice on a Windows XP laptop.

I am running Windows 7 and am having the same problem. Would really love some help..

The PDF-extension works fine, but all black/white PDFs we create with paperport or acrobat are shown in negative mode (white paper is shown black, black letters are white. It´s possible to invert every page by hand, but that´s not the solution. Has anybody an idea?

[Ed: Links removed as unrelated to OpenOffice.org]

Hi all!
When I export to the hybrid format from writer the files open fine in adobe reader. It is however not possible to reopen them in writer. I got a fat document with nonsensical text and symbols, not at all similar to the original. I have tested with several documents and different settings with the same result. I use OOO 3.01 with a Swedish language pack (OOO 3.01 RC2) on Windows Vista. Does anyone else have the same problem? Any suggestions?

Hi all,

I'm trying to import a pdf file into OO Impress but instead it is always loaded into OO Draw. Is it Impress import feature enabled under this version? I have tried it with both Windows XP and Linux Ubuntu 8.04 but the same result applies.

Any feedback would be appreciated. Thanks for all your time,

Hector

I'm using OO.o 3.0.1 on a Kubuntu 7.10 laptop.
When I try to install the extension I get the "InvalidValueException" error.
Does anyone know what the probem might be? Any help will be appreciated

Hi,
If I export a pdf from draw and restrict permissions (enter password and select changes not permitted), when I try to open the exported file in draw I am unable to view it (just says it is an encrypted file). Is it possible to re-enable change permissions by entering the password ? I don't seem to be able to find this option ?

Regards,
Mike.

We use automated software deployment tools and would like to be able to include this and a number of other sun extensions on a 'per machine' basis. However, when using the unopkg (the OOo extension installation commandline tool), the package issues a EULA which has to be accepted, breaking our installation. I am very confused by this, firstly as a EULA is very rare for administrative installations but also as the software is covered under the GPL which is a author license not a user license.
It would be interesting to know if there is any legal reason that Sun have opted for this strategy and also if there is any kind of command-line switch to turn it off (or to except the EULA).
Many thanks for an excellent and valuable extension.
Matt

Hello,

I tried opening a document in Draw and the layout was fine, however the Chinese characters (simplified) were not present. It may have to do with my system fonts, but would there not be a default to fall back on if the font is not included in the PDF?

Any ideas ?

Otherwise this is a great tool!

Thanks

I also get reversed images when scanning in PDFs. I am scanning from a canon copier. I usually use NitroPDF and have never had this problem. Using the newest version of this extension, when i open the pdf, the color is reversed. White paper and black text becomes black paper and white text.
Is there anyway around this?
Is there a forum on this extension? or way to contact the makers?

Thanks.

I have the same problem and an easy solution.
In my case some Logos and other bitmaps in the PDF are inverted.

I just click the inverted objects and apply the "invert filter" form the "graphic filter bar".
That is all.

regards Bernhard

The PDF-extension works fine, but all black/white PDFs we create with paperport or acrobat are shown in negative mode (white paper is shown black, black letters are white. It´s possible to invert every page by hand, but that´s not the solution. Has anybody an idea?

were you able to find a fix for your "negative" problem? I get the same results when scanning in from our canon copier. You mention inverting by hand? what did you mean by this as it may be a temporary workaround for me. I have emailed the general feedback on the extension main page, maybe i will hear back from them.

Thanks

In case it appeared as an aberration, I got the same error on the same distro.

The Windows version works fine. Can't wait for the Linux version to be fixed.

Getting this message during installing: "(com.sun.star.uno.RunTimeException) {{Message ="unsatisfied query for interface of type com.sun.star.script.XLibraryContainer!", Context= (com.sun.star.uno.XInterface) @0}}", so it did not install. Tried different things, even uninstall and reinstall ooo3. What to do next? (windows xp sp3)

Did you find a solution? I have the same problem.

There some DLL files are missing while trying to implement this extention
Following error :
cannot get symbol : component_getImplementationEnviroment- nor:
msvcr90.dll
msvcp90.dll
msvcm90.dll

I am having the same problem

I get this error:
loading component library failed: file:///home/andresmh/.openoffice.org/3/user/uno_packages/cache/uno_packages/6x2KSp_/pdfimport.oxt/pdfimport.uno.so

This is on openoffice.org-core 1:3.0.0-2ubuntu1

You may have already discovered this link: http://ubuntuforums.org/showthread.php?t=889093&page=23
However I think it needs to be clarified that this instruction just sets up the necessary links in Ubuntu, so that the pdfimport extension can be successfully installed.
So before trying to install this extension type this in a terminal:-
sudo ln -s /usr/lib/libstlport_gcc.so.4.6 /usr/lib/libstlport_gcc.so

Then proceed with the install.
Cheers
Pieter

If you installed OO3 from the PPA you can just run this in a terminal to install the extension:

sudo apt-get install openoffice.org-pdfimport

Installing the PDF Import extension from the PPA works like a charm. Just do as fastfret says:

sudo apt-get install openoffice.org-pdfimport

FYI I tried downloading the extension and using the Extension Manager (under the tools menu) to install it and was unsuccessful.

Hi, same here. I was hoping it might have been a permisisons issues. When I tried to import the extension as root I got asked if I wanted to add this extension for all users. After confirming, I got the 'same' message as you have. Maybe it is a packaging issue. Did you get your OO3 from PPA too?

Seems to work perfectly for the basics. In any case, this extension is very helpful in its concept.

off-topic: at least in ubuntu the font of this site renders very small. please help senior and netbook users increasing the size of the font.

What about a version for Mac OS X PPC?

How should we report problems ?

I have a big problem, i can't use becouse it show me that i don't have OpenOffice 3, i seach this but i can't find. ¿anypeople can help me? please

I try to import a file (*) in order to modify it and it succed.
However, the text in the begining of the second page is not well rendered, certainly because it is reverse in the original pdf document.

(*) link to the file (2 pages, 346 ko): http://www.amap-idf.org/images/pratique/plaquette_amap_simple.pdf

Hi, nice initiative ! PDF import tool will be pretty nice but just now is not fully functional.

I created a document with OOWriter 2.x and exported as PDF in [1] (in portuguese only, sorry) and the PDFImport utility output wasn't fine [2].

Anyway I hope that will get better soon. :)

PS: The PDF Importer works fine only I open the file selecting "document type" as "Portable Document Format (pdf)" in open dialog box.

[1] http://www.inf.puc-rio.br/~ajunior/puc/inf3006/proposta.pdf
[2] http://www.inf.puc-rio.br/~ajunior/PDFImporter-OOWriter.png
--
Amadeu A. Barbosa Jr