ePub (e-books)

Formless Content

Posted in CSS & Books, ePub (e-books), Rant on March 12th, 2012 by adam – Comments Off

There is a lot of interesting stuff happening to the page right now. The page is changing in so many ways – time based media is making its way into book pages, reactive content, scrollable space, and a multitude of differing display devices make designing pages pretty hard work these days. How to design for so many possibilities? How to understand so many possibilities?

Craig Mod of flipboard makes a very compelling argument for two forms of page : formless and definite content in an article he wrote for Book: A Futurists Manifesto - the first book to be produced by PressBooks. Craigs argument in a nutshell and in his own words is:

the key difference between Formless and Definite Content is the interaction between the content and the page. Formless Content doesn’t see the page or its boundaries; Definite Content is not only aware of the page, but embraces it. It edits, shifts, and resizes itself to fit the page [...] Put very simply, Formless Content is unaware of the container. Definite Content embraces the container as a canvas.

Craig argues that most book content we know is formless – the text can reflow into other containers without effecting the meaning.  Its a really well argued position and one that is in tension to the current design methodologies of book designers today. Book designers are taught to design contained space – books are a very definite context in which they work. Desktop Publishing Applications are built to meet this methodology. Pixel perfect manipulation within a strictly contained space. If the designed digital article does not exactly match the printed artefact then something went wrong. A lot of energy has gone into this process.

Formless design principles are uneasy to consider for traditional book designers – how can you design for a page that does not yet know its container? It is literally like asking a book designer to design a book without telling them the page dimensions.

As it happens web designers have been thinking about page design too. For a long time now web designers have made pages that embrace differing containers – they have been working, at least in part, with formless content.

What is missing however are good tools for taking the web designers aptitude for working with formless content to enable them to produce books. A good tool set for designing formless books should not work with a constrained page dimensions. It is tempting, for example, to think of working with a design environment with constrained page-like artefacts - think of Google Docs as an example. Could something like Google Docs with its digital, scrollable, yet fixed page size be a good starting point for some kind of design tool? Place layout and typographical controls on top of Google Docs and do we have the next book design environment?

I don’t think so. I don’t think so because it is exactly the kind of idea that is blinded by the media of the past and cannot accept that things have changed. We must design tools that enable book design for formless content. What those tools look like is a very interesting question and one which Aleksandar Erkalovic (Booktype lead dev) and I have been working on with students (Hannes Bernard and Aiwen Yin) from the Sandberg Institute in Amsterdam.

Our argument is that the design of formless content is really a partially constrained environment since elements within the page have some kind of relationship to each other. This is an argument web designers are familiar with when using design tools like position:relative – a rule which sets a relative position relationships between objects. Relationships can be constrained or shaped by rules which will be at least partially preserved when displayed in different contexts. The meaning is preserved by the relationship between the elements more than by their relationship to the constraints of a page.

This is the reasoning behind Cascading Style Sheets – the design language of the web. It is rule based design and even partly conditional. It is possible to express conditions in CSS even though it is not done that often. A CSS rule such as :

h2+p {page-break-inside:avoid;}

is a conditional CSS rule which will apply the style only when a paragraph follows heading 2 (h2) element.

Web designers know this kind of thinking but book designers are going to have to let go of pixel perfect design and enjoy thinking and designing this way. It seems like a simple idea but it takes a lot to overcome legacy. The legacy is so strong that designers are pretending the issue does not exist. There are tools now appearing and sold as design environments for iPad books. They give near 1:1 page relationship between design environment and the final result. However we all know what happens to digital hardware – it changes. What is true now will not be true 5 years from now so the idea that an ebook is a contained space is very appealing to traditional book designers but it will be a short lived myth. iPads might keep the same form for 5 years, they might not but they certainly will not keep it over the next 5-10 years. Better to learn how to design in the new way than be fooled into thinking you can bring all the old methods to a new medium and get away with it for long.

What we are working on now is a way to meet the designers half way – a visual design environment which is used for rule and condition based design. Can book designers accept a tool like like this? Will web designers just step in and take the role of book designers? Its an interesting question and one we hope to have some more experience with soon.

Stupid iBooks Author

Posted in ePub (e-books) on March 5th, 2012 by adam – Comments Off

I feel genuinely sorry for people that have authored content in iBooks author. Its not because I am feeling defensive since I am involved in the development of a book production platform. I just feel sorry for them because they are part of Apples lock in strategy and it has some very significant impacts on their content.

I was reading a slashdot thread about Booktype and enjoying the conversation. Some very interesting and informed takes on the whole book world (and some not so informed rants!). One of the very last comments was by someone that made a book with the iBook Author. Naturally curious I downloaded the book to have a look. Knowing that iBooks Author makes epubs I was surprised to see the ‘.ibook’ suffix to the file instead of ‘.epub’. So first I tried opening this with calibre. The book loaded into Calibre ok but when displayed all I got was a list of the folder contents. Epubs are archived zip files, when you open them with an ‘archive manager’ you just see a folder structure. Since Apple uses the wrong suffix any other application that opens it will only see that the file is a zip archive and open it like it would any other zip file and display the folders and files inside.

Apple .ibook viewed in calibre

It kind of kills me that this happens. Its not Cailbres issue its the non-standard way of Apple to screw everything up for everyone. Why not just call the file an epub?

So I changed the name of the file so that it ended with the ‘.epub’ suffix and I tried again. Of course now it opened. However the pages were scrambled.  Why? Because Apple uses non-standard CSS controls to layout the books. For a good article and insight into this strategy read Baldur Bjarnasons post about this.

Its really a pity. The author spent a lot of time on this book it seems and I would really like to enjoy their work. So even though this looks like it could be a very nice book I can’t read it. It really can only be read by the iPad. If thats ok by you then sure – but its probably a good idea to ask yourself why. Would you accept these conditions if they were true for books before the iPad came along? Imagine if all paper books made with an Apple desktop publisher application looked scrambled unless viewed with special Apple glasses? Well maybe for an art project it might sound kinda cool but for a normal book? Sound ridiculous? Thats where you are now if you make content with iBook Author.

Finding Free Books Online (and importing them)

Posted in ePub (e-books), Tutorials on February 9th, 2011 by adam – Comments Off

So, the web has a lot of resources for finding free books. First port of call is of course Project Gutenberg and the Internet Archive. However if you like picking through the debris of the web looking for some buried gem you might wish to try another tactic – a filetype search.

Yahoo and Google both support this type of search and it is very useful for finding free books online. For example if you would like to find some free Shakespeare EPUBS (an ebook format that is compatible with almost all devices – ipad, Kindle, Sony e-reader etc) then enter this into either search engine:

shakespeare filetype:epub

This will give you some links either directly to the EPUB online or to torrents (a method for downloading content quickly from the web).

You can also do the same search but for PDF:

shakespeare filetype:pdf

You can find a lot of good content this way. All downloads are free of course but that does not mean that the content is available under a free license. If the content is under a free license or is in the public domain (as with the case with Shakespeare) then you can also import the book into Booki.

Booki can import any EPUB online. To do this register with Booki and then follow these steps:

  1. go to the ‘My Books’ page (linked from the menu on the left)
  2. choose ‘Import Epub’ from the drop down menu in the Import section of the page
  3. Enter in the URL of the EPUB and press ‘Import’.

The book will be downloaded and copied to Booki.

For example I entered this EPUB (I found it using the Yahoo filetype search) on the complete works of Shakespeare (http://pd.contentreserve.com/gutenberg/0/100-TheCompleteWorksOfWilliamShakespeare.epub) into the Booki importer. You can see it here:

http://www.booki.cc/the-complete-works-of-william-shakespeare

and edit it here:

http://www.booki.cc/the-complete-works-of-william-shakespeare/edit

Why would you do this? Well, there are many reasons. Consider for example you wish to make text books for your class. You can import the EPUB into Booki, format, and then make a print-ready PDF – take it to a printer or bind it yourself. Sound like fantasy? Well I might have thought so too until last week when a group of students told me they were using using Booki for just this in France. They have a textbook produced by FLOSS Manuals that is free content but they only have one copy. So they are using Booki to produce more copies since their school has cheap binding equipment.

Or you could just wish to export the content to another format. Booki supports export to PDF (screen and book formatted), ODT (for Open Office etc), EPUB and templated HTML. The last option being interesting if you want to put the content to be read online via a browser.

Another reason to import EPUB is to correct them or format them better for others. This is the project we are working on with Archive.org – we are working on a workflow so that EPUB are imported from Archive.org into Booki for proofing.

Another reason for importing EPUB is so that you can translate or improve the book…Improve Shakespeare? I hear you say…well yes. Ever heard of Pride and Prejudice and Zombies? Check it out its an interesting example of an expired copyright work (Pride and Prejudice by Jane Austin) being adapted and ‘improved’ – well at least re-interpreted.

So, check out the filetype search – it can deliver some excellent material if you are in the hunt for electronic books. If you want to work/mess with the material consider importing them into Booki!

Experimental Wikibooks Importing

Posted in Booktype, ePub (e-books) on September 2nd, 2010 by adam – 1 Comment

Jan Gerber, a friend of Booki and a great programmer (author of FFMPEG2Theora and other essential tools) wrote a extension to Booki to import books from the Wikimedia Foundation project Wikibooks. There is information about how to import Wikibooks into Booki in the Booki User Guide.

Wikibooks has been around for a long time and has been compiling collections of free books. We thought it would be interesting to see if its possible to automate the import of these books into Booki. Doing this would open up the export possibilities for the content in Wikibooks. it would be possible, for example, for wikibooks to be exported into ePub, Book formatted PDF, ODT etc through Booki.

Jan developed the code to do this and it works pretty well. We have a few issues to work out with it and some interesting ideas on how to extend this. Mike Linksvlayer suggested:

“as I’m writing this it occurs to me that it would be fairly simple to create a supplement for the book mostly or even entirely consisting of a collection of relevant Wikipedia articles — see examples of such books created using PediaPress; another approach would be to add a feature to Booki (the software used to create Collaborative Futures) to facilitate importing chapters from Wikipedia.”
http://gondwanaland.com/mlog/2010/08/26/collaborative-futures-org/

Good idea! We liked it so much we added it to our development to-dos. If you are interested in hacking away at this feature, perhaps using the work that Jan has already done to get you started then please let us know!

Importing Archive.org Books with Booki

Posted in Booktype, ePub (e-books) on August 20th, 2010 by adam – Comments Off

For some months Booki has been able to import Archive.org books. This development was sponsored by Archive.org. When importing a book Booki requests and ePub from Archive.org, converts this to the ‘native file format’ (booki-zip) and loads this into the Booki database. It is then possible to export the same book back into an ePub file.

So, if Booki can import an Archive.org ePub and then export it as ePub what is the point? Seems like Booki is an unnecessary conduit. Well, one point is that with Booki you can export the book into multiple formats – such as Book formatted PDF. That means you can take any of those luscious out-of-copyright books, import them into Booki and make real books from them. This is pretty exciting when you see just how lovely some of these books are, take for example the copy of Cinderella in the American Libraries section of Archive.org.

Cinderella

Cinderella, Edward Dalziel, 1865

This version of Cinderella is out-of-copyright and you can republish as you like. This is a pretty exciting prospect, opening the door for anyone to start their own publishing house importing content from Booki, styling, and exporting to print formatted PDF for printing.

However, there are a few steps that you may need to go through first, and this is the real reason why we have implemented importing from Archive.org. All the books in the Archive.org libraries have been created using OCR (Optical Character Recognition) scanning. The process involves loading books onto book scanners and scanning each page.

Archive.org Book Scanner

Archive.org Book Scanner.

However scanning creates a certain amount of errors. OCR doesn’t render all text correctly and cannot tell the difference between text on a page and text in an image. Hence images with embedded text are usually split up with the text elements saved as plain text and the surrounding image saved as multiple smaller images. So the OCR scanned books need proofing and the import feature in Booki enables proofing OCR scanned books from Archive.org. This means that teams can get together remotely, choose a selection of Archive.org books, and get to work improving them.

While this is all working we want to build a tighter workflow and a few extra tools to assist the proofing process (if you are a developer familiar with Python and interested in helping us with this good cause then let us know). Douglas Bagnall (Booki/Objavi developer) recently extended the import functionality so that all the meta data imported from Archive.org is preserved. This opens the door to utilising this information to assist proofing of the content – we hope, for example, to eventually be able to show the complete digital image of the original scan, before it was reduced to OCR, alongside the OCR pages to assist proofing. Watch this space!

Incidentally, Booki can import any ePub, so this means that the way is open for the same proofing process to be applied to other OCR scanning projects. If you have a project like this then let us know, maybe we can help.

Booki, OLPC and OER

Posted in Booktype, ePub (e-books) on August 19th, 2010 by adam – Comments Off

You may be familiar with the One Laptop Per Child (OLPC) project. Its pretty well known and aims to provide free laptops to children all over the world that otherwise could not afford them.

The OLPC is a also pretty good ebook reader as well as demonstrated here:

eBook on the OLPC

eBook on the OLPC

The above image is taken from Reading and Sugar – an excellent manual by James Simmons about working with ebooks on the OLPC. The image shows a book taken from Archive.org and imported into Booki – Booki then exported this to an ePub and this was opened on the OLPC as shown.

In the same manual James talks about using Booki on the OLPC to author ebooks. To quote James:

“Booki is one of the best tools available for Sugar users to create e-books.  It can be used on the XO or from Sugar on a Stick.  It supports many authors collaborating on a single book.  It supports translating books into many languages.  It can create PDFs and EPUBs.  It can create books formatted for print-on-demand services.  It can create documents in Open Office ODT format (which Open Office can convert to MS Word format).  It can even be used to download, proofread, and correct EPUBs created by the Internet Archive.

Booki is an excellent option for teachers preparing textbooks, but it can be used by students for their own projects too.”

Below is an image from the same manual showing Booki being used in the Browse activity (the OLPC browser).

Booki on the OLPC

Booki on the OLPC

We are hoping the good work James has been doing will help raise the awareness of Booki as a platform for book authoring on the OLPC which would open up the world of publishing considerably and (we hope) open up exciting possibilities for OER (Open Educational Resources)…

Objavi ePub (‘e-books’) on the iPad

Posted in ePub (e-books) on July 31st, 2010 by micz – Comments Off

This has been posted by Joe Brockmeier on the FLOSS Manuals mailing list, showing some screens on the Reading and Sugar book on iPad. The ePubs were generated using Objavi 2 (the Booki publishing engine).