How to Upload Comics to Kindle Calibre

E-book conversion¶

calibre has a conversion organisation that is designed to be very easy to employ. Normally, yous but add a book to calibre, click convert and calibre will attempt hard to generate output that is as close as possible to the input. Nevertheless, calibre accepts a very large number of input formats, not all of which are as suitable as others for conversion to e-books. In the case of such input formats, or if you simply desire greater control over the conversion system, calibre has a lot of options to fine tune the conversion process. Annotation however that calibre's conversion arrangement is not a substitute for a full diddled e-book editor. To edit e-books, I recommend first converting them to EPUB or AZW3 using calibre and then using the Edit book characteristic to get them into perfect shape. Y'all can and then utilise the edited due east-book as input for conversion into other formats in calibre.

This certificate will refer mainly to the conversion settings as found in the conversion dialog, pictured below. All these settings are also available via command line interface to conversion, documented at ebook-convert. In calibre, y'all can obtain help on whatever individual setting by holding your mouse over it, a tooltip will appear describing the setting.

E-book conversion dialog

Contents

  • Introduction

  • Look & feel

  • Page setup

  • Heuristic processing

  • Search & replace

  • Structure detection

  • Table of Contents

  • Using images equally chapter titles when converting HTML input documents

  • Using tag attributes to supply the text for entries in the Table of Contents

  • How options are set up/saved for conversion

  • Format specific tips

Introduction¶

The showtime thing to understand about the conversion organisation is that it is designed as a pipeline. Schematically, it looks like this:

The conversion pipeline

The input format is showtime converted to XHTML by the appropriate Input plugin. This HTML is and then transformed. In the last stride, the candy XHTML is converted to the specified output format by the appropriate Output plugin. The results of the conversion can vary profoundly, based on the input format. Some formats catechumen much amend than others. A listing of the best source formats for conversion is bachelor here.

The transforms that human action on the XHTML output are where all the work happens. In that location are various transforms, for example, to insert book metadata as a page at the commencement of the book, to detect chapter headings and automatically create a Table of Contents, to proportionally adjust font sizes, et cetera. It is important to remember that all the transforms act on the XHTML output by the Input plugin, not on the input file itself. Then, for case, if you ask calibre to catechumen an RTF file to EPUB, information technology volition first be converted to XHTML internally, the diverse transforms volition be practical to the XHTML then the Output plugin will create the EPUB file, automatically generating all metadata, Tabular array of Contents, et cetera.

Y'all can see this procedure in action by using the debug choice Debug icon. But specify the path to a folder for the debug output. During conversion, calibre will identify the XHTML generated past the various stages of the conversion pipeline in different sub-folders. The 4 sub-folders are:

Stages of the conversion pipeline

Binder

Description

input

This contains the HTML output by the Input plugin. Use this to debug the Input plugin.

parsed

The result of pre-processing and converting to XHTML the output from the Input plugin. Utilise to debug construction detection.

construction

Post structure detection, but before CSS flattening and font size conversion. Utilise to debug font size conversion and CSS transforms.

processed

Simply before the e-volume is passed to the Output plugin. Use to debug the Output plugin.

If you want to edit the input document a little before having calibre convert it, the best thing to do is edit the files in the input sub-binder, then zip it up, and use the Null file every bit the input format for subsequent conversions. To practise this use the Edit meta information dialog to add the Nothing file every bit a format for the volume and then, in the top left corner of the conversion dialog, select ZIP as the input format.

This document will deal mainly with the various transforms that operate on the intermediate XHTML and how to control them. At the end are some tips specific to each input/output format.

Look & feel¶

Contents

  • Fonts

  • Text

  • Layout

  • Styling

  • Transform styles

  • Transform HTML

This group of options controls various aspects of the look and feel of the converted e-book.

Fonts¶

One of the nicest features of the e-reading experience is the ability to hands adjust font sizes to suit private needs and lighting conditions. calibre has sophisticated algorithms to ensure that all the books it outputs have a consistent font sizes, no matter what font sizes are specified in the input document.

The base font size of a document is the nigh common font size in that document, i.e., the size of the bulk of text in that certificate. When you specify a Base of operations font size, calibre automatically rescales all font sizes in the document proportionately, then that the most mutual font size becomes the specified base font size and other font sizes are rescaled appropriately. Past choosing a larger base of operations font size, you can brand the fonts in the document larger and vice versa. When you set the base font size, for best results, yous should also set the font size key.

Normally, calibre will automatically cull a base font size advisable to the output profile you have chosen (see Page setup). However, you can override this here in instance the default is not suitable for yous.

The Font size central selection lets you control how not-base font sizes are rescaled. The font rescaling algorithm works using a font size key, which is simply a comma-separated listing of font sizes. The font size fundamental tells calibre how many "steps" bigger or smaller a given font size should exist compared to the base of operations font size. The idea is that in that location should be a limited number of font sizes in a document. For example, ane size for the body text, a couple of sizes for unlike levels of headings and a couple of sizes for super/sub scripts and footnotes. The font size key allows calibre to compartmentalize the font sizes in the input documents into split up "bins" corresponding to the different logical font sizes.

Permit's illustrate with an example. Suppose the source document nosotros are converting was produced past someone with excellent eyesight and has a base of operations font size of 8pt. That means the bulk of the text in the document is sized at 8pts, while headings are somewhat larger (say x and 12pt) and footnotes somewhat smaller at 6pt. At present if we use the post-obit settings:

                                Base                font                size                :                12                pt                Font                size                key                :                7                ,                8                ,                ten                ,                12                ,                14                ,                16                ,                18                ,                twenty              

The output document will have a base font size of 12pt, headings of 14 and 16pt and footnotes of 8pt. At present suppose we want to make the largest heading size stand out more and make the footnotes a trivial larger equally well. To achieve this, the font primal should be inverse to:

                                New                font                size                key                :                vii                ,                9                ,                12                ,                14                ,                18                ,                xx                ,                22              

The largest headings will now become 18pt, while the footnotes will become 9pt. You can play with these settings to try and figure out what would be optimum for y'all by using the font rescaling wizard, which can be accessed past clicking the little push button side by side to the Font size key setting.

All the font size rescaling in the conversion tin as well be disabled here, if you would like to preserve the font sizes in the input document.

A related setting is Line height. Line acme controls the vertical acme of lines. By default, (a line elevation of 0), no manipulation of line heights is performed. If you specify a non-default value, line heights volition be set up in all locations that don't specify their ain line heights. Nevertheless, this is something of a edgeless weapon and should be used sparingly. If you want to adjust the line heights for some section of the input, it's better to use the Extra CSS.

In this section you tin can also tell calibre to embed any referenced fonts into the book. This will allow the fonts to work on reader devices even if they are non bachelor on the device.

Text¶

Text tin exist either justified or not. Justified text has extra spaces between words to requite a polish correct margin. Some people adopt justified text, others do non. Ordinarily, calibre volition preserve the justification in the original document. If yous want to override it you can employ the Text justification option in this section.

You can too tell calibre to Smarten punctuation which will replace patently quotes, dashes and ellipses with their typographically correct alternatives. Note that this algorithm is non perfect and then it is worth reviewing the results. The opposite, namely, Unsmarted punctuation is as well available.

Finally, at that place is Input character encoding. Older documents sometimes don't specify their grapheme encoding. When converted, this can result in not-English characters or special characters like smart quotes being corrupted. calibre tries to auto-detect the graphic symbol encoding of the source document, merely information technology does not always succeed. You can force it to assume a item character encoding by using this setting. cp1252 is a common encoding for documents produced using Windows software. Yous should also read How do I convert my file containing non-English characters, or smart quotes? for more on encoding bug.

Layout¶

Normally, paragraphs in XHTML are rendered with a blank line between them and no leading text indent. calibre has a couple of options to control this. Remove spacing betwixt paragraphs forcefully ensure that all paragraphs have no inter paragraph spacing. It too sets the text indent to one.5em (tin can be changed) to marker the start of every paragraph. Insert blank line does the opposite, guaranteeing that at that place is exactly one bare line betwixt each pair of paragraphs. Both these options are very comprehensive, removing spacing, or inserting information technology for all paragraphs (technically <p> and <div> tags). This is and then that you lot tin can just gear up the option and exist certain that it performs as advertised, irrespective of how messy the input file is. The one exception is when the input file uses difficult line breaks to implement inter-paragraph spacing.

If you lot want to remove the spacing between all paragraphs, except a select few, don't use these options. Instead add the following CSS lawmaking to Extra CSS:

                                p                ,                div                {                margin                :                0                pt                ;                border                :                0                pt                ;                text                -                indent                :                one.five                em                }                .                spacious                {                margin                -                bottom                :                ane                em                ;                text                -                indent                :                0                pt                ;                }              

Then, in your source document, mark the paragraphs that need spacing with class="spacious". If your input certificate is not in HTML, employ the Debug choice, described in the Introduction to go HTML (use the input sub-folder).

Some other useful options is Linearize tables. Some desperately designed documents apply tables to control the layout of text on the page. When converted these documents often have text that runs off the page and other artifacts. This option will excerpt the content from the tables and present it in a linear manner. Notation that this pick linearizes all tables, so but utilise it if you are sure the input certificate does non utilize tables for legitimate purposes, like presenting tabular information.

Styling¶

                                .                endnote                {                text                -                marshal                :                right                }              

or if you want to modify the indentation of all paragraphs:

Extra CSS is a very powerful option, but you practise need an understanding of how CSS works to use it to its full potential. You lot can use the debug pipeline choice described higher up to run across what CSS is present in your input document.

A simpler option is to employ Filter manner information. This allows you to remove all CSS properties of the specified types from the document. For example, you tin can use it to remove all colors or fonts.

Transform styles¶

This is the about powerful styling related facility. You can apply it to ascertain rules that modify styles based on various conditions. For case you can use it to modify all green colors to blue, or remove all bold styling from the text or color all headings a certain colour, etc.

Transform HTML¶

Similar to transform styles, but allows you to make changes to the HTML content of the book. You can supersede one tag with another, add classes or other attributes to tags based on their content, etc.

Page setup¶

The Page setup options are for controlling screen layout, like margins and screen sizes. In that location are options to setup page margins, which will be used by the output plugin, if the selected output format supports folio margins. In improver, you should cull an Input profile and an output profile. Both sets of profiles basically deal with how to translate measurements in the input/output documents, screen sizes and default font rescaling keys.

If you know that the file y'all are converting was intended to exist used on a particular device/software platform, choose the corresponding input profile, otherwise just cull the default input contour. If you know the files y'all are producing are meant for a particular device blazon, choose the corresponding output profile. Otherwise, choose one of the Generic output profiles. If you are converting to MOBI or AZW3 then you will almost always want to cull i of the Kindle output profiles. Otherwise, your best bet for modern E-book reading devices is to choose the Generic e-ink Hard disk output profile.

The output profile also controls the screen size. This volition crusade, for example, images to be machine-resized to exist fit to the screen in some output formats. So choose a profile of a device that has a screen size similar to your device.

Heuristic processing¶

Heuristic processing provides a variety of functions which can be used to try and discover and right mutual problems in poorly formatted input documents. Employ these functions if your input document suffers from poor formatting. Considering these functions rely on common patterns, be aware that in some cases an option may pb to worse results, so use with intendance. As an example, several of these options will remove all non-breaking-infinite entities, or may include false positive matches relating to the office.

Enable heuristic processing

This option activates calibre'southward Heuristic processing stage of the conversion pipeline. This must exist enabled in order for various sub-functions to exist applied

Unwrap lines

Enabling this option will cause calibre to attempt to detect and right difficult line breaks that exist inside a document using punctuation clues and line length. calibre will first attempt to find whether hard line breaks exist, if they do not appear to exist calibre will non attempt to unwrap lines. The line-unwrap factor can be reduced if you want to 'force' calibre to unwrap lines.

Line-unwrap gene

This choice controls the algorithm calibre uses to remove difficult line breaks. For example, if the value of this pick is 0.4, that means calibre will remove hard line breaks from the stop of lines whose lengths are less than the length of xl% of all lines in the document. If your certificate only has a few line breaks which demand correction, then this value should be reduced to somewhere between 0.1 and 0.ii.

Find and markup unformatted chapter headings and sub headings

If your certificate does not have affiliate headings and titles formatted differently from the rest of the text, calibre tin employ this option to attempt to detect them and surround them with heading tags. <h2> tags are used for affiliate headings; <h3> tags are used for any titles that are detected.

This function will non create a TOC, just in many cases it will cause calibre's default chapter detection settings to correctly detect chapters and build a TOC. Adjust the XPath nether Structure detection if a TOC is not automatically created. If there are no other headings used in the certificate then setting "//h:h2" under Construction detection would be the easiest way to create a TOC for the document.

The inserted headings are not formatted, to utilize formatting utilise the Extra CSS pick under the Await and Feel conversion settings. For instance, to center heading tags, use the following:

                                    h2                  ,                  h3                  {                  text                  -                  marshal                  :                  center                  }                
Renumber sequences of <h1> or <h2> tags

Some publishers format chapter headings using multiple <h1> or <h2> tags sequentially. calibre's default conversion settings will cause such titles to be split into two pieces. This option will re-number the heading tags to prevent splitting.

Delete blank lines between paragraphs

This option will cause calibre to clarify blank lines included within the document. If every paragraph is interleaved with a blank line, then calibre will remove all those blank paragraphs. Sequences of multiple blank lines volition be considered scene breaks and retained as a single paragraph. This option differs from the Remove paragraph spacing choice under Await and Feel in that it actually modifies the HTML content, while the other option modifies the certificate styles. This option can also remove paragraphs which were inserted using calibre's Insert bare line pick.

Ensure scene breaks are consistently formatted

With this choice calibre will attempt to observe common scene-pause markers and ensure that they are center aligned. 'Soft' scene suspension markers, i.east. scene breaks only defined by extra white space, are styled to ensure that they volition not be displayed in conjunction with page breaks.

Replace scene breaks

If this pick is configured then calibre will replace scene intermission markers it finds with the replacement text specified past the user. Please note that some ornamental characters may not be supported across all reading devices.

In general you should avoid using HTML tags, calibre will discard any tags and utilise pre-defined markup. <hr /> tags, i.eastward. horizontal rules, and <img> tags are exceptions. Horizontal rules can optionally be specified with styles, if you choose to add your ain style exist sure to include the 'width' setting, otherwise the manner information will be discarded. Epitome tags tin used, but calibre does not provide the ability to add the image during conversion, this must be done afterwards the fact using the 'Edit book' feature.

Example prototype tag (place the image inside an 'Images' folder within the EPUB later on conversion):

<img style="width:ten%" src="../Images/scenebreak.png" />

Instance horizontal rule with styles:

<hr way="width:20%;padding-top: 1px;border-top: 2px ridge black;border-bottom: 2px groove blackness;"/>

Remove unnecessary hyphens

calibre volition analyze all hyphenated content in the document when this selection is enabled. The document itself is used as a dictionary for analysis. This allows calibre to accurately remove hyphens for any words in the document in any language, along with made-upwardly and obscure scientific words. The primary drawback is words appearing but a single time in the document will non be changed. Analysis happens in two passes, the first pass analyzes line endings. Lines are only unwrapped if the word exists with or without a hyphen in the document. The 2nd pass analyzes all hyphenated words throughout the document, hyphens are removed if the give-and-take exists elsewhere in the document without a lucifer.

Italicize common words and patterns

When enabled, calibre will expect for common words and patterns that denote italics and italicize them. Examples are mutual text conventions such as ~word~ or phrases that should generally exist italicized, e.g. latin phrases like 'etc.' or 'et cetera'.

Replace entity indents with CSS indents

Some documents use a convention of defining text indents using non-breaking space entities. When this option is enabled calibre will endeavor to detect this sort of formatting and catechumen them to a 3% text indent using CSS.

Search & replace¶

These options are useful primarily for conversion of PDF documents or OCR conversions, though they can likewise be used to fix many document specific problems. As an instance, some conversions can leaves backside folio headers and footers in the text. These options use regular expressions to endeavor and detect headers, footers, or other arbitrary text and remove or supervene upon them. Remember that they operate on the intermediate XHTML produced by the conversion pipeline. There is a sorcerer to assist you customize the regular expressions for your certificate. Click the magic wand beside the expression box, and click the 'Test' push after composing your search expression. Successful matches volition be highlighted in Yellow.

The search works by using a Python regular expression. All matched text is simply removed from the document or replaced using the replacement pattern. The replacement pattern is optional, if left blank then text matching the search pattern will exist deleted from the document. You can larn more than virtually regular expressions and their syntax at All almost using regular expressions in calibre.

Structure detection¶

Structure detection involves calibre trying its best to detect structural elements in the input document, when they are non properly specified. For example, chapters, folio breaks, headers, footers, etc. Every bit you can imagine, this process varies widely from book to volume. Fortunately, calibre has very powerful options to control this. With power comes complexity, simply if one time you accept the time to learn the complexity, you will discover information technology well worth the effort.

Capacity and page breaks¶

calibre has two sets of options for chapter detection and inserting page breaks. This can sometimes be slightly confusing, as past default, calibre will insert page breaks earlier detected chapters likewise as the locations detected by the page breaks option. The reason for this is that there are oftentimes location where page breaks should be inserted that are not chapter boundaries. Also, detected chapters can exist optionally inserted into the machine generated Table of Contents.

calibre uses XPath, a powerful language to allow the user to specify chapter boundaries/page breaks. XPath tin seem a little daunting to use at commencement, fortunately, there is a XPath tutorial in the User Transmission. Remember that Structure detection operates on the intermediate XHTML produced by the conversion pipeline. Apply the debug option described in the Introduction to effigy out the advisable settings for your book. There is too a button for a XPath wizard to help with the generation of simple XPath expressions.

By default, calibre uses the following expression for detecting chapters:

                                //*                [((                name                ()                =                'h1'                or                proper noun                ()                =                'h2'                )                and                re                :                examination                (                .                ,                'chapter|book|department|function\due south+'                ,                'i'                ))                or                @class                =                'chapter'                ]              

This expression is rather circuitous, because it tries to handle a number of mutual cases simultaneously. What information technology means is that calibre volition assume chapters kickoff at either <h1> or <h2> tags that have any of the words (affiliate, book, section or role) in them or that take the class="chapter" attribute.

A related option is Affiliate mark, which allows you lot to control what calibre does when it detects a chapter. By default, it will insert a folio break before the affiliate. You tin can have information technology insert a ruled line instead of, or in addition to the page suspension. You lot can also take it do aught.

The default setting for detecting folio breaks is:

                                //*                [                name                ()                =                'h1'                or                name                ()                =                'h2'                ]              

which means that calibre volition insert folio breaks earlier every <h1> and <h2> tag by default.

Annotation

The default expressions may change depending on the input format y'all are converting.

Miscellaneous¶

There are a few more options in this department.

Insert metadata as page at start of book

One of the swell things about calibre is that it allows you lot to maintain very complete metadata about all of your books, for instance, a rating, tags, comments, etc. This pick will create a unmarried page with all this metadata and insert it into the converted east-book, typically simply after the cover. Think of information technology as a fashion to create your own customised book jacket.

Remove first image

Sometimes, the source document you are converting includes the cover equally role of the book, instead of as a carve up embrace. If you likewise specify a cover in calibre, and then the converted book volition have two covers. This option volition simply remove the commencement image from the source document, thereby ensuring that the converted book has only one encompass, the i specified in calibre.

Table of Contents¶

When the input certificate has a Tabular array of Contents in its metadata, calibre will only use that. Nonetheless, a number of older formats either do not support a metadata based Tabular array of Contents, or individual documents practise not have one. In these cases, the options in this section can help you automatically generate a Table of Contents in the converted e-book, based on the bodily content in the input certificate.

Note

Using these options can be a little challenging to become exactly right. If you prefer creating/editing the Table of Contents past hand, convert to the EPUB or AZW3 formats and select the checkbox at the lesser of the Table of Contents section of the conversion dialog that says Manually fine-melody the Tabular array of Contents after conversion. This will launch the ToC Editor tool later on the conversion. It allows y'all to create entries in the Table of Contents past simply clicking the place in the book where you lot want the entry to indicate. You can as well use the ToC Editor past itself, without doing a conversion. Go to Preferences → Interface → Toolbars and add the ToC Editor to the main toolbar. So just select the volume you want to edit and click the ToC Editor button.

The starting time pick is Forcefulness use of auto-generated Table of Contents. Past checking this choice yous can accept calibre override any Table of Contents constitute in the metadata of the input document with the auto generated 1.

The default way that the creation of the automobile generated Table of Contents works is that, calibre will commencement endeavor to add together any detected chapters to the generated table of contents. You can larn how to customize the detection of chapters in the Structure detection department above. If you practice not want to include detected chapters in the generated table of contents, check the Do not add together detected capacity option.

If less than the Chapter threshold number of chapters were detected, calibre will then add any hyperlinks it finds in the input document to the Table of Contents. This often works well: many input documents include a hyperlinked Table of Contents correct at the kickoff. The Number of links option can be used to control this behavior. If set to cypher, no links are added. If fix to a number greater than zero, at most that number of links is added.

calibre will automatically filter duplicates from the generated Table of Contents. Even so, if in that location are some boosted undesirable entries, you tin filter them using the TOC Filter option. This is a regular expression that volition match the championship of entries in the generated table of contents. Whenever a lucifer is institute, information technology will be removed. For example, to remove all entries titles "Next" or "Previous" use:

The Level 1,2,three TOC options permit you to create a sophisticated multi-level Table of Contents. They are XPath expressions that match tags in the intermediate XHTML produced by the conversion pipeline. See the Introduction for how to become access to this XHTML. Also read the XPath tutorial, to learn how to construct XPath expressions. Side by side to each option is a button that launches a magician to help with the creation of basic XPath expressions. The following simple example illustrates how to use these options.

Suppose you have an input certificate that results in XHTML that look similar this:

                            <              html              xmlns              =              "http://world wide web.w3.org/1999/xhtml"              >              <              head              >              <              title              >Sample document</              title              >              </              head              >              <              body              >              <              h1              >Affiliate 1</              h1              >              ...              <              h2              >Section 1.one</              h2              >              ...              <              h2              >Section i.2</              h2              >              ...              <              h1              >Chapter 2</              h1              >              ...              <              h2              >Department 2.1</              h2              >              ...              </              torso              >              </              html              >            

So, we set the options every bit:

                            Level              1              TOC              :              //              h              :              h1              Level              two              TOC              :              //              h              :              h2            

This will issue in an automatically generated 2 level Table of Contents that looks like:

                            Affiliate              one              Section              1.ane              Section              1.two              Affiliate              ii              Section              2.ane            

Warning

Non all output formats support a multi level Table of Contents. Y'all should kickoff try with EPUB output. If that works, then effort your format of selection.

Using images as chapter titles when converting HTML input documents¶

Suppose y'all want to use an image as your affiliate title, merely still desire calibre to be able to automatically generate a Table of Contents for you from the chapter titles. Use the post-obit HTML markup to accomplish this:

                            <              html              >              <              body              >              <              h2              >Chapter 1</              h2              >              <              p              >affiliate i text...</              p              >              <              h2              title              =              "Affiliate 2"              ><              img              src              =              "chapter2.jpg"              /></              h2              >              <              p              >chapter two text...</              p              >              </              body              >              </              html              >            

Set the Level 1 TOC setting to //h:h2 . Then, for affiliate two, calibre will take the title from the value of the title attribute on the <h2> tag, since the tag has no text.

Using tag attributes to supply the text for entries in the Tabular array of Contents¶

If you accept peculiarly long affiliate titles and want shortened versions in the Tabular array of Contents, yous tin utilise the title attribute to reach this, for example:

                            <              html              >              <              body              >              <              h2              title              =              "Affiliate i"              >Affiliate 1: Some very long title</              h2              >              <              p              >chapter 1 text...</              p              >              <              h2              title              =              "Affiliate 2"              >Chapter 2: Some other very long title</              h2              >              <              p              >chapter 2 text...</              p              >              </              trunk              >              </              html              >            

Set the Level 1 TOC setting to //h:h2/@title . And then calibre will accept the title from the value of the title aspect on the <h2> tags, instead of using the text inside the tag. Note the trailing /@title on the XPath expression, you tin can use this class to tell calibre to get the text from any aspect you like.

How options are set/saved for conversion¶

There are ii places where conversion options can be gear up in calibre. The outset is in Preferences->Conversion. These settings are the defaults for the conversion options. Whenever yous try to convert a new book, the settings ready here will be used by default.

Yous can besides change settings in the conversion dialog for each book conversion. When you lot catechumen a book, calibre remembers the settings you used for that book, and then that if you lot catechumen it once again, the saved settings for the individual book will have precedence over the defaults ready in Preferences. You can restore the individual settings to defaults past using the Restore defaults button in the individual book conversion dialog. You can remove the saved settings for a grouping of books by selecting all the books and then clicking the Edit metadata button to bring up the bulk metadata edit dialog, near the bottom of the dialog is an option to remove stored conversion settings.

When you majority convert a set of books, settings are taken in the post-obit order (last one wins):

  • From the defaults ready in Preferences->Conversion

  • From the saved conversion settings for each book being converted (if any). This tin be turned off by the pick in the summit left corner of the Majority conversion dialog.

  • From the settings fix in the Bulk conversion dialog

Notation that the final settings for each book in a Bulk conversion will be saved and re-used if the volume is converted again. Since the highest priority in Bulk Conversion is given to the settings in the Bulk conversion dialog, these will override whatsoever book specific settings. So yous should merely bulk catechumen books together that need like settings. The exceptions are metadata and input format specific settings. Since the Majority conversion dialog does not have settings for these two categories, they will be taken from volume specific settings (if any) or the defaults.

Note

Yous can see the actual settings used during any conversion by clicking the rotating icon in the lower right corner and so double clicking the private conversion task. This will bring upwardly a conversion log that will contain the bodily settings used, near the top.

Format specific tips¶

Here you volition find tips specific to the conversion of particular formats. Options specific to item format, whether input or output are available in the conversion dialog under their own section, for example TXT input or EPUB output.

Catechumen Microsoft Word documents¶

calibre can automatically convert .docx files created by Microsoft Give-and-take 2007 and newer. Just add the file to calibre and click convert.

Note

There is a demo .docx file that demonstrates the capabilities of the calibre conversion engine. Just download it and convert it to EPUB or AZW3 to see what calibre tin do.

calibre will automatically generate a Table of Contents based on headings if you lot marker your headings with the Heading ane , Heading 2 , etc. styles in Microsoft Word. Open the output e-book in the calibre E-book viewer and click the Tabular array of Contents button to view the generated Tabular array of Contents.

Older .medico files¶

For older .doc files, you lot can save the document as HTML with Microsoft Word and and so catechumen the resulting HTML file with calibre. When saving as HTML, be sure to use the "Save as Web Page, Filtered" pick as this will produce make clean HTML that will convert well. Note that Give-and-take produces really messy HTML, converting it tin have a long time, so be patient. If you have a newer version of Word bachelor, you tin directly save it as .docx equally well.

Another alternative is to use the free LibreOffice. Open your .doc file in LibreOffice and salve information technology every bit .docx, which can be straight converted in calibre.

Convert TXT documents¶

TXT documents take no well defined style to specify formatting like assuming, italics, etc, or certificate structure similar paragraphs, headings, sections and then on, but there are a variety of conventions commonly used. By default calibre attempts automatic detection of the correct formatting and markup based on those conventions.

TXT input supports a number of options to differentiate how paragraphs are detected.

Paragraph fashion: Machine

Analyzes the text file and attempts to automatically make up one's mind how paragraphs are defined. This pick volition more often than not work fine, if you achieve undesirable results effort one of the manual options.

Paragraph way: Block

Assumes 1 or more blank lines are a paragraph boundary:

                                                This                        is                        the                        beginning                        .                        This                        is                        the                        second                        paragraph                        .                      
Paragraph fashion: Single

Assumes that every line is a paragraph:

                                                This                        is                        the                        first                        .                        This                        is                        the                        second                        .                        This                        is                        the                        third                        .                      
Paragraph style: Print

Assumes that every paragraph starts with an indent (either a tab or 2+ spaces). Paragraphs end when the next line that starts with an indent is reached:

                                                This                        is                        the                        beginning                        .                        This                        is                        the                        2nd                        .                        This                        is                        the                        third                        .                      
Paragraph style: Unformatted

Assumes that the document has no formatting, but does use hard line breaks. Punctuation and median line length are used to effort to re-create paragraphs.

Formatting manner: Auto

Attempts to detect the type of formatting markup being used. If no markup is used so heuristic formatting will be applied.

Formatting way: Heuristic

Analyzes the document for mutual chapter headings, scene breaks, and italicized words and applies the appropriate HTML markup during conversion.

Formatting style: Markdown

calibre too supports running TXT input though a transformation preprocessor known as Markdown. Markdown allows for basic formatting to exist added to TXT documents, such as bold, italics, section headings, tables, lists, a Table of Contents, etc. Marking chapter headings with a leading # and setting the affiliate XPath detection expression to "//h:h1" is the easiest style to have a proper tabular array of contents generated from a TXT document. You tin learn more than most the Markdown syntax at daringfireball.

Formatting mode: None

Applies no special formatting to the text, the document is converted to HTML with no other changes.

Convert PDF documents¶

PDF documents are one of the worst formats to convert from. They are a stock-still folio size and text placement format. Significant, information technology is very difficult to decide where i paragraph ends and another begins. calibre volition try to unwrap paragraphs using a configurable, Line united nations-wrapping factor. This is a calibration used to determine the length at which a line should exist unwrapped. Valid values are a decimal between 0 and 1. The default is 0.45, just under the median line length. Lower this value to include more than text in the unwrapping. Increase to include less. You tin arrange this value in the conversion settings under PDF Input.

Likewise, they frequently take headers and footers as part of the document that will become included with the text. Use the Search and replace panel to remove headers and footers to mitigate this event. If the headers and footers are non removed from the text it tin can throw off the paragraph unwrapping. To larn how to use the header and footer removal options, read All about using regular expressions in calibre.

Some limitations of PDF input are:

  • Complex, multi-column, and image based documents are not supported.

  • Extraction of vector images and tables from within the document is also non supported.

  • Some PDFs use special glyphs to represent ll or ff or fi, etc. Conversion of these may or may non work depending on just how they are represented internally in the PDF.

  • Links and Tables of Contents are not supported

  • PDFs that use embedded non-Unicode fonts to represent not-English characters will result in garbled output for those characters

  • Some PDFs are made up of photographs of the page with OCRed text backside them. In such cases calibre uses the OCRed text, which can be very different from what you encounter when you view the PDF file

  • PDFs that are used to display complex text, like right to left languages and math typesetting will non catechumen correctly

To re-iterate PDF is a really, really bad format to use as input. If you absolutely must utilize PDF, and then exist prepared for an output ranging anywhere from decent to unusable, depending on the input PDF.

Comic book collections¶

A comic book collection is a .cbc file. A .cbc file is a ZIP file that contains other CBZ/CBR files. In addition the .cbc file must contain a elementary text file called comics.txt, encoded in UTF-8. The comics.txt file must contain a list of the comics files inside the .cbc file, in the class filename:title, as shown below:

                                1                .                cbz                :                Chapter                I                two                .                cbz                :                Chapter                Two                three                .                cbz                :                Affiliate                Three              

The .cbc file volition then comprise:

                                comics                .                txt                one                .                cbz                2                .                cbz                three                .                cbz              

calibre will automatically convert this .cbc file into a e-volume with a Table of Contents pointing to each entry in comics.txt.

EPUB advanced formatting demo¶

Various advanced formatting for EPUB files is demonstrated in this demo file. The file was created from hand coded HTML using calibre and is meant to exist used as a template for your own EPUB creation efforts.

The source HTML information technology was created from is available demo.null. The settings used to create the EPUB from the ZIP file are:

                                ebook                -                catechumen                demo                .                zip                .                epub                -                vv                --                authors                "Kovid Goyal"                --                language                en                --                level1                -                toc                '//*[@course="title"]'                --                disable                -                font                -                rescaling                --                page                -                breaks                -                earlier                /                --                no                -                default                -                epub                -                comprehend              

Annotation that because this file explores the potential of EPUB, nigh of the avant-garde formatting is not going to work on readers less capable than calibre'due south built-in EPUB viewer.

Convert ODT documents¶

calibre can direct convert ODT (OpenDocument Text) files. You should use styles to format your certificate and minimize the use of straight formatting. When inserting images into your certificate you need to anchor them to the paragraph, images anchored to a page will all end upward in the front of the conversion.

To enable automatic detection of chapters, you demand to mark them with the built-in styles called Heading ane, Heading 2, …, Heading half-dozen (Heading 1 equates to the HTML tag <h1> , Heading 2 to <h2> , etc). When y'all catechumen in calibre you can enter which style you lot used into the Detect chapters at box. Example:

  • If you lot mark Chapters with style Heading ii, y'all have to fix the 'Discover chapters at' box to //h:h2

  • For a nested TOC with Sections marked with Heading ii and the Capacity marked with Heading 3 you need to enter //h:h2|//h:h3 . On the Convert - TOC page set the Level 1 TOC box to //h:h2 and the Level 2 TOC box to //h:h3 .

Well-known document properties (Championship, Keywords, Description, Creator) are recognized and calibre will use the first image (non to small, and with adept aspect-ratio) as the encompass image.

In that location is also an advanced property conversion style, which is activated past setting the custom property opf.metadata ('Yes or No' blazon) to Yes in your ODT document (File->Backdrop->Custom Backdrop). If this property is detected by calibre, the following custom properties are recognized ( opf.authors overrides document creator):

                                opf                .                titlesort                opf                .                authors                opf                .                authorsort                opf                .                publisher                opf                .                pubdate                opf                .                isbn                opf                .                linguistic communication                opf                .                series                opf                .                seriesindex              

In addition to this, you can specify the motion picture to apply as the comprehend past naming it opf.comprehend (right click, Picture->Options->Proper name) in the ODT. If no picture with this name is found, the 'smart' method is used. As the cover detection might consequence in double covers in certain output formats, the process will remove the paragraph (simply if the only content is the cover!) from the certificate. Merely this works just with the named picture!

To disable cover detection you can set the custom property opf.nocover ('Yep or No' type) to Yeah in advanced fashion.

Converting to PDF¶

The first, most important, setting to decide on when converting to PDF is the folio size. By default, calibre uses a page size of "U.S. Letter of the alphabet". You can change this to another standard page size or a completely custom size in the PDF Output department of the conversion dialog. If you are generating a PDF to exist used on a specific device, you can plow on the option to use the folio size from the output profile instead. So if your output profile is set to Kindle, calibre will create a PDF with page size suitable for viewing on the small-scale Kindle screen.

Printable Table of Contents¶

You can as well insert a printable Table of Contents at the end of the PDF that lists the page numbers for every section. This is very useful if you intend to print out the PDF to paper. If you wish to employ the PDF on an electronic device, then the PDF Outline provides this functionality and is generated by default.

You can customize the look of the generated Tabular array of contents by using the Extra CSS conversion setting under the Await & feel function of the conversion dialog. The default CSS used is listed below, simply copy it and make whatever changes yous like.

                                    .                  calibre-pdf-toc                                                      tabular array                                                      {                                                      width                  :                                                      100                  %%                                                      }                                    .                  calibre-pdf-toc                                                      table                                                      tr                                                      td                  :                  final-of-type                                                      {                                                      text-align                  :                                                      right                                                      }                                    .                  calibre-pdf-toc                                                      .                  level-0                                                      {                                                                        font-size                  :                                                      larger                  ;                                    }                                    .                  calibre-pdf-toc                                                      .                  level-1                                                      td                  :                  starting time-of-blazon                                                      {                                                      padding-left                  :                                                      1.4                  em                                                      }                                    .                  calibre-pdf-toc                                                      .                  level-two                                                      td                  :                  first-of-type                                                      {                                                      padding-left                  :                                                      2.viii                  em                                                      }                                  

Custom page margins for individual HTML files¶

If you are converting an EPUB or AZW3 file with multiple individual HTML files inside it and you want to change the folio margins for a particular HTML file yous can add the following fashion block to the HTML file using the calibre Eastward-book editor:

                                    <                  style                  >                                    @                  page                                                      {                                                                        margin-left                  :                                                      10pt                  ;                                                                        margin-correct                  :                                                      10pt                  ;                                                                        margin-elevation                  :                                                      10pt                  ;                                                                        margin-bottom                  :                                                      10pt                  ;                                    }                                    </                  fashion                  >                

Then, in the PDF output section of the conversion dialog, turn on the choice to Use page margins from the document being converted. Now all pages generated from this HTML file will take 10pt margins.

mccormickphrehing.blogspot.com

Source: https://manual.calibre-ebook.com/conversion.html

0 Response to "How to Upload Comics to Kindle Calibre"

Post a Comment

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel