Commit 9455de21 authored by Guillaume Lazzara

io/xml/save_text_lines.hh: New. Add partial support for PageContent XML format.

parent 03f61d76
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
* io/xml/save_text_lines.hh: New. Add partial support for
PageContent XML format.
2010-03-11 Guillaume Lazzara <z@lrde.epita.fr>
Introduce new Scribo core classes and start using them.
@@ -48,58 +53,58 @@
Add anchor support in debug routines.
* scribo/debug/alignment_decision_image.hh,
* scribo/debug/links_decision_image.hh,
* scribo/debug/save_linked_bboxes_image.hh,
* scribo/draw/bounding_box_links.hh: Make use of anchor points to
* debug/alignment_decision_image.hh,
* debug/links_decision_image.hh,
* debug/save_linked_bboxes_image.hh,
* draw/bounding_box_links.hh: Make use of anchor points to
draw debug outputs.
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
Add new link filters.
* scribo/filter/object_links_non_aligned_simple.hh: Handle new
* filter/object_links_non_aligned_simple.hh: Handle new
cases.
* scribo/filter/object_links_left_aligned.hh,
* scribo/filter/object_links_right_aligned.hh: New filters.
* filter/object_links_left_aligned.hh,
* filter/object_links_right_aligned.hh: New filters.
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
Improve object linking backend.
* scribo/primitive/internal/find_left_link.hh,
* scribo/primitive/internal/find_right_link.hh,
* scribo/primitive/internal/is_invalid_link.hh: Remove.
* primitive/internal/find_left_link.hh,
* primitive/internal/find_right_link.hh,
* primitive/internal/is_invalid_link.hh: Remove.
* scribo/primitive/link/internal/compute_anchor.hh,
* scribo/primitive/link/internal/link_ms_dmax_base.hh,
* scribo/primitive/link/internal/link_ms_dmax_ratio_base.hh,
* scribo/primitive/link/internal/link_single_dmax_base.hh,
* scribo/primitive/link/internal/link_single_dmax_ratio_base.hh,
* scribo/primitive/link/with_single_down_link.hh,
* scribo/primitive/link/with_single_left_link.hh,
* scribo/primitive/link/with_single_left_link_dmax_ratio.hh,
* scribo/primitive/link/with_single_right_link.hh,
* scribo/primitive/link/with_single_right_link_dmax_ratio.hh,
* scribo/primitive/link/with_single_up_link.hh: Introduce the
* primitive/link/internal/compute_anchor.hh,
* primitive/link/internal/link_ms_dmax_base.hh,
* primitive/link/internal/link_ms_dmax_ratio_base.hh,
* primitive/link/internal/link_single_dmax_base.hh,
* primitive/link/internal/link_single_dmax_ratio_base.hh,
* primitive/link/with_single_down_link.hh,
* primitive/link/with_single_left_link.hh,
* primitive/link/with_single_left_link_dmax_ratio.hh,
* primitive/link/with_single_right_link.hh,
* primitive/link/with_single_right_link_dmax_ratio.hh,
* primitive/link/with_single_up_link.hh: Introduce the
anchor concept and make use of it.
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
* scribo/filter/objects_with_holes.hh: New component filter.
* filter/objects_with_holes.hh: New component filter.
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
* scribo/draw/bounding_boxes.hh: Do not draw box centers anymore.
* draw/bounding_boxes.hh: Do not draw box centers anymore.
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
Add dedicated routines for AFP's use case.
* scribo/src/afp/components.hh,
* scribo/src/afp/link.hh,
* scribo/src/afp/regroup.hh: New.
* src/afp/components.hh,
* src/afp/link.hh,
* src/afp/regroup.hh: New.
2010-02-19 Guillaume Lazzara <z@lrde.epita.fr>
......
// Copyright (C) 2010 EPITA Research and Development Laboratory (LRDE)
//
// This file is part of Olena.
//
// Olena is free software: you can redistribute it and/or modify it under
// the terms of the GNU General Public License as published by the Free
// Software Foundation, version 2 of the License.
//
// Olena is distributed in the hope that it will be useful,
// but WITHOUT ANY WARRANTY; without even the implied warranty of
// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
// General Public License for more details.
//
// You should have received a copy of the GNU General Public License
// along with Olena. If not, see <http://www.gnu.org/licenses/>.
//
// As a special exception, you may use this file as part of a free
// software project without restriction. Specifically, if other files
// instantiate templates or use macros or inline functions from this
// file, or you compile this file and link it with other files to produce
// an executable, this file does not by itself cause the resulting
// executable to be covered by the GNU General Public License. This
// exception does not however invalidate any other reasons why the
// executable file might be covered by the GNU General Public License.
#ifndef SCRIBO_IO_XML_SAVE_TEXT_LINES_HH
# define SCRIBO_IO_XML_SAVE_TEXT_LINES_HH
/// \file
///
/// \brief Save text line information as XML.
#include <cstdlib>
#include <fstream>
#include <iostream>
#include <sstream>
#include <string>
namespace scribo
{
namespace io
{
namespace xml
{
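/*! \brief Save text line information as XML.
We use an XML Schema that is part of the PAGE (Page Analysis and
Ground truth Elements) image representation framework.
This schema was used in the Page Segmentation COMPetition
(PSCOMP) for ICDAR 2009.
Its XSD file is located here:
http://schema.primaresearch.org/PAGE/gts/pagecontent/2009-03-16/pagecontent.xsd
\param[in] input_name Name of the input image, recorded in the XML output.
\param[in] lines The text lines to save.
\param[in] output_name Path of the XML file to write.
*/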
template <typename L>
void
save_text_lines(const std::string& input_name,
const line_set<L>& lines,
const std::string& output_name);
# ifndef MLN_INCLUDE_ONLY
template <typename L>
void
save_text_lines(const std::string& input_name,
const line_set<L>& lines,
const std::string& output_name)
{
trace::entering("scribo::io::xml::save_text_lines");
std::ofstream file(output_name.c_str());
if (! file)
{
std::cerr << "error: cannot open file '" << output_name << "'!" << std::endl;
abort();
}
file << "<?xml version=\"1.0\"?>" << std::endl;
file << "<pcGts xmlns=\"http://schema.primaresearch.org/PAGE/gts/pagecontent/2009-03-16\" xmlns:xsi=\"http://www.w3.org/2001/XMLSchema-instance\" xsi:schemaLocation=\"http://schema.primaresearch.org/PAGE/gts/pagecontent/2009-03-16 http://schema.primaresearch.org/PAGE/gts/pagecontent/2009-03-16/pagecontent.xsd\" pcGtsId=\"" << input_name << "\">" << std::endl;
file << " <pcMetadata>" << std::endl;
file << " <pcCreator>LRDE</pcCreator>" << std::endl;
file << " <pcCreated/>" << std::endl;
file << " <pcLastChange/>" << std::endl;
file << " <pcComments/>" << std::endl;
file << " </pcMetadata>" << std::endl;
file << " <page image_filename=\"" << input_name
<< "\" image_width=\"" << lines.component_set_().labeled_image().ncols()
<< "\" image_height=\"" << lines.component_set_().labeled_image().nrows()
<< "\">" << std::endl;
for_all_lines(l, lines)
{
file << " <text_region id=\"" << lines(l).id()
<< "\" txt_orientation=\"0.000\" "
<< "txt_reading_orientation=\"0.000\" "
<< "txt_reading_direction=\"Left_To_Right\" "
<< "txt_reverse_video=\"No\" "
<< "txt_indented=\"No\">"
<< std::endl;
// Corners in polygon order (top-left, top-right, bottom-right,
// bottom-left); x is the column and y is the row.
file << " <coords>" << std::endl
<< " <point x=\"" << lines(l).bbox().pmin().col()
<< "\" y=\"" << lines(l).bbox().pmin().row() << "\"/>"
<< std::endl
<< " <point x=\"" << lines(l).bbox().pmax().col()
<< "\" y=\"" << lines(l).bbox().pmin().row() << "\"/>"
<< std::endl
<< " <point x=\"" << lines(l).bbox().pmax().col()
<< "\" y=\"" << lines(l).bbox().pmax().row() << "\"/>"
<< std::endl
<< " <point x=\"" << lines(l).bbox().pmin().col()
<< "\" y=\"" << lines(l).bbox().pmax().row() << "\"/>"
<< std::endl
<< " </coords>" << std::endl;
file << " </text_region>" << std::endl;
}
file << " </page>" << std::endl;
file << "</pcGts>" << std::endl;
trace::exiting("scribo::io::xml::save_text_lines");
}
# endif // ! MLN_INCLUDE_ONLY
} // end of namespace scribo::io::xml
} // end of namespace scribo::io
} // end of namespace scribo
#endif // ! SCRIBO_IO_XML_SAVE_TEXT_LINES_HH
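
The region-writing step of this header can be tried in isolation. The following is only a minimal standalone sketch, not the Scribo implementation: `Point`, `Box`, `escape_xml_attr` and `write_text_region` are hypothetical names introduced for illustration, and the sketch assumes that x is the horizontal (column) coordinate, that y is the vertical (row) coordinate, and that corners should be listed in polygon order. It also escapes attribute values, which matters when a file name contains characters such as `&` or `"`.

```cpp
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical stand-ins for Milena's point and box types.
struct Point { int row; int col; };
struct Box { Point pmin; Point pmax; };

// Escape the five characters that XML reserves, so that a string can be
// placed safely inside a double-quoted attribute value.
std::string escape_xml_attr(const std::string& in)
{
  std::string out;
  out.reserve(in.size());
  for (char c : in)
    switch (c)
    {
      case '&':  out += "&amp;";  break;
      case '<':  out += "&lt;";   break;
      case '>':  out += "&gt;";   break;
      case '"':  out += "&quot;"; break;
      case '\'': out += "&apos;"; break;
      default:   out += c;
    }
  return out;
}

// Write one text_region element. The four points trace the bounding box
// from the top-left corner: top-left, top-right, bottom-right, bottom-left.
// Assumption: x comes from the column and y from the row.
void write_text_region(std::ostream& os, const std::string& id, const Box& b)
{
  os << "    <text_region id=\"" << escape_xml_attr(id) << "\">\n"
     << "      <coords>\n"
     << "        <point x=\"" << b.pmin.col << "\" y=\"" << b.pmin.row << "\"/>\n"
     << "        <point x=\"" << b.pmax.col << "\" y=\"" << b.pmin.row << "\"/>\n"
     << "        <point x=\"" << b.pmax.col << "\" y=\"" << b.pmax.row << "\"/>\n"
     << "        <point x=\"" << b.pmin.col << "\" y=\"" << b.pmax.row << "\"/>\n"
     << "      </coords>\n"
     << "    </text_region>\n";
}
```

Calling `write_text_region(std::cout, "7", Box{Point{1, 2}, Point{3, 4}})` writes a single, properly closed `text_region` element. Opening and closing the element inside the same function keeps each region balanced however many lines are written.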