General :: Convert A Windows-Like Text File Into *nix-Like UTF-8 Encoding Automatically?
Mar 23, 2011
I want to convert many text files (copied from a Windows workstation) into UTF-8 encoded files. Yes, iconv can do this. However, I have to give the source file encoding as a command-line parameter! The problem is that, in most cases, I am not sure what the source encoding is. I also want to use a script to convert many files recursively.
Jun 24, 2011
There are many plain text files which were encoded in various charsets.
I want to convert them all to UTF-8, but before running iconv, I need to know each file's original encoding. Most browsers have an Auto Detect option for encodings; however, I can't check those text files one by one because there are too many.
Only once I know the original encoding can I convert the text with iconv -f DETECTED_CHARSET -t utf-8.
Is there any utility to detect the encoding of plain text files? It DOES NOT have to be 100% perfect; I don't mind if 100 files out of 1,000,000 are misconverted.
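As a rough illustration of what such a detector does, here is a minimal sketch in Python. It is not an existing utility: the function name guess_encoding and the fallback list (cp1252, then latin-1) are assumptions made for the example. It checks for a byte-order mark, then tests whether the bytes are valid UTF-8, and only then falls back to legacy single-byte code pages.

```python
import codecs

# BOM signatures, longest first so that UTF-32 is not mistaken for UTF-16.
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def guess_encoding(data, fallbacks=("cp1252", "latin-1")):
    """Return a plausible codec name for the bytes in `data`, or None.

    `fallbacks` is an assumed list of legacy encodings to try in order;
    adjust it to whatever the files are likely to contain.
    """
    for bom, name in BOMS:
        if data.startswith(bom):
            return name
    try:
        data.decode("utf-8")  # pure ASCII also passes this check
        return "utf-8"
    except UnicodeDecodeError:
        pass
    for name in fallbacks:
        try:
            data.decode(name)
            return name
        except UnicodeDecodeError:
            continue
    return None
```

The name returned for a file could then be passed to iconv -f. Note that a check like this cannot tell one single-byte code page from another (latin-1 accepts any byte sequence), so it only fits the "does not have to be perfect" requirement when the set of likely source encodings is small.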
Jan 24, 2010
What is a good way to do it? I have been trying to do it using sed.
Aug 23, 2011
I need to be able to convert HTML email messages saved as text files (.eml or .msg) to PDF documents, one PDF per email, retaining formatting and images.
Are there any Linux tools that will allow me to do this from the command line (so it can be scripted)?
Dec 27, 2010
I have a PDF file on my Linux RHEL 4.7 machine. I can open the file, but when I click 'Save As' to save it in 'Text' format, I see no such option. I need to save the PDF file in text format. Could anyone tell me how to do this? I am using KDE.
Mar 8, 2010
I'm trying to convert an HTML file into a text file. When I simply run "html2text <filename>", the displayed output is the way we want, but when I redirect it using "-o" or ">>", the file has extra characters in it. I even tried -ascii, but it was not much use.
Mar 3, 2011
I was working on a text file in OpenOffice Writer last night, and this morning I tried to open it and a little dialog popped up saying "ASCII Filter Options," prompting me for which character set, language and default fonts I wanted to use (only after it asked if I wanted to recover the document, which I did since it was asking). Now the document, which was very important, is a scrambled bunch of random characters! I cannot find the correct character set. Why is it prompting me for which character encoding I am using in the first place?
Sep 14, 2010
I am collecting USB usage details for all users and want to convert them into CSV files so that I can export them into a database. The desired output is CSV for the database, produced with a batch or awk script.
Jul 21, 2011
In order to make this conversion I have to use a text editor. This is tedious. Is there an easier way to do it, like some program I can run from the Linux or OSX terminal?
Nov 12, 2009
I am new to Linux. May I know how I can convert a core dump file to a readable text file that includes all the information in the core dump, such as all variables, thread information, the call trace for each task, and so on? I know GDB can be used to view this, but it won't dump all the information to one text file. Sometimes people want to find the reason for a core dump without a Linux environment.
Sep 28, 2009
I have on my Windows machine several hundred files in .nc and .ncs format for a CNC machine. I need to convert them to .txt, which is as easy as opening each one in Notepad and saving it as .txt, but there are so many that doing it this way would take far too long.
The reason I am writing to linuxquestions is that I would feel more comfortable loading a live CD and using some sort of terminal command to do this than downloading one of the many "freeware" programs I have found for Windows (even more so since I have had a rootkit before and had to start all over to get rid of it).
I need to know:
1. Is this possible to do with the terminal without super advanced knowledge?
2. Can someone please point me in the right direction: something to read or an example?
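Since opening the files in Notepad and saving them as .txt suggests they are already plain text, the job may amount to copying each file under a new name. A minimal sketch in Python, assuming exactly that; the function name copy_as_txt is made up for the example, and the originals are left untouched:

```python
import shutil
from pathlib import Path


def copy_as_txt(folder, extensions=(".nc", ".ncs")):
    """Copy every .nc/.ncs file in `folder` to a sibling ending in .txt.

    The full original name is kept (part.nc -> part.nc.txt) so that
    part.nc and part.ncs cannot overwrite each other.
    """
    made = []
    # sorted() builds the full listing first, so new copies are not revisited.
    for path in sorted(Path(folder).iterdir()):
        if path.is_file() and path.suffix.lower() in extensions:
            target = path.with_name(path.name + ".txt")
            shutil.copyfile(path, target)
            made.append(target)
    return made
```

If the CNC files turn out to contain anything other than plain text, this copy changes nothing about their contents and a real converter would be needed.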
Nov 6, 2010
What I am trying to do is convert a mass of files (462 .txt files) from standard Windows text encoding to UNIX encoding. After a bit of searching, I found this bit of code, changed the necessary parts, put in "Sleep" so I could see the output, and ran it:
Code: #!/bin/bash
FROM=iso-8859-1
TO=UTF-8
[code]...
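The script above is elided, so the following is not a reconstruction of it. It is a separate minimal sketch in Python of the same kind of batch job, reusing the FROM/TO values shown (iso-8859-1 to UTF-8). The function name convert_tree, the *.txt pattern and the CRLF-to-LF step are assumptions for illustration.

```python
from pathlib import Path


def convert_tree(root, src="iso-8859-1", dst="utf-8", pattern="*.txt"):
    """Re-encode every matching file under `root` in place; return the count."""
    count = 0
    for path in Path(root).rglob(pattern):
        if not path.is_file():
            continue
        text = path.read_bytes().decode(src)
        # Assumed extra step: turn Windows CRLF line endings into Unix LF.
        text = text.replace("\r\n", "\n")
        path.write_bytes(text.encode(dst))
        count += 1
    return count
```

Because iso-8859-1 accepts any byte sequence, running this twice on the same tree would re-encode files that are already UTF-8 and garble their non-ASCII characters, so it is safer to work on a copy of the files.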
Apr 26, 2010
I have a file of words and want to encode them in a numeric form, based on position. This is best explained by example:
PEOPLE ==> 123152
Reading left to right:
P was first encountered at position 1 so it is encoded as 1.
E was first encountered at position 2 so it is encoded as 2.
[code]...
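The rule described above (each letter is replaced by the 1-based position at which it was first encountered) can be sketched in Python as follows. The function name is made up for the example, and since the rest of the original explanation is elided, the handling of words longer than nine letters is an open assumption: positions 10 and above simply produce multi-digit codes that run together.

```python
def encode_by_first_position(word):
    """Replace each letter with the 1-based position of its first occurrence."""
    first_seen = {}
    codes = []
    for position, letter in enumerate(word, start=1):
        # Remember the position only the first time a letter is seen.
        first_seen.setdefault(letter, position)
        codes.append(str(first_seen[letter]))
    return "".join(codes)


print(encode_by_first_position("PEOPLE"))  # 123152
```

To process a whole file of words, the function could be applied to each stripped line in turn.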
Sep 3, 2010
How can I search files for text stored in a one-byte encoding? Places > Search for Files in GNOME on Ubuntu searches only UTF-8 text. I know one way: install Wine and Total Commander, then search with that. What are better ways? [URL]
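One possible approach, sketched in Python: encode the search string into each candidate encoding and look for the resulting byte sequences in the raw file contents. The post does not say which one-byte encoding is involved, so cp1251 below is only a placeholder assumption, and find_in_files is a made-up name.

```python
from pathlib import Path


def find_in_files(root, needle, encodings=("utf-8", "cp1251")):
    """Return files under `root` containing `needle` in any listed encoding.

    `encodings` is an assumed list; replace cp1251 with the one-byte
    encoding the files actually use. The match is case-sensitive.
    """
    # One byte pattern per encoding; a set drops duplicates for ASCII needles.
    patterns = {needle.encode(enc) for enc in encodings}
    hits = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            data = path.read_bytes()
            if any(pattern in data for pattern in patterns):
                hits.append(path)
    return hits
```

The search string must be representable in every listed encoding, otherwise needle.encode raises UnicodeEncodeError. Reading each file fully into memory is fine for ordinary documents but not for very large files.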
Sep 17, 2010
I have a few ogg files that I want to put on my phone. My phone doesn't play oggs, so I need to convert. It's just an audiobook so I really don't care about quality. Oddly, nothing I try seems to be up to the task. Here's what I tried so far:
sox my.ogg my.mp3: "sox FAIL formats: can't open output file `x.mp3': SoX was compiled without MP3 encoding support"
mencoder: wants video too?? Wha??
vlc (from GUI): "Streaming / Transcoding failed: It seems your FFMPEG (libavcodec) installation lacks the following encoder: MPEG Audio layer 1/2/3. If you don't know how to fix this, ask for support from your distribution. This is not an error inside VLC media player. Do not contact the VideoLAN project about this issue."
lame my.ogg my.mp3: "sorry, vorbis support in LAME is deprecated."
audacity (via GUI): success... but I'm now going to have to somehow script it.
This is very annoying. It makes me want to rip directly into mp3 from now on. Is there an easy way to convert an ogg file to an mp3 from the command line?
Jun 9, 2010
I am looking for some source package which will convert plain text file to html file without using perl.
I mainly need to do this on an ARM platform, so if I get sources I can cross compile it.
May 2, 2011
Can anyone recommend any Ubuntu-compatible software to convert a scanned document image file to text?
Apr 13, 2010
I am working with a simulator tool to which I need to pass a file in .BIN format. Basically, I need to convert a plain text file to a BIN file. How can I do that? Is there some command (or commands) that allows me to do this?
Dec 3, 2010
I have a 2.2 MB gedit text file. I want to split it into two or three smaller files/volumes so I can upload them separately to web pages. Does anyone know a quick and easy way to do this?
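A minimal sketch in Python of one way to do this, splitting on line boundaries so that no line is cut in half. The function name split_text_file and the .partN naming scheme are assumptions for the example.

```python
from pathlib import Path


def split_text_file(path, parts=3):
    """Split `path` into up to `parts` files of roughly equal line counts."""
    path = Path(path)
    # Work on bytes so the original encoding and line endings are preserved.
    lines = path.read_bytes().splitlines(keepends=True)
    per_part = -(-len(lines) // parts)  # ceiling division
    written = []
    for i in range(parts):
        chunk = lines[i * per_part:(i + 1) * per_part]
        if not chunk:
            break
        out = path.with_name(f"{path.stem}.part{i + 1}{path.suffix}")
        out.write_bytes(b"".join(chunk))
        written.append(out)
    return written
```

The pieces are equal in number of lines, not in bytes, so files with very uneven line lengths will give uneven part sizes. Concatenating the parts in order reproduces the original file.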
Nov 17, 2010
I FTPed a text file from Linux to Windows. I changed the mode to ASCII, but it still transferred in BINARY mode. Why? Thanks.
ftp> ascii
---> TYPE A
200 Switching to ASCII mode.
[code]...
Jan 15, 2010
I have been having a problem with K9Copy whenever I try to convert a DVD to MPEG files using the method without encoding: lately it splits each episode into about 6 or 7 pieces instead of just one. I never used to have a problem with this, but now it happens whenever I try to rip something. On a side note, I am running Ubuntu 9.10 and have K9Copy version 2.3.0 installed. I have tried installing several other versions, but each does the same thing.
Jun 7, 2011
I am only using the 128-character set defined in the original ANSI standard. But on the whole, how are the files implemented differently? I am not concerned with the display, i.e. whether a tab is displayed as 6 or 8 characters, but with the actual internal representation in memory.
One difference I've heard of is the use of \r\n (Windows) vs. \n (Linux) for line termination.
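For files limited to 7-bit ASCII, the line terminator is the difference that shows up at the byte level: CR LF (bytes 0x0D 0x0A) on Windows versus a single LF (0x0A) on Linux. A tab is stored as the single byte 0x09 on both, whatever width it is displayed at. A minimal sketch in Python of converting between the two conventions (the function names are made up for the example):

```python
def crlf_to_lf(data):
    """Convert Windows CR LF line endings to Unix LF (operates on bytes)."""
    return data.replace(b"\r\n", b"\n")


def lf_to_crlf(data):
    """Convert Unix LF line endings to Windows CR LF.

    Existing CR LF pairs are normalised first so they are not doubled.
    """
    return crlf_to_lf(data).replace(b"\n", b"\r\n")


windows_text = b"one\ttwo\r\nthree\r\n"
print(crlf_to_lf(windows_text))  # b'one\ttwo\nthree\n'
```

Working on bytes rather than decoded text keeps everything else in the file exactly as it was.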
Jan 1, 2011
Converting a text file to PostScript format is easy with groff: groff -Tps file > file.ps. But what if the input file is UTF-8 (with non-English characters in it)? How can you specify the encoding to groff, or do you have to pipe the file through some other command?
May 20, 2010
How can I convert text to ASCII (to encrypt the text)? And how can we use "hexdump"?
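Text that uses only ASCII characters is already stored as ASCII codes, one byte per character, so "converting" it mostly means displaying those numbers. That is an encoding and not encryption: anyone can reverse it. A minimal sketch in Python that lists the codes and prints a layout loosely modelled on a hex dump (the function names and exact column layout are made up for the example):

```python
def ascii_codes(text):
    """Return the ASCII code of each character (fails on non-ASCII input)."""
    return list(text.encode("ascii"))


def hex_dump(data, width=16):
    """Format bytes as offset, hex values and printable characters."""
    pad = width * 3 - 1  # width of the hex column, including spaces
    lines = []
    for offset in range(0, len(data), width):
        chunk = data[offset:offset + width]
        hex_part = " ".join(f"{b:02x}" for b in chunk)
        printable = "".join(chr(b) if 32 <= b < 127 else "." for b in chunk)
        lines.append(f"{offset:08x}  {hex_part:<{pad}}  |{printable}|")
    return "\n".join(lines)


print(ascii_codes("Hi"))  # [72, 105]
print(hex_dump(b"Hi\n"))
```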
Jun 15, 2010
I need to convert a very large latex project (made up of many .tex and style files) into .html (or something similarly non-.pdf). Can someone recommend a quality converter program? Preferably, one that is:
[Code]....
May 7, 2010
I have a number of text files (26 per database x 100+ databases) which need 'correcting' in order to import into PostgreSQL. I think that I have identified all the problem characters and I need to automate the process as much as possible. I have a script to convert the characters and I do them one by one (not efficient, but easier to understand).
[Code].....
Mar 16, 2011
Is there a command in Linux that can convert binary (0101001010000100) to text, so that it means something?
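Whether such a string "means something" depends on how the bits are supposed to be interpreted. A minimal sketch in Python, assuming the digits are plain 8-bit bytes written most significant bit first (the function name bits_to_bytes is made up for the example):

```python
def bits_to_bytes(bits):
    """Turn a string of 0s and 1s into bytes, eight bits per byte."""
    bits = "".join(bits.split())  # allow spaces or newlines between groups
    if len(bits) % 8:
        raise ValueError("bit string length must be a multiple of 8")
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))


raw = bits_to_bytes("0101001010000100")
print(raw)  # b'R\x84'
print(raw.decode("ascii", errors="replace"))
```

For the example in the question, the first byte (01010010) is the ASCII letter R, while the second (10000100) is outside the ASCII range, so the result is only readable as text if some other encoding is assumed.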
Dec 6, 2010
I tried to tag late onto a question similar to mine on stackoverflow (Find Non-UTF8 Filenames on Linux File System) to elicit further replies, with no luck so far, so here goes again... I have the same problem as the OP in the link above, and convmv is a great tool to fix one's own filesystem. My question is therefore academic, but I find it unsatisfactory (in fact I can't believe) that 'find' is not able to find non-standard ASCII characters.
Is there anyone out there who knows what combination of options to use to find filenames that contain non-standard characters on what seems to be a Unicode FS? In my case the characters seem to be 8-bit extended ASCII rather than Unicode; the files come from a Windows machine (iso-8859-1) and I regularly need to fetch them. I'd love to see how find and/or grep can do the same as convmv.
[Code]....
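This does not answer the question of which find or grep options would work, but the underlying check is simple: read each name as raw bytes and test whether it decodes as UTF-8. A minimal sketch in Python, with a made-up function name:

```python
import os


def non_utf8_names(root):
    """Return paths (as bytes) under `root` whose names are not valid UTF-8."""
    bad = []
    # Walking a bytes path makes os.walk yield raw, undecoded names.
    for dirpath, dirnames, filenames in os.walk(os.fsencode(root)):
        for name in dirnames + filenames:
            try:
                name.decode("utf-8")
            except UnicodeDecodeError:
                bad.append(os.path.join(dirpath, name))
    return bad
```

A name written by a Windows machine in iso-8859-1, such as one containing the single byte 0xE9 for "é", fails the UTF-8 check and is reported, while pure ASCII names always pass.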
Jul 28, 2011
I am making a text search engine. I first need to convert binary documents to text. I want a cross-platform (we develop on both Windows and Linux) command-line tool, so that I can get the output via Python's subprocess module. What are the choices for this?